Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Investigate Flakey Tests in E2E CI #40

Open
aiyengar2 opened this issue Dec 19, 2022 · 2 comments
Open

Investigate Flakey Tests in E2E CI #40

aiyengar2 opened this issue Dec 19, 2022 · 2 comments
Assignees

Comments

@aiyengar2
Copy link

The following dashboard queries always seem to be flakey in E2E CI:

db/kubernetes-compute-resources-namespace-pods/IOPS(Reads+Writes)_query0
db/kubernetes-compute-resources-namespace-pods/Current_Storage_IO_query0
db/kubernetes-compute-resources-namespace-pods/Current_Storage_IO_query1
db/kubernetes-compute-resources-namespace-pods/Current_Storage_IO_query2

db/kubernetes-compute-resources-pod/IOPS_query0
db/kubernetes-compute-resources-pod/IOPS_query1
db/kubernetes-compute-resources-pod/IOPS(Reads+Writes)_query0
db/kubernetes-compute-resources-pod/Current_Storage_IO_query0
db/kubernetes-compute-resources-pod/Current_Storage_IO_query1
db/kubernetes-compute-resources-pod/Current_Storage_IO_query2

db/kubernetes-compute-resources-project/IOPS(Reads+Writes)_query0
db/kubernetes-compute-resources-project/Current_Storage_IO_query0
db/kubernetes-compute-resources-project/Current_Storage_IO_query1
db/kubernetes-compute-resources-project/Current_Storage_IO_query2

In #39, these tests are being automatically skipped but we should ideally investigate why these dashboards tend to have no data in random runs and put in fixes for them.

@aiyengar2
Copy link
Author

In addition to fixing these flakey tests, we should also introduce a step into CI to perform a helm install of Longhorn before creating the ProjectHelmChart so that we can trigger the ProjectHelmChart to enable persistent storage for Prometheus and Grafana; this will ensure that we can validate that the Persistent Volume Metrics that are currently being skipped can be validated

@aiyengar2
Copy link
Author

We should also investigate the issues in https://github.com/rancher/prometheus-federator/actions/runs/3734091815/jobs/6335719834; this is probably related to rancher/rancher#39430

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

5 participants