Metric AI service #1322

Open. Wants to merge 70 commits into base: main.

tybalex (Contributor) commented Apr 19, 2023

This is the Python service implementation for the attached Epic.
A few concepts in this service:

  • A job: a list of namespaces in a cluster to watch, selected by the user, similar to the watchlist we already have in log anomaly detection.
  • A job run: a run of metric analysis for a job at the requested timestamp.
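To make the two concepts concrete, here is a minimal sketch of how a job and a job run could be modeled. The class names, field names, and the `make_job_run` helper are illustrative assumptions and are not taken from the PR's code.

```python
from dataclasses import dataclass
from datetime import datetime, timezone
from typing import List


@dataclass
class Job:
    """A watchlist: the namespaces of one cluster selected by the user.

    Field names are hypothetical; the PR may store jobs differently.
    """
    job_id: str
    cluster_id: str
    namespaces: List[str]


@dataclass
class JobRun:
    """One execution of metric analysis for a job at a requested timestamp."""
    job: Job
    requested_at: datetime


def make_job_run(job: Job, requested_at: datetime) -> JobRun:
    """Build a JobRun, treating naive timestamps as UTC (an assumption)."""
    if requested_at.tzinfo is None:
        requested_at = requested_at.replace(tzinfo=timezone.utc)
    return JobRun(job=job, requested_at=requested_at.astimezone(timezone.utc))
```

Keeping the job (what to watch) separate from the job run (when to analyze it) lets the same watchlist be analyzed at many timestamps without redefining it.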

What's included in this PR:

  1. HTTP server in main.py: provides a few APIs. The most important one is /run_job/{job_id}/, which contains the logic to run a job. The other APIs are used by the gateway plugin.
  2. filter_anomaly_metric.py: defines the rules to filter out normal metrics.
  3. The model dir: cnn_model.py defines the structure of the CNN model, and metric_pattern_classification.py defines the logic to train, evaluate, and predict with the model.
  4. Grafana_dashboard_utils.py: defines functions to create and delete dashboard JSON.
  5. The tests dir: unit tests.
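As a rough illustration of how items 1 and 2 could fit together, the sketch below matches the /run_job/{job_id}/ path and drops metrics that look normal before any classification step. It uses only the standard library. The z-score rule, the function names, and the in-memory `metrics_by_job` mapping are assumptions for illustration; the PR does not state which web framework or which filter rules it uses, and the CNN step is omitted here.

```python
import re
import statistics
from typing import Dict, List, Optional, Tuple

# Route pattern for /run_job/{job_id}/ as named in the PR description.
RUN_JOB_ROUTE = re.compile(r"^/run_job/(?P<job_id>[^/]+)/$")


def parse_run_job_path(path: str) -> Optional[str]:
    """Return the job_id if the path matches /run_job/{job_id}/, else None."""
    match = RUN_JOB_ROUTE.match(path)
    return match.group("job_id") if match else None


def looks_normal(values: List[float], z_threshold: float = 3.0) -> bool:
    """Hypothetical stand-in rule, not the PR's actual filter.

    A series counts as normal when no point lies more than z_threshold
    population standard deviations away from the mean.
    """
    if len(values) < 2:
        return True
    mean = statistics.fmean(values)
    stdev = statistics.pstdev(values)
    if stdev == 0:
        return True  # a perfectly flat series has no spikes
    return all(abs(v - mean) / stdev <= z_threshold for v in values)


def run_job(
    path: str, metrics_by_job: Dict[str, Dict[str, List[float]]]
) -> Tuple[int, List[str]]:
    """Return (status, metric names that survive the normal-metric filter)."""
    job_id = parse_run_job_path(path)
    if job_id is None or job_id not in metrics_by_job:
        return 404, []
    series = metrics_by_job[job_id]
    candidates = sorted(
        name for name, values in series.items() if not looks_normal(values)
    )
    return 200, candidates
```

In this arrangement only the surviving metric names would be passed on to a pattern classifier, which keeps the more expensive model step off series that show nothing unusual.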

kralicky and others added 29 commits March 2, 2023 13:08
…etrics, and it can correctly capture some spikes.
@tybalex tybalex linked an issue Apr 19, 2023 that may be closed by this pull request
7 tasks
@tybalex tybalex requested a review from kralicky April 28, 2023 03:13
tybalex (Contributor, Author) commented Apr 28, 2023

Regarding the model file, @AmartC will move it to Opni's internal S3 bucket.

2 participants