Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: initial draft of custom metric tool and its systemd timer #1963

Open
wants to merge 3 commits into
base: master
Choose a base branch
from

Conversation

nabdullindfinity
Copy link
Contributor

@nabdullindfinity nabdullindfinity commented Oct 10, 2024

To support the performance monitoring on mainnet, add a tool where custom metrics can be calculated and exported to prometheus's node_exporter through the textfile collector.

Currently, the total number of TLB shootdowns across all CPUs will be exposed as sum_tlb_shootdowns, collected once per minute, as the latest node_exporter does not allow filtering of data of its built-in interrupts collector that could otherwise do it for us (until prometheus/node_exporter#3028 is included in the release branches) and will add many metrics with high cardinality otherwise.

NODE-1445

rs/ic_os/metrics_tool/src/lib.rs Outdated Show resolved Hide resolved
rs/ic_os/metrics_tool/src/main.rs Outdated Show resolved Hide resolved
rs/ic_os/metrics_tool/src/lib.rs Show resolved Hide resolved
rs/ic_os/metrics_tool/src/main.rs Show resolved Hide resolved
rs/ic_os/metrics_tool/src/lib.rs Show resolved Hide resolved
rs/ic_os/metrics_tool/src/lib.rs Outdated Show resolved Hide resolved
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants