Integration of LibKineto CUpti Events and Metrics with PyTorch Profiler #973

Open
marceloamaral opened this issue Aug 6, 2024 · 0 comments

marceloamaral commented Aug 6, 2024

Hi team,

It looks like LibKineto can enable GPU hardware counters (events) and/or GPU metrics, but how this integrates with the PyTorch profiler is not clear to me. I can see the potential use of the CUDA metrics in kDerivedMetrics and in EventProfilerTest, and their initialization in EventProfiler::initMetrics...

So my questions are:

  • Is there any documentation on how to use LibKineto CUPTI events and metrics within the PyTorch profiler?
  • If integration with the PyTorch profiler isn’t the intended use case, what are the main applications for these metrics?
  • Can LibKineto produce Chrome traces that include GPU events and/or metrics?

I could not find examples of the PyTorch profiler using GPU counter events, which suggests potential integration challenges. Clarifying these points would really help in understanding how to leverage LibKineto’s capabilities, especially because I am trying to extend LibKineto to support metrics from a different AI hardware accelerator.
