Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Is there any document for performance benchmark result vs pytorch2.1 compile mode? #377

Open
fsword73 opened this issue May 16, 2024 · 1 comment

Comments

@fsword73
Copy link

Pytorch 2.1 compile mode supports fused kernels which updated the performance flag.
Is there any document to compare the performance with pytorch 2.1 compile mode . For example, 5 CNN models and 5 LLM models. It is better to have per layer compare.
In this way , developers can decide to move this software stack or not. Usually developers does not expect to benchmark before baseline data were available

@ghostplant
Copy link
Contributor

ghostplant commented Jun 15, 2024

Hi @fsword73. Thanks for your questions.

First, Pytorch 2.1 compile mode & Antares don't have any conflicts between each other which allows users to enable both to maximize everything, so I'm confused of the context to "move this software stack".

For benchmarking, your suggestion is great, can you share some concrete model repos that are optimized by Pytorch 2.1 compile mode? We didn't keep track on Pytorch 2.1 compile mode in the past. To avoid any unfairness of comparison, we'll use your suggested repo to do benchmarking.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants