Releases: ROCm/omniperf
Omniperf 2.1.0 for ROCm 6.2.2
Changes
- enable rocprofv1 for MI300 hardware (#391)
- refactoring and updating documentation (#362, #394, #398, #414, #420)
- branch renaming and workflow updates (#389, #404, #409)
- bug fix for analysis output
- add dependency checks on application launch (#393)
- patch for profiling multi-process/multi-GPU applications (#376, #396)
- packaging updates (#386)
- rename CHANGES to CHANGELOG.md (#410)
- rollback Grafana version in Dockerfile for Angular plugin compatibility (#416)
- enable CI triggers for Azure CI (#426)
- add GPU model distinction for MI300 systems (#423)
- new MAINTAINERS.md guide for omniperf publishing procedures (#402)
Optimizations
- reduced running time of Omniperf when profiling (#384)
- console logging improvements
rocm-6.2.2
ROCm release v6.2.2
rocm-6.2.1
ROCm release v6.2.1
rocm-6.2.0
ROCm release v6.2.0
v2.0.1 (03 June 2024)
Updates
- new option to force hardware target via OMNIPERF_ARCH_OVERRIDE global (#370)
- CI/CD support for MI300 hardware (#373)
- cmake build improvements (#374)
- support for MI308X hardware (#375)
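The OMNIPERF_ARCH_OVERRIDE entry above describes an environment variable that takes precedence over the detected hardware target. A minimal sketch of that override pattern follows. The function name `resolve_arch` and the example architecture strings are hypothetical, and these notes do not show the values Omniperf accepts or where it performs the lookup, so treat this as an illustration of the idea rather than Omniperf's implementation.

```python
import os


def resolve_arch(detected_arch, environ=None):
    """Return the forced target if OMNIPERF_ARCH_OVERRIDE is set, else the detected one.

    `resolve_arch` is a hypothetical helper for illustration only; it is not
    part of Omniperf's API.
    """
    env = os.environ if environ is None else environ
    # An unset, empty, or whitespace-only variable is treated as "no override".
    override = env.get("OMNIPERF_ARCH_OVERRIDE", "").strip()
    return override or detected_arch


if __name__ == "__main__":
    # The architecture strings below are placeholders, not a list of supported values.
    print(resolve_arch("gfx90a", {}))
    print(resolve_arch("gfx90a", {"OMNIPERF_ARCH_OVERRIDE": "gfx942"}))
```

Passing the environment as an argument keeps the helper easy to test without mutating the process environment.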
Documentation available at https://rocm.github.io/omniperf/
Associated release tarball: omniperf-v2.0.1.tar.gz
v2.0.0 (17 May 2024)
Updates
- improved logging that spans all modes (#177) (#317) (#335) (#341)
- overhauled CI/CD that spans all modes (#179)
- extensible SoC classes to support easy addition of new hardware configs (#180)
- --kernel-verbose no longer overwrites kernel names (#193)
- general cleanup and improved organization of source code (#200) (#210)
- separate requirement files for docs and testing dependencies (#205) (#262) (#358)
- add support for MI300 hardware (#231)
- upgrade Grafana assets and build script to latest release (#235)
- update minimum ROCm and Python requirements (#277)
- sort rocprofiler input files prior to profiling (#304)
- new --quiet option will suppress verbose output and show a progress bar (#308)
- roofline support for Ubuntu 22.04 (#319)
Documentation available at https://rocm.github.io/omniperf/
Associated release tarball: omniperf-v2.0.0.tar.gz
v2.0.0 RC #1 (15 March 2024)
This is the first release candidate for a forthcoming v2.0.0 release expected in March 2024. The 2.0 release provides a significant refactor of the underlying code base and introduces support for MI300.
Tech Preview documentation available at: https://amdresearch.github.io/omniperf/2.x
Associated release tarball: omniperf-2.0.0-RC1.tar.gz
v2.0.0 Tech Preview #1 (03 March 2024)
This is a tech preview release for a forthcoming v2.0.0 release expected in March 2024. The 2.0 release provides a significant refactor of the underlying code base and introduces support for MI300.
Tech Preview documentation available at: https://rocm.github.io/omniperf/2.x
Associated release tarball: omniperf-2.0.0-Tech-Preview1.tar.gz
v1.1.0-PR1 (13 October 2023)
Updates
- standardize headers to use 'avg' instead of 'mean'
- add color-coded thresholds to standalone GUI to match Grafana
- modify kernel name shortener to use cpp_filt (#168)
- enable stochastic kernel dispatch selection (#183)
- patch grafana plugin module to address a known issue in the latest version (#186)
- enhanced communication between analyze mode kernel flags (#187)
Documentation available at https://rocm.github.io/omniperf/1.x/
Associated release tarball: omniperf-v1.1.0-PR1.tar.gz
v1.0.10 (22 August 2023)
Updates
- critical patch for detection of LLVM in ROCm installs on SLURM systems
Documentation available at https://rocm.github.io/omniperf/1.x/
Associated release tarball: omniperf-v1.0.10.tar.gz