Releases: ROCm/hipBLASLt
Releases · ROCm/hipBLASLt
hipBLASLt 0.8.0 for ROCm 6.2.2
hipBLASLt code for ROCm 6.2.2 did not change. The library was rebuilt for the updated ROCm 6.2.2 stack.
hipBLASLt 0.8.0 for ROCm 6.2.1
hipBLASLt code for ROCm 6.2.1 did not change. The library was rebuilt for the updated ROCm 6.2.1 stack.
hipBLASLt 0.8.0 for ROCm 6.2.0
hipBLASLt code for ROCm 6.2.0 did not change. The library was rebuilt for the updated ROCm 6.2.0 stack.
hipBLASLt 0.7.0 for ROCm 6.1.2
hipBLASLt code for ROCm 6.1.2 did not change. The library was rebuilt for the updated ROCm 6.1.2 stack.
hipBLASLt 0.7.0 for ROCm 6.1.1
hipBLASLt code for ROCm 6.1.1 did not change. The library was rebuilt for the updated ROCm 6.1.1 stack.
hipBLASLt 0.7.0 for ROCm 6.1.0
Additions
- Added
hipblasltExtSoftmax
extension API - Added
hipblasltExtLayerNorm
extension API - Added
hipblasltExtAMax
extension API - Added
GemmTuning
extension parameter to set split-k by user - Support for mix precision datatype: fp16/fp8 in with fp16 out
hipBLASLt 0.6.0 for ROCm 6.0.2
Added
- Add UserArguments for GroupedGemm
- Support datatype: fp16 in with fp32 out
- Add samples
- Support datatype: Int8 in Int32 out
- Support platform gfx94x
- Support fp8/bf8 datatype (only for gfx94x platform)
- Support Scalar A,B,C,D for fp8/bf8 datatype
Changed
- Replace hipblasDatatype_t with hipDataType
- Replace hipblasLtComputeType_t with hipblasComputeType_t
- Deprecate HIPBLASLT_MATMUL_DESC_D_SCALE_VECTOR_POINTER
hipBLASLt 0.6.0 for ROCm 6.0.0
Added
- Add UserArguments for GroupedGemm
- Support datatype: fp16 in with fp32 out
- Add samples
- Support datatype: Int8 in Int32 out
- Support platform gfx94x
- Support fp8/bf8 datatype (only for gfx94x platform)
- Support Scalar A,B,C,D for fp8/bf8 datatype
Changed
- Replace hipblasDatatype_t with hipDataType
- Replace hipblasLtComputeType_t with hipblasComputeType_t
- Deprecate HIPBLASLT_MATMUL_DESC_D_SCALE_VECTOR_POINTER
hipBLASLt 0.3.0 for ROCm 5.7.1
Added
- Add getAllAlgos extension APIs
- TensileLite support new epilogues: gradient gelu, gradient D, gradient A/B, aux
- Add sample package including three sample apps
- Add new C++ GEMM class in hipblaslt extension
Changed
- refactor GroupGemm APIs as C++ class in hipblaslt extension
- change scaleD vector enum as HIPBLASLT_MATMUL_DESC_D_SCALE_VECTOR_POINTER
Fixed
- Enable norm check validation for CI
Optimizations
- GSU kernel optimization: wider memory, PGR N
- update logic yaml to improve some FP16 NN sizes
- GroupGemm support GSU kernel
- Add grouped gemm tuning for aldebaran
hipBLASLt 0.3.0 for ROCm 5.7.0
Added
- Add getAllAlgos extension APIs
- TensileLite support new epilogues: gradient gelu, gradient D, gradient A/B, aux
- Add sample package including three sample apps
- Add new C++ GEMM class in hipblaslt extension
Changed
- refactor GroupGemm APIs as C++ class in hipblaslt extension
- change scaleD vector enum as HIPBLASLT_MATMUL_DESC_D_SCALE_VECTOR_POINTER
Fixed
- Enable norm check validation for CI
Optimizations
- GSU kernel optimization: wider memory, PGR N
- update logic yaml to improve some FP16 NN sizes
- GroupGemm support GSU kernel
- Add grouped gemm tuning for aldebaran