Skip to content
forked from ROCm/Tensile

Stretching GPU performance for GEMMs and tensor contractions.

License

Notifications You must be signed in to change notification settings

hanamizuki-ai/Tensile

 
 

Repository files navigation

Tensile is a tool for creating benchmark-driven backend libraries for GEMMs, GEMM-like problems (such as batched GEMM), and general N-dimensional tensor contractions on a GPU. The Tensile library is mainly used as backend library to rocBLAS. Tensile acts as the performance backbone for a wide variety of 'compute' applications running on AMD GPUs.

See Tensile Wiki for documentation.

About

Stretching GPU performance for GEMMs and tensor contractions.

Resources

License

Stars

Watchers

Forks

Packages

 
 
 

Languages

  • Python 49.9%
  • C++ 30.4%
  • Assembly 15.5%
  • TeX 1.4%
  • CMake 1.2%
  • Shell 1.1%
  • Other 0.5%