[GPU] Disable unaligned to instrinsic batch matmul codegen with vector distribute #18935

nirvedhmeshram · 2024-10-29T17:04:14Z

This path doesnt support all batch matmul shapes but tries to and fails
e.g. #18601

So this PR makes this change because by default we should favor higher functionality support over performance. Solution is to keep this path behind a flag which is off by default.

Fixes : #18601

If we bail out here, we will go down SIMT (note that we do anyway for non batch matmul GEMMs for such shapes) for now with Tile and Fuse pipeline support planned for the future. In the time being models who have shapes that are supported by this path can do so using the provided flag. And tuners can always use this pipeline if it works for the shape. We can also turn this on by default if we can add correct heuristics on when it is okay to use this path.

…r distribute Signed-off-by: Nirvedh <nirvedh@gmail.com>

nirvedhmeshram requested review from MaheshRavishankar, qedawkins, kuhar and Groverkss as code owners October 29, 2024 17:04

[GPU] Disable unaligned to instrinsic batch matmul codegen with vecto…

e602dde

…r distribute Signed-off-by: Nirvedh <nirvedh@gmail.com>

nirvedhmeshram force-pushed the disable_unaligned_bmm_vectordistribute branch from bde1b32 to e602dde Compare October 29, 2024 17:06

fix test

8acc5b6

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[GPU] Disable unaligned to instrinsic batch matmul codegen with vector distribute #18935

[GPU] Disable unaligned to instrinsic batch matmul codegen with vector distribute #18935

nirvedhmeshram commented Oct 29, 2024 •

edited

Loading

[GPU] Disable unaligned to instrinsic batch matmul codegen with vector distribute #18935

Are you sure you want to change the base?

[GPU] Disable unaligned to instrinsic batch matmul codegen with vector distribute #18935

Conversation

nirvedhmeshram commented Oct 29, 2024 • edited Loading

nirvedhmeshram commented Oct 29, 2024 •

edited

Loading