Add lowerings for mma, register and allocate #86
Conversation
This PR adds an mma unit test which lowers to vector.loads/stores and amdgpu.mfmas. It also supports shared memory promotion.
Signed-off-by: Harsh Menon <harsh@nod-labs.com>
emitter.emit(graph.get_root_graph())
emitter.finish()

if kwargs.get("canonicalize", False):
When do we not want to canonicalize?
You don't want to canonicalize when canonicalization would delete all of your IR (because the emitted ops have no uses). This happens in some of the other tests; you can try it out by setting canonicalize=True on them.
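For illustration, here is a minimal sketch of how that flag might gate the cleanup step in the test harness; run_test and canonicalize_module are hypothetical names, not necessarily this PR's actual helpers:

```python
def run_test(emit_kernel, canonicalize: bool = False):
    # Hypothetical harness: emit the kernel into an MLIR module for FileCheck.
    module = emit_kernel()
    # Only canonicalize when the emitted ops are anchored by real uses
    # (e.g. stores). If nothing has uses, canonicalization DCEs the whole
    # body and the CHECK lines have nothing left to match.
    if canonicalize:
        canonicalize_module(module)  # assumed wrapper around the TD sequence below
    print(module)
```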
named_sequence = transform_d.NamedSequenceOp(
    "__transform_main", [any_op_t()], []
)
with InsertionPoint(named_sequence.body):
Just curious, is this the best way to set up canonicalization, by building a TD (transform dialect) structure to drive it? I'd assume we'd have better API support from upstream for invoking canonicalization and/or other standard patterns.
Good question, and I was thinking about this myself. I think we can avoid TD by invoking the patterns another way, but there are some advantages to using TD for now (it also supports other transforms like LICM, etc.). If we find we don't need that, we can always rewrite this without TD.
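For reference, a minimal sketch of what the TD-driven cleanup could look like, building on the snippet above. The builder names (ApplyPatternsOp, apply_patterns_canonicalization, transform_interpreter.apply_named_sequence) and the payload_module / transform_root_module operands are my reading of the upstream MLIR Python bindings, so treat them as assumptions rather than this PR's exact code:

```python
named_sequence = transform_d.NamedSequenceOp(
    "__transform_main", [any_op_t()], []
)
with InsertionPoint(named_sequence.body):
    target = named_sequence.body.arguments[0]
    # Collect the standard pattern sets to run over the payload op.
    apply_patterns = transform_d.ApplyPatternsOp(target)
    with InsertionPoint(apply_patterns.regions[0].blocks[0]):
        # Canonicalization patterns; CSE/LICM could be added alongside later.
        transform_d.apply_patterns_canonicalization()
    transform_d.YieldOp([target])

# Run __transform_main on the payload with the transform-dialect interpreter.
transform_interpreter.apply_named_sequence(
    payload_module, named_sequence, transform_root_module
)
```

The upside of this shape is that adding other standard transforms (CSE, LICM, and so on) is just another op inside the sequence, which is the advantage mentioned above.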
I will need to look at indexing more later, but let's merge it for now so we can make progress.
raise CodegenError("No hardware constraints found.")

result = None
for constraint in hardware_constraints:
We probably should validate len(hardware_constraints) == 1 and get rid of this loop.
Sure, will add this to the follow-on PR.
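For reference, the follow-on validation could look roughly like this; HardwareConstraint and the surrounding variable names are assumptions based on the snippet above, not this PR's exact code:

```python
hardware_constraints = [
    c for c in constraints if isinstance(c, HardwareConstraint)
]
if not hardware_constraints:
    raise CodegenError("No hardware constraints found.")
if len(hardware_constraints) > 1:
    raise CodegenError(
        f"Expected a single hardware constraint, found {len(hardware_constraints)}."
    )
# With exactly one constraint guaranteed, the loop collapses to a direct use.
hardware_constraint = hardware_constraints[0]
```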