Add indexing to nodes #59

harsh-nod · 2024-07-31T00:34:11Z

This primary purpose of this PR is to annotate nodes with their access patterns based on the workgroup, tiling and MMA constraints. This is accomplished prior to expansion and propagates through expansion to the expanded nodes.

raikonenfnu · 2024-07-31T18:43:58Z

shark_turbine/kernel/ops/wave_ops.py

+
+    @property
+    def acc_index(self) -> list[IndexSequence]:
+        operand_map = {tkl.sym.MMA_LHS: 0, tkl.sym.MMA_RHS: 0, tkl.sym.MMA_ACC: 1}


Since we are subbing IndexExpr.sub by an operand_map, would the symbolic expression already have the tkl.sym.MMA_LHS, tkl.sym.MMA_RHS, and/or tkl.sym.MMA_ACC as part of the expression?

Yes, so this what I was talking about in the meeting. For each operator, we specify a dimension-specific index. So for MMA, a separate index of M, N and K. Rather than partition these dimensional indices further into operand specific indices, I have them represented as a piecewise function where the conditions depend on MMA_{LHS/RHS/ACC} (In the current PR, just ACC). So in order to extract the operand specific parts, we just substitute the appropriate values as above. The advantage of this piecewise function approach is it allows you to see where the dimensional mapping bifurcates and for which operands and allows you to reason about "layout changes". (For example, you could ask questions like - what setting of LHS, RHS and ACC would make the indices of MMA_0 be the same as that of the LHS of MMA1?)

shark_turbine/kernel/wave/distribution_symbols.py

shark_turbine/kernel/wave/indexing.py

raikonenfnu · 2024-07-31T18:53:48Z

shark_turbine/kernel/wave/expansion.py

+                elif dim == constraint.dim:
+                    custom.index[dim] += constraint.apply()
+        if custom.index:
+            setattr(custom.fx_node, "index", custom.index)


Does this mean during handle_op, we cshould be able to just access this "index" attribute? do we still need to take the size from elem_per_thread, or would it be handled somewhere here as well?

Yes, you should be able to just access the index attribute, but you will have to check if it is None. You will still need to get the size from elem_per_thread. The index sequence tells you how many you can load but that could be different from how many the user has requested to load.

raikonenfnu · 2024-07-31T18:55:24Z

shark_turbine/kernel/wave/expansion.py

+                    continue
+                if not custom.index:
+                    custom.index = {
+                        dim: IndexSequence(0, 1) for dim in custom.indexing_dims


Do we expect the handling of the not custom index and loop over custom.indexing_dims in this for loop? would it not make sense to handle this outside of for dim in custom.indexing_dims:?

My intention here was to only set the index attribute on operators that were affected by the constraints. The reason for this is to distinguish between the following 3 scenarios: an operator that has no index, an operator that has an index but the index is None and an operator that has an index that is not None. I dont think we need to distinguish between the first 2 options - so we can always set the index, but the index could be None. So with that I could move the initialization of custom_index outside the loop.

shark_turbine/kernel/wave/wave.py

raikonenfnu

LGTM!

shark_turbine/kernel/wave/distribution_symbols.py

This primary purpose of this PR is to annotate nodes with their access patterns based on the workgroup, tiling and MMA constraints. This is accomplished prior to expansion and propagates through expansion to the expanded nodes. Signed-off-by: Harsh Menon <harsh@nod-labs.com>

- Removed add operation on index sequences - General cleanup / refactor Signed-off-by: Harsh Menon <harsh@nod-labs.com>

raikonenfnu reviewed Jul 31, 2024

View reviewed changes

shark_turbine/kernel/wave/distribution_symbols.py Show resolved Hide resolved

raikonenfnu reviewed Jul 31, 2024

View reviewed changes

shark_turbine/kernel/wave/indexing.py Outdated Show resolved Hide resolved

raikonenfnu reviewed Jul 31, 2024

View reviewed changes

shark_turbine/kernel/wave/wave.py Show resolved Hide resolved

harsh-nod force-pushed the indexing branch from 5209297 to 565bb7e Compare July 31, 2024 21:25

harsh-nod requested review from Hardcode84 and martin-luecke July 31, 2024 21:30

raikonenfnu approved these changes Jul 31, 2024

View reviewed changes

Hardcode84 reviewed Jul 31, 2024

View reviewed changes

shark_turbine/kernel/wave/distribution_symbols.py Show resolved Hide resolved

Hardcode84 approved these changes Jul 31, 2024

View reviewed changes

harsh-nod added 2 commits July 31, 2024 16:19

Update based on Stan's comments

9982ca0

- Removed add operation on index sequences - General cleanup / refactor Signed-off-by: Harsh Menon <harsh@nod-labs.com>

harsh-nod force-pushed the indexing branch from 565bb7e to 9982ca0 Compare July 31, 2024 23:19

harsh-nod merged commit b094185 into iree-org:main Aug 1, 2024
5 of 7 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add indexing to nodes #59

Add indexing to nodes #59

harsh-nod commented Jul 31, 2024

raikonenfnu Jul 31, 2024

harsh-nod Jul 31, 2024

raikonenfnu Jul 31, 2024

harsh-nod Jul 31, 2024

raikonenfnu Jul 31, 2024

harsh-nod Jul 31, 2024

raikonenfnu left a comment

Add indexing to nodes #59

Add indexing to nodes #59

Conversation

harsh-nod commented Jul 31, 2024

raikonenfnu Jul 31, 2024

Choose a reason for hiding this comment

harsh-nod Jul 31, 2024

Choose a reason for hiding this comment

raikonenfnu Jul 31, 2024

Choose a reason for hiding this comment

harsh-nod Jul 31, 2024

Choose a reason for hiding this comment

raikonenfnu Jul 31, 2024

Choose a reason for hiding this comment

harsh-nod Jul 31, 2024

Choose a reason for hiding this comment

raikonenfnu left a comment

Choose a reason for hiding this comment