[TKW] Fix types, shapes and propagate resolved indexing #177
Conversation
Force-pushed from 25c6d23 to e03e3ce
Currently broken on igemm because the ExtractSlice used in the iGEMM semantics needs to follow upstream. We need to implement an extract/indexing op for the semantics required in local thread reduction.
Force-pushed from d3f6edb to ab70a52
Signed-off-by: Stanley Winata <stanley.winata@amd.com>
Force-pushed from ab70a52 to 08d43e0
Signed-off-by: Stanley Winata <stanley.winata@amd.com>
iree/turbine/kernel/ops/wave_ops.py (outdated)

# Typically only the fastest dim has a non-unit size,
# but if all dims are unit-sized, pick the fastest/last one.
all_unit_dims = lambda index: all(x.size == 1 for x in index.values())
Do we need this check? Can we merge this with the non_unit_dim check? If len(non_unit_dim) == 0, use the fastest dim.
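A minimal sketch of the merge the reviewer suggests, folding the all-unit-dims check into the non-unit-dim pass. The helper name `select_reduction_dim` and the `IndexSeq` stand-in are hypothetical, assuming only that each index value exposes a `.size` attribute, as the lambda above implies:

```python
from collections import namedtuple

# Hypothetical stand-in for an index-sequence entry; only .size matters here.
IndexSeq = namedtuple("IndexSeq", ["size"])


def select_reduction_dim(index: dict) -> str:
    # Collect dims with non-unit size in one pass; when there are none
    # (all unit dims), fall back to the fastest/last dim, so no separate
    # all_unit_dims check is needed.
    non_unit_dims = [dim for dim, seq in index.items() if seq.size != 1]
    if len(non_unit_dims) == 0:
        return list(index)[-1]
    return non_unit_dims[-1]


select_reduction_dim({"M": IndexSeq(1), "K": IndexSeq(4)})   # -> "K"
select_reduction_dim({"M": IndexSeq(1), "N": IndexSeq(1)})   # -> "N" (fastest dim)
```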
done! I think :)
This looks pretty good. Just a few comments.
lgtm! just some minor nits
This PR adds support for accumulating on non-induction variables. For the most part our stack already supports this, but we need to fix the reduction's symbolic shape, which used to be just the input shape, to be the input shape minus the reduction dim. Without this, our thread shape analysis cannot handle broadcasting of the reduced op properly.
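The shape fix described above can be sketched as follows. This is an illustration only, not the PR's actual code; the helper name `reduction_symbolic_shape` is hypothetical, and symbolic dims are modeled as plain strings:

```python
def reduction_symbolic_shape(input_shape: tuple, reduction_dim: str) -> tuple:
    # The reduction's symbolic shape is the input shape minus the
    # reduction dim, rather than the input shape itself.
    return tuple(dim for dim in input_shape if dim != reduction_dim)


reduction_symbolic_shape(("M", "N", "K"), "K")  # -> ("M", "N")
```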
To support the above, we also introduce an Extract op, in addition to the existing ExtractSlice, to fix the shape types. Specifically, ExtractSlice follows upstream semantics, where we only slice and do not reduce any dimensions. For the ReduceOp case, specifically the local reduction, we want an op with reducing semantics on the fastest dimension. Additionally, we add propagation of our thread-shape resolution. This helps in the test we added, which is a broadcast-sub followed by an exp2.
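The distinction between the two ops can be illustrated with a NumPy analogy (illustration only; the actual ops operate on TKW registers, not arrays): ExtractSlice-style slicing preserves the rank, while Extract-style indexing reduces the fastest dimension:

```python
import numpy as np

a = np.arange(8).reshape(2, 4)

# ExtractSlice-like semantics (upstream): slice, no dimension is reduced.
slice_out = a[:, 3:4]
print(slice_out.shape)    # (2, 1) -- rank preserved

# Extract-like semantics (local reduction): index the fastest dim,
# reducing it away.
extract_out = a[:, 3]
print(extract_out.shape)  # (2,) -- rank reduced
```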