[aievec] Add new shuffle ops #1516

jsetoain · 2024-05-28T12:31:46Z

This replaces the old aievec.shuffle op with a new one with a better syntax that supports all cases supported by the intrinsics, and has strong type guarantees.

For legacy purposes, we leave the old shuffle instruction renamed as aievec.legacyshuffle.

jamestcl-amd · 2024-05-28T21:24:12Z

lib/Targets/AIEVecToCpp/TranslateAIEVecToCpp.cpp

+    os << emitter.getOrCreateName(rhs);
+    os << ", ";
+  }
+  os << "eShuffleMode::shuffle_T" << &stringifyEnum(mode).data()[1];


Not sure if we can do substr(1) here, which is easier to understand...

Yup! You caught me being lazy 😂

jamestcl-amd · 2024-05-28T21:36:00Z

lib/Dialect/AIEVec/IR/AIEVecOps.cpp

+    return emitError() << "shuffle mode '" << stringifyEnum(mode)
+                       << "' requires vectors of " << modeBitWidth
+                       << "-bit elements";
+


Should we also check if the operands and results are in 512-bit vectors?

That's being checked by the type constraints in the op definition, this exclusively verifies whether the shuffle mode is "compatible" with the element type and number of operands. It's up for debate whether we want to impose this restriction or not, since the hardware is agnostic to the mode (it just shuffles bytes around), but I'd rather be conservative for the time being.

jamestcl-amd · 2024-05-28T21:42:25Z

test/aievec/conv2d_i8_after_polygeist.mlir

@@ -38,7 +38,7 @@ module attributes {dlti.dl_spec = #dlti.dl_spec<#dlti.dl_entry<"dlti.endianness"
 //      CHECK:    %[[C16:.*]] = arith.constant 16 : index
 //      CHECK:    %[[C0:.*]] = arith.constant 0 : index
 //      CHECK:    %[[T0:.*]] = aievec.upd %[[A1]][%[[C0:.*]]] {index = 0 : i8, offset = 0 : i32} : memref<?xi8>, vector<64xi8>
-//      CHECK:    %[[T1:.*]] = aievec.shuffle %[[T0:.*]] {mode = 0 : i32} : vector<64xi8>, vector<64xi8>
+//      CHECK:    %[[T1:.*]] = aievec.legacyshuffle %[[T0:.*]] {mode = 0 : i32} : vector<64xi8>, vector<64xi8>


Given the mode=0 (shuffle_T8_64x2_lo), I'm a bit confused why this can work without a second operand...

"Shuffle" always takes two arguments. When translating to C++, the single operand "shuffle" C++ intrinsic calls a two operand version with "undef" in the rhs. For the new shuffle, we will have to inject the undef when lowering to LLVM IR. With the new constraints, I'm just trying to keep things semantically consistent.

jamestcl-amd · 2024-05-28T21:43:03Z

test/aievec/conv2d_uij_i8_noinit_aie-ml.mlir

@@ -80,7 +80,7 @@ func.func @conv2d (%A: memref<18x288xi8>, %B: memref<48xi8>, %C: memref<16x256xi
 //      CHECK:    %[[C8:.*]] = arith.constant 8 : i32
 //      CHECK:    %[[C0:.*]] = arith.constant 0 : index
 //      CHECK:    %[[T0:.*]] = aievec.upd %[[A1]][%[[C0]]] {index = 0 : i8, offset = 0 : i32} : memref<48xi8>, vector<64xi8>
-//      CHECK:    %[[T1:.*]] = aievec.shuffle %[[T0]] {mode = 0 : i32} : vector<64xi8>, vector<64xi8>
+//      CHECK:    %[[T1:.*]] = aievec.legacyshuffle %[[T0]] {mode = 0 : i32} : vector<64xi8>, vector<64xi8>


And this one too...

jamestcl-amd · 2024-05-28T21:46:45Z

include/aie/Dialect/AIEVec/IR/AIEVecOps.td

+                 AIEVec_ShuffleModeAttr:$mode)>,
+  Results<(outs AnyVector:$result)> {
+  let summary = "AIE2 shuffle";
+  let description = [{


Thanks for the nice and clear explanation!

david-vc

Very nice work! It's great to see the symbolic mode in the textual representation.

include/aie/Dialect/AIEVec/IR/AIEVecOps.td

david-vc · 2024-05-29T00:37:44Z

include/aie/Dialect/AIEVec/IR/AIEVecOps.td

+
+         Shuffle Mode       | Operands           | Types Supported
+        :------------------:|:------------------:|:------------------:
+         t8_8x4             | `lhs`              | `vector<2x32xi8>`


The entries in the 'Types Supported' seem inconsistent.
I would expect to see only 1D vectors. So for the entry here, I would expect vector<64xi8>.

This is something I'd like to discuss. I stumbled upon this "issue" when looking over each of the shuffle modes. For some of them, like this one, it performs the transposition on the two halves of the vector, as if we had two (in this case) 8x4 matrices. I'm inclined to treat it as a flat 64xi8 and this would be yet another set of "special" cases. The ISA is not particularly semantically consistent. It does what it does, except when it doesn't 😉

david-vc · 2024-05-29T00:42:16Z

include/aie/Dialect/AIEVec/IR/AIEVecOps.td

+         t8_64x2_hi         | ^                  |  ^
+         t8_2x64_lo         | ^                  |  ^
+         t8_2x64_hi         | ^                  |  ^
+         t16_4x2            | `lhs`              | `vector<4x8xi16>` or `vector<4x8xbf16>`


the 3rd column should read vector<8xi16> or vector<8xbf16>.

If anything, vector<32xi16>/vector<32xbf16>. See above.

This replaces the old `aievec.shuffle` op with a new one with a better syntax that supports all cases supported by the intrinsics, and has strong type guarantees. For legacy purpose, we leave the old shuffle instruction renamed as `aievec.legacyshuffle`.

jsetoain requested review from david-vc, jamestcl-amd and muradq-amd May 28, 2024 12:31

jsetoain requested a review from makslevental as a code owner May 28, 2024 12:31

jsetoain force-pushed the add-aievec-shuffle-ops branch from c02363e to 362ad18 Compare May 28, 2024 14:35

jamestcl-amd reviewed May 28, 2024

View reviewed changes

david-vc reviewed May 29, 2024

View reviewed changes

jsetoain force-pushed the add-aievec-shuffle-ops branch 2 times, most recently from b4b19db to 27bdc9b Compare May 29, 2024 14:27

jsetoain mentioned this pull request May 29, 2024

[aievec] Add aievec shuffle to xllvm translation #1523

Merged

jsetoain force-pushed the add-aievec-shuffle-ops branch from 65aaaf3 to 61e42bf Compare May 30, 2024 09:13

[aievec] Add new shuffle ops

871a5f4

This replaces the old `aievec.shuffle` op with a new one with a better syntax that supports all cases supported by the intrinsics, and has strong type guarantees. For legacy purpose, we leave the old shuffle instruction renamed as `aievec.legacyshuffle`.

jsetoain force-pushed the add-aievec-shuffle-ops branch from 61e42bf to 871a5f4 Compare May 30, 2024 09:14

jsetoain added this pull request to the merge queue May 30, 2024

Merged via the queue into Xilinx:main with commit 652933e May 30, 2024
51 checks passed

jsetoain deleted the add-aievec-shuffle-ops branch May 30, 2024 10:39

singagan pushed a commit that referenced this pull request Jun 5, 2024

[aievec] Add new shuffle ops (#1516)

89c8fe5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[aievec] Add new shuffle ops #1516

[aievec] Add new shuffle ops #1516

jsetoain commented May 28, 2024

jamestcl-amd May 28, 2024

jsetoain May 29, 2024

jamestcl-amd May 28, 2024

jsetoain May 28, 2024 •

edited

Loading

jamestcl-amd May 28, 2024

jsetoain May 28, 2024

jamestcl-amd May 28, 2024

jamestcl-amd May 28, 2024

david-vc left a comment

david-vc May 29, 2024

jsetoain May 29, 2024

david-vc May 29, 2024

jsetoain May 29, 2024

[aievec] Add new shuffle ops #1516

[aievec] Add new shuffle ops #1516

Conversation

jsetoain commented May 28, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jsetoain May 28, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

david-vc left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jsetoain May 28, 2024 •

edited

Loading