[AutoBump] Merge with fixes of 77d7f644 (Jun 13, needs LLVM bump) (66) #303

mgehre-amd · 2024-09-09T14:52:07Z

A PR landed when moving away from a deprecated cast function. Updated the corresponding lines to pass.

1. Support conv_transpose1d and conv_transpose3d 2. Fix bugs of convertTransposedConv func in lib/Conversion/TorchToStablehlo/Linear.cpp

as title

Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>

This addresses 7 of the model failures I'm seeing in the test suite. See [Shark-Turbine issue llvm#566](nod-ai/SHARK-ModelDev#566). Need the op ```linalg.conv_2d_ngchw_gfchw_q``` to be added upstream before merging this. See [llvm-project PR #92136 ](llvm/llvm-project#92136). A small additional expansion to operand quantization is included in this patch to address a model failure that occurs when unblocking the quantized group convolutions in one of these onnx models.

@kuhar

@kuhar mentioned in the previous PR that we should use ld.lld. I kept using ld because for my LLD version, it worked. After updating to a new LLD version, that became necessary.

fixes nod-ai/SHARK-ModelDev#653 --------- Co-authored-by: Xida Ren <xida.ren.dev@gmail.com>

Resolving `bool` literals can result in a type change to uint8. This needs to be converted back to the expected type before returning to the wrapped `torch` operators.

This patch adds two `memref` passes to `torch-mlir-opt`, which already occur in the pass pipeline `torch-backend-to-linalg-on-tensors-backend-pipeline`. Additionally, necessary op interface external models are included to address issue llvm#3352.

Support lowering unsigned integer type to stablehlo as discussed in llvm#2184. The things I do in this PR: 1. create `setupBackendTypeConversionForStablehlo()`, `createFuncBackendTypeConversionForStablehloPass` and `createFinalizingBackendTypeConversionForStablehloPass`. 2. remove `InferTypeOpInterface` from `torch_c.to_builtin_tensor`, because it's different result type between linalg backend and stablehlo backend: ``` // linalg backend func.func @forward(%arg0: !torch.vtensor<[3],ui8>) -> tensor<3xf32> { %c = torch_c.to_builtin_tensor %arg0 : (!torch.vtensor<[3], ui8> -> tensor<3xi8> %0 = tensor.empty() : tensor<3xf32> %1 = linalg.generic {indexing_maps = [#map, #map], iterator_types = ["parallel"]} ins(%arg0 : tensor<3xi8>) outs(%0 : tensor<3xf32>) { ^bb0(%in: i8, %out: f32): %2 = arith.uitofp %in : i8 to f32 linalg.yield %2 : f32 } -> tensor<3xf32> return %1 : tensor<3xf32> } // stablehlo backend func.func @forward(%arg0: !torch.vtensor<[3],ui8>) -> tensor<3xf32> { %c = torch_c.to_builtin_tensor %arg0 : (!torch.vtensor<[3], ui8> -> tensor<3xui8> %0 = stablehlo.convert %arg0 : (tensor<3xui8> -> tensor<3xf32> return %0 : tensor<3xf32> } ``` 3. fix stablehlo and linalg's conversion

llvm#3367 and llvm#3364 introduced new dependencies, causing the [Bazel workflow](https://github.com/llvm/torch-mlir/actions/workflows/bazelBuildAndTest.yml) to fail. These need to be fixed in Bazel.

This commit also adds the Torch declaration for aten.max_unpool2d and aten.max_unpool3d op. The TorchToLinalg lowering for the same will be added in a follow-up commit. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>

This commit also fixes the average pool op' test failing for OnnxToLinalg lowering. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>

Set PyTorch and TorchVision version to nightly release 2024-05-14. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>

Inspired by PyTorch decompositions.py. See https://github.com/pytorch/pytorch/blob/ec58f1f74ebcec744d2ab90ad34abd09c1018e92/torch/_decomp/decompositions.py#L3923-L4086 Only support paddingMode=0 or 1 and interpolationMode=0 or 1

@main

Torch Dialect with symbolic shape expressions: ```ll module { func.func @main(%arg0: !torch.vtensor<[?,?,3],f32>, %arg1: !torch.vtensor<[?,?,3],f32>) -> !torch.vtensor<[?,?,3],f32> { %0 = torch.symbolic_int "s0" {min_val = 5, max_val = 10} : !torch.int %1 = torch.symbolic_int "s1" {min_val = 0, max_val = 100} : !torch.int %2 = torch.symbolic_int "s3" {min_val = 0, max_val = 50} : !torch.int torch.bind_symbolic_shape %arg0, [%0, %1], #affine_map<()[s0, s1] -> (s0, s1, 3)> : !torch.vtensor<[?,?,3],f32> torch.bind_symbolic_shape %arg1, [%0, %2], #affine_map<()[s0, s1] -> (s0, s1, 3)> : !torch.vtensor<[?,?,3],f32> %3 = torch.aten.tanh %arg0 : !torch.vtensor<[?,?,3],f32> -> !torch.vtensor<[?,?,3],f32> torch.bind_symbolic_shape %3, [%0, %1], #affine_map<()[s0, s1] -> (s0, s1, 3)> : !torch.vtensor<[?,?,3],f32> %4 = torch.aten.sigmoid %arg1 : !torch.vtensor<[?,?,3],f32> -> !torch.vtensor<[?,?,3],f32> torch.bind_symbolic_shape %4, [%0, %2], #affine_map<()[s0, s1] -> (s0, s1, 3)> : !torch.vtensor<[?,?,3],f32> %5 = torch.prim.ListConstruct %3, %3, %4 : (!torch.vtensor<[?,?,3],f32>, !torch.vtensor<[?,?,3],f32>, !torch.vtensor<[?,?,3],f32>) -> !torch.list<vtensor> %int1 = torch.constant.int 1 %6 = torch.aten.cat %5, %int1 : !torch.list<vtensor>, !torch.int -> !torch.vtensor<[?,?,3],f32> torch.bind_symbolic_shape %6, [%0, %1, %2], #affine_map<()[s0, s1, s2] -> (s0, s1 * 2 + s2, 3)> : !torch.vtensor<[?,?,3],f32> return %6 : !torch.vtensor<[?,?,3],f32> } } ``` For reference, this is the TorchDynamo exported program with symbolic shape expressions that the above Torch dialect program is imported from: ```py ExportedProgram: class GraphModule(torch.nn.Module): def forward(self, x: "f32[s0, s1, 3]", y: "f32[s0, s3, 3]"): # File: /home/sambhav.jain/workspaces/cruise/src/3p/torch-mlir/test/python/fx_importer/symbolic_shape_expr_test.py:31 in forward, code: a = torch.tanh(x) tanh: "f32[s0, s1, 3]" = torch.ops.aten.tanh.default(x); x = None # File: /home/sambhav.jain/workspaces/cruise/src/3p/torch-mlir/test/python/fx_importer/symbolic_shape_expr_test.py:32 in forward, code: b = torch.sigmoid(y) sigmoid: "f32[s0, s3, 3]" = torch.ops.aten.sigmoid.default(y); y = None # File: /home/sambhav.jain/workspaces/cruise/src/3p/torch-mlir/test/python/fx_importer/symbolic_shape_expr_test.py:33 in forward, code: return torch.cat((a, a, b), dim=1) cat: "f32[s0, 2*s1 + s3, 3]" = torch.ops.aten.cat.default([tanh, tanh, sigmoid], 1); tanh = sigmoid = None return (cat,) Graph signature: ExportGraphSignature(input_specs=[InputSpec(kind=<InputKind.USER_INPUT: 1>, arg=TensorArgument(name='x'), target=None, persistent=None), InputSpec(kind=<InputKind.USER_INPUT: 1>, arg=TensorArgument(name='y'), target=None, persistent=None)], output_specs=[OutputSpec(kind=<OutputKind.USER_OUTPUT: 1>, arg=TensorArgument(name='cat'), target=None)]) Range constraints: {s0: ValueRanges(lower=5, upper=10, is_bool=False), s1: ValueRanges(lower=0, upper=100, is_bool=False), s3: ValueRanges(lower=0, upper=50, is_bool=False)} ``` Huge credit to @stellaraccident for the inputs that helped evaluate the various design options and arrive at the representation of choice. - [x] Op definitions for symbolic_int and bind_symbolic_shape ops - [x] fx_importer updates to import range constraints + create symbolic_int ops - [x] fx_importer changes for AffineMapAttr building + adding bind_symbolic_shape ops - [x] custom printer/parser for inlined AffineMap expressions in mlir assembly - [x] Dialect lit test - [x] fx_importer python lit tests - [ ] Cleanup pass to remove these ops (can add in a follow-on)

This is needed after llvm#3372.

Supports asymmetric padding by performing a torch.nn.functional.pad on the input before performing the convolution. Signed-off-by: Suraj Sudhir <suraj.sudhir@arm.com>

Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>

…lvm#3421)

Missing types for tracing float8 types.

Linalg conversion requires mapping for f8 types

* Let `toBuiltinTensor()` reflects the original dtype of `!torch.vtensor`. * Backend handles dtype conversion themselves.

This commit adds the lowering for SequenceAt, SequenceEmpty, SequenceInsert, SequenceErase op Signed-Off By: Vivek Khandelwal<vivekkhandelwal1424@gmail.com>

Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>

Tests the basic constructs of registering a custom op and its abstract implementations (with FakeTensors) in python, going through TorchDynamo export, followed by importing the shape expressions in the Torch dialect. Also fixes the importer were previously the symbolic bind op insertion was not gated in one place.

this fixes the following issue: llvm#3418

half_pixel is also the default mode used by ONNX, see https://onnx.ai/onnx/operators/onnx__Resize.html

…995c908

…ump_to_77d7f644

rsuderman and others added 30 commits May 31, 2024 17:31

[NFC] Fix member cast change to global for landing collision (llvm#3407)

617b00b

A PR landed when moving away from a deprecated cast function. Updated the corresponding lines to pass.

[Torch]Support conv_transpose1d and conv_transpose3d (llvm#3286)

23b5305

1. Support conv_transpose1d and conv_transpose3d 2. Fix bugs of convertTransposedConv func in lib/Conversion/TorchToStablehlo/Linear.cpp

[Torch] decompose AtenLerpTensorOp (llvm#3251)

267052d

as title

[Torch] Emit rrelu and decompose it (llvm#3250)

285b087

as title

[ONNX] Add OnnxToTorch lowering for SpaceToDepth op (llvm#3393)

6382dbb

Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>

Update development.md to use ld.lld (llvm#3412)

948981a

@kuhar mentioned in the previous PR that we should use ld.lld. I kept using ld because for my LLD version, it worked. After updating to a new LLD version, that became necessary.

Fix reducesum onnx lit test to linalg lowering fails (llvm#3218)

11c3281

fixes nod-ai/SHARK-ModelDev#653 --------- Co-authored-by: Xida Ren <xida.ren.dev@gmail.com>

Add conversion operation for bool resolved_literal (llvm#3410)

0a6861b

Resolving `bool` literals can result in a type change to uint8. This needs to be converted back to the expected type before returning to the wrapped `torch` operators.

[Bazel] Fix bazel deps (llvm#3414)

89f7d24

llvm#3367 and llvm#3364 introduced new dependencies, causing the [Bazel workflow](https://github.com/llvm/torch-mlir/actions/workflows/bazelBuildAndTest.yml) to fail. These need to be fixed in Bazel.

[MLIR][Torch] Add TorchToLinalg lowering for AtenAvgPool3dOp (llvm#3030)

661be2d

This commit also fixes the average pool op' test failing for OnnxToLinalg lowering. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>

[Linalg] Promote type for compare tensor op (llvm#3416)

d59d0b6

build: manually update PyTorch version (llvm#3340)

72837fb

Set PyTorch and TorchVision version to nightly release 2024-05-14. Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>

[Stablehlo] Add lowering of GridSampler Op (llvm#3084)

431d98b

Inspired by PyTorch decompositions.py. See https://github.com/pytorch/pytorch/blob/ec58f1f74ebcec744d2ab90ad34abd09c1018e92/torch/_decomp/decompositions.py#L3923-L4086 Only support paddingMode=0 or 1 and interpolationMode=0 or 1

[Bazel] Add BuiltinDialectTdFiles dep to MLIRTorchOpsIncGen (llvm#3430)

94838ca

This is needed after llvm#3372.

[ONNX] Conv op adds support for asymmetric padding. (llvm#3426)

1c2778d

Supports asymmetric padding by performing a torch.nn.functional.pad on the input before performing the convolution. Signed-off-by: Suraj Sudhir <suraj.sudhir@arm.com>

[Onnx] Add Onnx->Torch lowering for Onnx.Shrink Op (llvm#3385)

1a9c0a3

Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>

add resize nearest mode round_prefer_floor, round_prefer_ceil, ceil (l…

f794582

…lvm#3421)

Add f8 types to fx importer (llvm#3434)

7f188eb

Missing types for tracing float8 types.

[torch] Add support for f8 types for linalg conversion (llvm#3436)

75af64f

Linalg conversion requires mapping for f8 types

[Torch] fix toBuiltinTensor() (llvm#3415)

689efc8

* Let `toBuiltinTensor()` reflects the original dtype of `!torch.vtensor`. * Backend handles dtype conversion themselves.

[ONNX] Add OnnxToTorch Lowering for Sequence Ops (llvm#3425)

d35b6b4

This commit adds the lowering for SequenceAt, SequenceEmpty, SequenceInsert, SequenceErase op Signed-Off By: Vivek Khandelwal<vivekkhandelwal1424@gmail.com>

[ONNX] Lower Onnx.Concat lowering version (llvm#3437)

5bc6264

Signed-Off By: Vivek Khandelwal <vivekkhandelwal1424@gmail.com>

[torch-mlir][sparse] re-enable all sparse tests (llvm#3444)

d77bab3

this fixes the following issue: llvm#3418

onnx.resize: Add support for coordTfMode "half_pixel" (llvm#3441)

e07a0bf

half_pixel is also the default mode used by ONNX, see https://onnx.ai/onnx/operators/onnx__Resize.html

mgehre-amd and others added 21 commits August 28, 2024 15:21

[AutoBump] Merge with 617b00b (May 31)

75c2a81

[AutoBump] Merge with fixes of 23b5305 (Jun 03)

c639e26

Update xfail

7b01213

[AutoBump] Merge with 267052d (Jun 03)

029673a

[AutoBump] Merge with fixes of 285b087 (Jun 03)

4a5fdf3

[AutoBump] Merge with 6382dbb (Jun 03)

ad1facc

[AutoBump] Merge with fixes of 8995c90 (Jun 03)

e698f4a

Merge branch 'bump_to_878ba72c' into bump_to_617b00b9

accf7f6

Merge branch 'bump_to_617b00b9' into bump_to_23b53050

ca733c5

Merge branch 'bump_to_23b53050' into bump_to_267052df

f724438

Merge branch 'bump_to_267052df' into bump_to_285b087a

a22c27c

Merge branch 'bump_to_285b087a' into bump_to_6382dbbc

0ef5530

Merge remote-tracking branch 'origin/bump_to_6382dbbc' into bump_to_8…

56770da

…995c908

Update LLVM

977b3a7

[AutoBump] Merge with 56d21cb (Jun 04)

fbb1cca

[AutoBump] Merge with fixes of 50f7103 (Jun 04)

813abc3

[AutoBump] Merge with 1a9c0a3 (Jun 07)

7c5a142

[AutoBump] Merge with fixes of f794582 (Jun 07)

077a2ee

[AutoBump] Merge with 41d04a8 (Jun 12)

23b2b30

[AutoBump] Merge with fixes of ae6f5e8 (Jun 12)

5f167e7

[AutoBump] Merge with fixes of 77d7f64 (Jun 13)

4ffe137

mgehre-amd changed the title ~~[AutoBump] Merge with fixes of 77d7f644 (Jun 13) (66)~~ [AutoBump] Merge with fixes of 77d7f644 (Jun 13, needs LLVM bump) (66) Sep 9, 2024

mgehre-amd mentioned this pull request Sep 9, 2024

[AutoBump] Merge with fixes of 52050f3f (Jun 10, requires torch bump) (67) Xilinx/llvm-project#330

Closed

mgehre-amd changed the base branch from bump_to_ae6f5e82 to feature/backport_ea1_ops September 11, 2024 11:27

mgehre-amd requested a review from cferry-AMD September 11, 2024 11:27

mgehre-amd enabled auto-merge September 11, 2024 11:27

Merge remote-tracking branch 'origin/feature/backport_ea1_ops' into b…

2b86be6

…ump_to_77d7f644

cferry-AMD approved these changes Sep 11, 2024

View reviewed changes

mgehre-amd merged commit 4670b65 into feature/backport_ea1_ops Sep 11, 2024
4 checks passed

mgehre-amd deleted the bump_to_77d7f644 branch September 11, 2024 12:08

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[AutoBump] Merge with fixes of 77d7f644 (Jun 13, needs LLVM bump) (66) #303

[AutoBump] Merge with fixes of 77d7f644 (Jun 13, needs LLVM bump) (66) #303

mgehre-amd commented Sep 9, 2024 •

edited

Loading

[AutoBump] Merge with fixes of 77d7f644 (Jun 13, needs LLVM bump) (66) #303

[AutoBump] Merge with fixes of 77d7f644 (Jun 13, needs LLVM bump) (66) #303

Conversation

mgehre-amd commented Sep 9, 2024 • edited Loading

mgehre-amd commented Sep 9, 2024 •

edited

Loading