[Codegen][CUDA] Add emulateNarrowTypes to LLVMGPU Codegen
Groverkss committed Aug 25, 2023
1 parent eba7eac commit 0520b59
Showing 1 changed file with 4 additions and 0 deletions.
4 changes: 4 additions & 0 deletions compiler/src/iree/compiler/Codegen/LLVMGPU/Passes.cpp
@@ -547,7 +547,11 @@ static void addLowerToLLVMGPUPasses(OpPassManager &pm, bool useROCM) {

pm.addNestedPass<func::FuncOp>(memref::createExpandOpsPass());
pm.addPass(memref::createExpandStridedMetadataPass());
pm.addPass(memref::createFoldMemRefAliasOpsPass());
pm.addPass(createEmulateNarrowTypePass());
pm.addPass(createLowerAffinePass());
pm.addPass(createCanonicalizerPass());
pm.addPass(createCSEPass());
// Strip out the debug info for the kernel as the CUDA driver doesn't digest
// PTX debug info well.
pm.addPass(createStripDebugInfoPass());
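
For readers unfamiliar with narrow-type emulation, the idea behind a pass like `createEmulateNarrowTypePass()` can be sketched in plain C++: sub-byte elements (for example 4-bit integers) are stored packed inside byte-sized storage, and each access becomes a byte access plus a shift and a mask. This is only a conceptual illustration, not the pass's implementation. The helper names `loadI4` and `storeI4` are hypothetical, and the low-nibble-first packing order is an assumption made for this example.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper: read the idx-th signed 4-bit element from packed bytes.
// Assumes element 2k lives in the low nibble of byte k and element 2k+1 in the
// high nibble (an illustrative layout, not necessarily the compiler's).
std::int8_t loadI4(const std::vector<std::uint8_t> &storage, std::size_t idx) {
  std::uint8_t byte = storage[idx / 2];          // widened (byte-sized) load
  unsigned shift = (idx % 2) * 4;                // bit offset inside the byte
  int nibble = (byte >> shift) & 0x0F;           // extract the 4-bit field
  return static_cast<std::int8_t>((nibble ^ 0x08) - 0x08);  // sign-extend
}

// Hypothetical helper: write a signed 4-bit value without disturbing the
// neighbouring element that shares the same byte (read-modify-write).
void storeI4(std::vector<std::uint8_t> &storage, std::size_t idx,
             std::int8_t value) {
  unsigned shift = (idx % 2) * 4;
  unsigned mask = 0x0Fu << shift;                          // bits to replace
  unsigned bits = (static_cast<unsigned>(value) & 0x0Fu) << shift;
  storage[idx / 2] =
      static_cast<std::uint8_t>((storage[idx / 2] & ~mask) | bits);
}
```

Under these assumptions, four 4-bit elements fit in two bytes, and a store touches only the nibble it targets. A real emulation pass performs an analogous rewrite on IR-level loads and stores rather than on a `std::vector`.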