[Issue]: IR explosion in ConvertTritonAMDGPUToLLVM #940

Description

@pabloantoniom

Problem Description

Passing the Gluon program attached to this issue through --convert-triton-amdgpu-to-llvm takes over two minutes to compile and produces about 14 GB of output (see Steps to Reproduce). This appears to be due to an IR explosion in the --convert-triton-amdgpu-to-llvm pass. I set arch to gfx942, but the problem seems to be arch-independent.

explode.py

Operating System

Ubuntu 22.04.5 LTS (Jammy Jellyfish)

CPU

Intel(R) Xeon(R) Platinum 8480C

GPU

AMD Instinct MI300X

ROCm Version

7.0

ROCm Component

No response

Steps to Reproduce

$ time python3 explode.py | triton-opt --convert-triton-amdgpu-to-llvm=arch=gfx942 | wc -c
14204339549

real    2m11.133s
user    2m8.865s
sys     0m9.593s
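
For anyone who wants to script the measurement (for example, to compare output size before and after a fix), here is a minimal Python sketch of the same pipeline. It streams the output in chunks so the ~14 GB is never held in memory. The helper name count_output_bytes and the chunk size are my own choices for illustration. The only triton-opt flag used is the one from the command above.

```python
import subprocess
import time


def count_output_bytes(producer_cmd, consumer_cmd, chunk_size=1 << 20):
    """Pipe producer_cmd into consumer_cmd and count the consumer's stdout.

    Returns (total_bytes, elapsed_seconds). Output is read in chunks and
    discarded, so memory use stays flat regardless of output size.
    """
    start = time.perf_counter()
    producer = subprocess.Popen(producer_cmd, stdout=subprocess.PIPE)
    consumer = subprocess.Popen(
        consumer_cmd, stdin=producer.stdout, stdout=subprocess.PIPE
    )
    # Close our copy of the pipe so the producer sees SIGPIPE if the
    # consumer exits early.
    producer.stdout.close()

    total = 0
    while True:
        chunk = consumer.stdout.read(chunk_size)
        if not chunk:
            break
        total += len(chunk)

    consumer.wait()
    producer.wait()
    return total, time.perf_counter() - start


# Hypothetical usage, mirroring the shell command in this issue:
# size, secs = count_output_bytes(
#     ["python3", "explode.py"],
#     ["triton-opt", "--convert-triton-amdgpu-to-llvm=arch=gfx942"],
# )
# print(f"{size} bytes in {secs:.1f}s")
```

This measures wall-clock time only, so it corresponds to the "real" line above and not to the user/sys figures.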

(Optional for Linux users) Output of /opt/rocm/bin/rocminfo --support

No response

Additional Information

No response

Labels

bug, good first issue
