Problem Description
In the Gluon program attached in this issue, if passed to --convert-triton-amdgpu-to-llvm, it will take several minutes to compile and will generate an output of ~14GB. This seems to be due an IR explosion generated by the --convert-triton-amdgpu-to-llvm pass. Note that I set arch to gfx942, but this seems to be arch-independent.
explode.py
Operating System
Ubuntu 22.04.5 LTS (Jammy Jellyfish)
CPU
Intel(R) Xeon(R) Platinum 8480C
GPU
AMD Instinct MI300X
ROCm Version
7.0
ROCm Component
No response
Steps to Reproduce
$ time python3 explode.py | triton-opt --convert-triton-amdgpu-to-llvm=arch=gfx942 | wc -c
14204339549
real 2m11.133s
user 2m8.865s
sys 0m9.593s
(Optional for Linux users) Output of /opt/rocm/bin/rocminfo --support
No response
Additional Information
No response
Problem Description
In the Gluon program attached in this issue, if passed to
--convert-triton-amdgpu-to-llvm, it will take several minutes to compile and will generate an output of ~14GB. This seems to be due an IR explosion generated by the--convert-triton-amdgpu-to-llvmpass. Note that I set arch to gfx942, but this seems to be arch-independent.explode.py
Operating System
Ubuntu 22.04.5 LTS (Jammy Jellyfish)
CPU
Intel(R) Xeon(R) Platinum 8480C
GPU
AMD Instinct MI300X
ROCm Version
7.0
ROCm Component
No response
Steps to Reproduce
(Optional for Linux users) Output of /opt/rocm/bin/rocminfo --support
No response
Additional Information
No response