
[CUDA] Fix QMoE int4/int8 weight prepack to always use SM80 layout #28978

Merged
tianleiwu merged 6 commits into main from tlwu/refactor_qmoe_prepack_sm
Jun 11, 2026

Address QMoE review feedback on SM80 prepack docs and checks

7cf33d3
Azure Pipelines / Linux Android Emulator QNN CI Pipeline succeeded Jun 10, 2026 in 13m 10s

Build #20260610.21 succeeded