Skip to content

[CUDA] Fix QMoE int4/int8 weight prepack to always use SM80 layout#28978

Merged
tianleiwu merged 6 commits into
mainfrom
tlwu/refactor_qmoe_prepack_sm
Jun 11, 2026
Merged

[CUDA] Fix QMoE int4/int8 weight prepack to always use SM80 layout#28978
tianleiwu merged 6 commits into
mainfrom
tlwu/refactor_qmoe_prepack_sm

Address QMoE review feedback on SM80 prepack docs and checks

7cf33d3
Select commit
Loading
Failed to load commit list.
Microsoft GitHub Policy Service / license/cla succeeded Jun 10, 2026 in 0s

All CLA requirements met.

This check verifies that the author has agreed to a CLA with Microsoft.