Skip to content

[CUDA] Fix QMoE int4/int8 weight prepack to always use SM80 layout#28978

Merged
tianleiwu merged 6 commits into
mainfrom
tlwu/refactor_qmoe_prepack_sm
Jun 11, 2026
Merged

[CUDA] Fix QMoE int4/int8 weight prepack to always use SM80 layout#28978
tianleiwu merged 6 commits into
mainfrom
tlwu/refactor_qmoe_prepack_sm

Commits

Commits on Jun 9, 2026

Commits on Jun 10, 2026