QMoE: prepack int4/int8 expert weights in PrePack hook (symmetric with MatMulNBits) #56383
