Skip to content

QMoE: prepack int4/int8 expert weights in PrePack hook (symmetric with MatMulNBits) #15596

QMoE: prepack int4/int8 expert weights in PrePack hook (symmetric with MatMulNBits)

QMoE: prepack int4/int8 expert weights in PrePack hook (symmetric with MatMulNBits) #15596