Skip to content

QMoE: prepack int4/int8 expert weights in PrePack hook (symmetric with MatMulNBits) #16165

QMoE: prepack int4/int8 expert weights in PrePack hook (symmetric with MatMulNBits)

QMoE: prepack int4/int8 expert weights in PrePack hook (symmetric with MatMulNBits) #16165