@@ -2004,7 +2004,7 @@ dsr1-fp8-b300-sglang:
 # DeepSeek-V4-Pro on B300 with sglang (non-MTP).
 # Uses nightly image with megamoe backend for high-concurrency profiles.
 dsv4-fp4-b300-sglang:
-  image: lmsysorg/sglang:nightly-dev-cu13-20260529-a8cfae0b
+  image: lmsysorg/sglang:nightly-dev-cu13-20260624-b2c8f7a2
   model: deepseek-ai/DeepSeek-V4-Pro
   model-prefix: dsv4
   runner: b300
     - "Run the PR #1891 MiniMax-M3 MXFP8 B300 Dynamo-vLLM recipe set on top of current main."
     - "Uses the vllm/vllm-openai:minimax-m3-0618-x86_64-cu130 image and the TEP4/TEP8 8k1k topologies not covered by PR #1890."
   pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1891
+
+- config-keys:
+    - dsv4-fp4-b300-sglang
+  description:
+    - "Update B300 FP4 SGLang (non-MTP) image to latest nightly: lmsysorg/sglang:nightly-dev-cu13-20260624-b2c8f7a2 (was nightly-dev-cu13-20260529-a8cfae0b)."
+  pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/XXX
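
A minimal sketch of how a reviewer might confirm that the new image tag is newer than the one it replaces. It assumes the nightly tags follow the `nightly-dev-cu13-YYYYMMDD-<short-sha>` pattern seen in this diff; the helper `parse_nightly_tag` and the `NightlyTag` type are hypothetical and not part of the repository.

```python
import re
from datetime import date, datetime
from typing import NamedTuple


class NightlyTag(NamedTuple):
    """Fields recovered from an image reference (hypothetical helper type)."""
    repo: str
    cuda: str
    build_date: date
    commit: str


# Assumed tag layout: <repo>:nightly-dev-<cuda>-<YYYYMMDD>-<short hex sha>
_TAG_RE = re.compile(
    r"^(?P<repo>[\w./-]+):nightly-dev-(?P<cuda>cu\d+)"
    r"-(?P<date>\d{8})-(?P<commit>[0-9a-f]+)$"
)


def parse_nightly_tag(image: str) -> NightlyTag:
    """Split an image reference into repo, CUDA tag, build date, and commit."""
    match = _TAG_RE.match(image)
    if match is None:
        raise ValueError(f"not a recognised nightly tag: {image!r}")
    build_date = datetime.strptime(match["date"], "%Y%m%d").date()
    return NightlyTag(match["repo"], match["cuda"], build_date, match["commit"])


old = parse_nightly_tag("lmsysorg/sglang:nightly-dev-cu13-20260529-a8cfae0b")
new = parse_nightly_tag("lmsysorg/sglang:nightly-dev-cu13-20260624-b2c8f7a2")

# The bump should stay on the same repo and CUDA build and move forward in time.
assert (new.repo, new.cuda) == (old.repo, old.cuda)
assert new.build_date > old.build_date
```

A check like this only compares the dates embedded in the tags; it does not verify that either image exists in the registry.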