Commit 64b7f21

Update B300 FP4 SGLang (non-MTP) image to latest nightly
Bumps dsv4-fp4-b300-sglang image from lmsysorg/sglang:nightly-dev-cu13-20260529-a8cfae0b to lmsysorg/sglang:nightly-dev-cu13-20260624-b2c8f7a2.
1 parent 86e7761 commit 64b7f21

2 files changed

Lines changed: 7 additions & 1 deletion

.github/configs/nvidia-master.yaml

Lines changed: 1 addition & 1 deletion
@@ -2004,7 +2004,7 @@ dsr1-fp8-b300-sglang:
 # DeepSeek-V4-Pro on B300 with sglang (non-MTP).
 # Uses nightly image with megamoe backend for high-concurrency profiles.
 dsv4-fp4-b300-sglang:
-  image: lmsysorg/sglang:nightly-dev-cu13-20260529-a8cfae0b
+  image: lmsysorg/sglang:nightly-dev-cu13-20260624-b2c8f7a2
   model: deepseek-ai/DeepSeek-V4-Pro
   model-prefix: dsv4
   runner: b300

perf-changelog.yaml

Lines changed: 6 additions & 0 deletions
@@ -4153,3 +4153,9 @@
     - "Run the PR #1891 MiniMax-M3 MXFP8 B300 Dynamo-vLLM recipe set on top of current main."
     - "Uses the vllm/vllm-openai:minimax-m3-0618-x86_64-cu130 image and the TEP4/TEP8 8k1k topologies not covered by PR #1890."
   pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/1891
+
+- config-keys:
+    - dsv4-fp4-b300-sglang
+  description:
+    - "Update B300 FP4 SGLang (non-MTP) image to latest nightly: lmsysorg/sglang:nightly-dev-cu13-20260624-b2c8f7a2 (was nightly-dev-cu13-20260529-a8cfae0b)."
+  pr-link: https://github.com/SemiAnalysisAI/InferenceX/pull/XXX
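
A reviewer may want to confirm that the new image tag really is newer than the one it replaces. The sketch below is illustrative only and is not part of this repository: `parse_nightly` and `TAG_RE` are hypothetical names, and the tag layout (`nightly-dev-cu<N>-YYYYMMDD-<8 hex chars>`) is an assumption inferred solely from the two tags in this commit, not from any documented SGLang convention.

```python
import re
from datetime import date, datetime

# Assumed tag layout, inferred only from the two tags in this commit:
#   nightly-dev-cu<N>-YYYYMMDD-<8 hex chars>
TAG_RE = re.compile(
    r"^nightly-dev-cu(?P<cuda>\d+)-(?P<day>\d{8})-(?P<sha>[0-9a-f]{8})$"
)


def parse_nightly(image: str) -> tuple[date, str]:
    """Return (build date, short commit hash) for an image reference."""
    # Split "repo/name:tag" into the repository part and the tag part.
    _, _, tag = image.partition(":")
    m = TAG_RE.match(tag)
    if m is None:
        raise ValueError(f"unrecognized nightly tag: {tag!r}")
    return datetime.strptime(m["day"], "%Y%m%d").date(), m["sha"]


old = parse_nightly("lmsysorg/sglang:nightly-dev-cu13-20260529-a8cfae0b")
new = parse_nightly("lmsysorg/sglang:nightly-dev-cu13-20260624-b2c8f7a2")

# The bump should move the build date forward, never backward.
assert new[0] > old[0], "new image is not newer than the old one"
print(f"image advanced by {(new[0] - old[0]).days} days: {old[1]} -> {new[1]}")
```

Comparing the embedded dates rather than the raw tag strings keeps the check valid if the CUDA suffix ever changes width.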
