Commit be92334
committed
[AMD] server_vllm.sh: default PREFILL/DECODE_TP_SIZE to a full node
Mirror server_sglang.sh / server_atom.sh so the bench.sh GPU count never
resolves to 0 if submit.sh did not export the per-worker TP size.1 parent 6c0e812 commit be92334
1 file changed
Lines changed: 6 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
39 | 39 | | |
40 | 40 | | |
41 | 41 | | |
| 42 | + | |
| 43 | + | |
| 44 | + | |
| 45 | + | |
| 46 | + | |
| 47 | + | |
42 | 48 | | |
43 | 49 | | |
44 | 50 | | |
| |||
0 commit comments