extension/llm/server: token-ID prompt segments for tool-use resume (V2b.1.5) #1500
mlx.yml
on: pull_request
test-mlx
/
test-mlx
1s
test-mlx-qwen35-moe
/
test-mlx-qwen35-moe
1s
test-mlx-parakeet
/
test-mlx-parakeet
1s
test-mlx-voxtral
/
test-mlx-voxtral
1s
test-mlx-voxtral-realtime
/
test-mlx-voxtral-realtime
1s
test-mlx-whisper
/
test-mlx-whisper
1s
test-mlx-stories110m
/
test-mlx-stories110m
1s
Matrix: backend-tester
Matrix: test-mlx-llm
Annotations
47 errors
|
test-mlx-llm (unsloth/Llama-3.2-1B-Instruct, llama-1b, false, 4w, macos-14-xlarge) / test-mlx-llm-llama-1b-4w
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
test-mlx-whisper / test-mlx-whisper
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
test-mlx / test-mlx
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
test-mlx-parakeet / test-mlx-parakeet
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
backend-tester (models) / test-mlx-backend-models
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
test-mlx-stories110m / test-mlx-stories110m
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
test-mlx-voxtral-realtime / test-mlx-voxtral-realtime
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
test-mlx-qwen35-moe / test-mlx-qwen35-moe
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
test-mlx-voxtral / test-mlx-voxtral
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
test-mlx-llm (unsloth/Llama-3.2-1B-Instruct, llama-1b, false, nvfp4, macos-14-xlarge) / test-mlx-llm-llama-1b-nvfp4
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
test-mlx-llm (unsloth/Llama-3.2-1B-Instruct, llama-1b, true, 4w, macos-14-xlarge) / test-mlx-llm-llama-1b-custom-4w
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
test-mlx-llm (unsloth/Qwen3-0.6B, qwen3-0.6b, false, 4w, macos-14-xlarge) / test-mlx-llm-qwen3-0.6b-4w
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
backend-tester (operators) / test-mlx-backend-operators
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
test-mlx-llm (unsloth/gemma-3-1b-it, gemma3-1b, false, nvfp4, macos-14-xlarge) / test-mlx-llm-gemma3-1b-nvfp4
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
test-mlx-llm (unsloth/Qwen3-0.6B, qwen3-0.6b, true, nvfp4, macos-14-xlarge) / test-mlx-llm-qwen3-0.6b-custom-nvfp4
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
test-mlx-llm (google/gemma-4-E2B-it, gemma4-e2b, true, 4w, macos-15-xlarge) / test-mlx-llm-gemma4-e2b-custom-4w
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
test-mlx-llm (unsloth/gemma-3-1b-it, gemma3-1b, true, 4w, macos-14-xlarge) / test-mlx-llm-gemma3-1b-custom-4w
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
test-mlx-llm (unsloth/gemma-3-1b-it, gemma3-1b, false, 4w, macos-14-xlarge) / test-mlx-llm-gemma3-1b-4w
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
test-mlx-llm (unsloth/Qwen3-0.6B, qwen3-0.6b, true, 4w, macos-14-xlarge) / test-mlx-llm-qwen3-0.6b-custom-4w
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
test-mlx-llm (unsloth/Llama-3.2-1B-Instruct, llama-1b, true, nvfp4, macos-14-xlarge) / test-mlx-llm-llama-1b-custom-nvfp4
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
test-mlx-llm (unsloth/gemma-3-1b-it, gemma3-1b, true, nvfp4, macos-14-xlarge) / test-mlx-llm-gemma3-1b-custom-nvfp4
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
test-mlx-llm (unsloth/Qwen3-0.6B, qwen3-0.6b, false, nvfp4, macos-14-xlarge) / test-mlx-llm-qwen3-0.6b-nvfp4
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
test-mlx-llm (google/gemma-4-E2B-it, gemma4-e2b, false, 4w, macos-15-xlarge) / test-mlx-llm-gemma4-e2b-4w
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
MLX
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
MLX
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
MLX
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
MLX
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
MLX
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
MLX
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
MLX
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
MLX
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
MLX
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
MLX
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
MLX
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
MLX
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
MLX
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
MLX
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
MLX
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
MLX
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
MLX
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
MLX
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
MLX
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
MLX
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
MLX
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
MLX
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
MLX
Canceling since a higher priority waiting request for MLX-20161-false exists
|
|
MLX
Canceling since a higher priority waiting request for MLX-20161-false exists
|