Self-hosted runner scale set (AMD mi325 scheduled CI caller) #4

Triggered via workflow run October 26, 2025 02:47
@lkhllkhl
completed 354567d
Status: Failure
Total duration: 24s
Artifacts
Matrix: DeepSpeed CI / Check Runners
Matrix: Example CI / Check Runners
Matrix: Model CI / Check Runners
Matrix: Torch pipeline CI / Check Runners
Matrix: DeepSpeed CI / Setup
Matrix: DeepSpeed CI / Examples directory
Matrix: DeepSpeed CI / PyTorch pipelines
Matrix: DeepSpeed CI / Torch ROCm deepspeed tests
Matrix: Example CI / Setup
Matrix: Example CI / Examples directory
Matrix: Example CI / PyTorch pipelines
Matrix: Example CI / Torch ROCm deepspeed tests
Matrix: Model CI / Setup
Matrix: Model CI / Examples directory
Matrix: Model CI / PyTorch pipelines
Matrix: Model CI / Torch ROCm deepspeed tests
Matrix: Torch pipeline CI / Setup
Matrix: Torch pipeline CI / Examples directory
Matrix: Torch pipeline CI / PyTorch pipelines
Matrix: Torch pipeline CI / Torch ROCm deepspeed tests
Matrix: DeepSpeed CI / Single GPU tests (waiting for pending jobs)
Matrix: Example CI / Single GPU tests (waiting for pending jobs)
Matrix: Model CI / Single GPU tests (waiting for pending jobs)
Matrix: Torch pipeline CI / Single GPU tests (waiting for pending jobs)
DeepSpeed CI / Slack Report / Send results to webhook: 20s
Example CI / Slack Report / Send results to webhook: 16s
Model CI / Slack Report / Send results to webhook: 18s
Torch pipeline CI / Slack Report / Send results to webhook: 14s

Annotations

12 errors
Model CI / Check Runners (2gpu): Required runner group 'amd-mi325-2gpu' not found
Example CI / Check Runners (1gpu): The strategy configuration was canceled because "example-ci.check_runners._2gpu" failed
DeepSpeed CI / Check Runners (1gpu): Required runner group 'amd-mi325-1gpu' not found
Example CI / Check Runners (2gpu): Required runner group 'amd-mi325-2gpu' not found
DeepSpeed CI / Check Runners (2gpu): The strategy configuration was canceled because "deepspeed-ci.check_runners._1gpu" failed
Torch pipeline CI / Check Runners (2gpu): Required runner group 'amd-mi325-2gpu' not found
Model CI / Check Runners (1gpu): The strategy configuration was canceled because "model-ci.check_runners._2gpu" failed
Torch pipeline CI / Check Runners (1gpu): Required runner group 'amd-mi325-1gpu' not found
Torch pipeline CI / Slack Report / Send results to webhook: Process completed with exit code 1.
Example CI / Slack Report / Send results to webhook: Process completed with exit code 1.
Model CI / Slack Report / Send results to webhook: Process completed with exit code 1.
DeepSpeed CI / Slack Report / Send results to webhook: Process completed with exit code 1.
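
The 12 annotations fall into three message shapes: a runner group that was not found, a matrix job canceled because a sibling failed, and a webhook step that exited non-zero. A minimal Python sketch of that triage follows, assuming the annotations have been copied by hand into (job, message) pairs. The category names and helper functions are hypothetical and are not part of GitHub Actions.

```python
import re
from collections import Counter, defaultdict

# Annotations copied from this run as (job, message) pairs.
ANNOTATIONS = [
    ("Model CI / Check Runners (2gpu)",
     "Required runner group 'amd-mi325-2gpu' not found"),
    ("Example CI / Check Runners (1gpu)",
     'The strategy configuration was canceled because "example-ci.check_runners._2gpu" failed'),
    ("DeepSpeed CI / Check Runners (1gpu)",
     "Required runner group 'amd-mi325-1gpu' not found"),
    ("Example CI / Check Runners (2gpu)",
     "Required runner group 'amd-mi325-2gpu' not found"),
    ("DeepSpeed CI / Check Runners (2gpu)",
     'The strategy configuration was canceled because "deepspeed-ci.check_runners._1gpu" failed'),
    ("Torch pipeline CI / Check Runners (2gpu)",
     "Required runner group 'amd-mi325-2gpu' not found"),
    ("Model CI / Check Runners (1gpu)",
     'The strategy configuration was canceled because "model-ci.check_runners._2gpu" failed'),
    ("Torch pipeline CI / Check Runners (1gpu)",
     "Required runner group 'amd-mi325-1gpu' not found"),
    ("Torch pipeline CI / Slack Report / Send results to webhook",
     "Process completed with exit code 1."),
    ("Example CI / Slack Report / Send results to webhook",
     "Process completed with exit code 1."),
    ("Model CI / Slack Report / Send results to webhook",
     "Process completed with exit code 1."),
    ("DeepSpeed CI / Slack Report / Send results to webhook",
     "Process completed with exit code 1."),
]

MISSING_GROUP = re.compile(r"Required runner group '([^']+)' not found")


def classify(message):
    """Map an annotation message to a hypothetical triage category."""
    if MISSING_GROUP.search(message):
        return "missing_runner_group"
    if message.startswith("The strategy configuration was canceled"):
        return "canceled_by_sibling"
    if message.startswith("Process completed with exit code"):
        return "nonzero_exit"
    return "other"


def jobs_by_missing_group(annotations):
    """Group job names by the runner group their annotation says is missing."""
    groups = defaultdict(list)
    for job, message in annotations:
        match = MISSING_GROUP.search(message)
        if match:
            groups[match.group(1)].append(job)
    return dict(groups)


if __name__ == "__main__":
    print(Counter(classify(message) for _, message in ANNOTATIONS))
    for group, jobs in jobs_by_missing_group(ANNOTATIONS).items():
        print(group, "->", jobs)
```

Read this way, only two distinct runner group names appear in the "not found" messages ('amd-mi325-1gpu' and 'amd-mi325-2gpu'), which suggests checking those two groups first. Whether the webhook failures share that cause cannot be determined from the annotations alone.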