Skip to content

Nvidia CI

Nvidia CI #110

Triggered via schedule October 16, 2025 02:44
Status Failure
Total duration 53s
Artifacts 1
Matrix: DeepSpeed CI / Setup
Matrix: Example CI / Setup
Matrix: Model CI / Setup
Matrix: Quantization CI / Setup
Matrix: Torch pipeline CI / Setup
Matrix: Trainer/FSDP CI / Setup
Matrix: DeepSpeed CI / Examples directory
Matrix: DeepSpeed CI / PyTorch pipelines
Matrix: DeepSpeed CI / Torch CUDA extension tests
Matrix: Example CI / Examples directory
Matrix: Example CI / PyTorch pipelines
Matrix: Example CI / Torch CUDA extension tests
Matrix: Model CI / Examples directory
Matrix: Model CI / PyTorch pipelines
Matrix: Model CI / Torch CUDA extension tests
Matrix: Quantization CI / Examples directory
Matrix: Quantization CI / PyTorch pipelines
Matrix: Quantization CI / Torch CUDA extension tests
Matrix: Torch pipeline CI / Examples directory
Matrix: Torch pipeline CI / PyTorch pipelines
Matrix: Torch pipeline CI / Torch CUDA extension tests
Matrix: Trainer/FSDP CI / Examples directory
Matrix: Trainer/FSDP CI / PyTorch pipelines
Matrix: Trainer/FSDP CI / Torch CUDA extension tests
Matrix: DeepSpeed CI /
Waiting for pending jobs
Matrix: DeepSpeed CI /
Matrix: DeepSpeed CI /
Waiting for pending jobs
Matrix: Example CI /
Waiting for pending jobs
Matrix: Example CI /
Matrix: Example CI /
Waiting for pending jobs
Matrix: Model CI /
Waiting for pending jobs
Matrix: Model CI /
Matrix: Model CI /
Waiting for pending jobs
Matrix: Quantization CI /
Waiting for pending jobs
Matrix: Quantization CI /
Matrix: Quantization CI /
Waiting for pending jobs
Matrix: Torch pipeline CI /
Waiting for pending jobs
Matrix: Torch pipeline CI /
Matrix: Torch pipeline CI /
Waiting for pending jobs
Matrix: Trainer/FSDP CI /
Waiting for pending jobs
Matrix: Trainer/FSDP CI /
Matrix: Trainer/FSDP CI /
Waiting for pending jobs
DeepSpeed CI  /  Extract warnings in CI artifacts
0s
DeepSpeed CI / Extract warnings in CI artifacts
Example CI  /  Extract warnings in CI artifacts
0s
Example CI / Extract warnings in CI artifacts
Model CI  /  Extract warnings in CI artifacts
30s
Model CI / Extract warnings in CI artifacts
Quantization CI  /  Extract warnings in CI artifacts
Quantization CI / Extract warnings in CI artifacts
Torch pipeline CI  /  Extract warnings in CI artifacts
0s
Torch pipeline CI / Extract warnings in CI artifacts
Trainer/FSDP CI  /  Extract warnings in CI artifacts
Trainer/FSDP CI / Extract warnings in CI artifacts
DeepSpeed CI  /  ...  /  Send results to webhook
20s
DeepSpeed CI / Slack Report / Send results to webhook
Example CI  /  ...  /  Send results to webhook
19s
Example CI / Slack Report / Send results to webhook
Model CI  /  ...  /  Send results to webhook
15s
Model CI / Slack Report / Send results to webhook
Quantization CI  /  ...  /  Send results to webhook
15s
Quantization CI / Slack Report / Send results to webhook
Torch pipeline CI  /  ...  /  Send results to webhook
21s
Torch pipeline CI / Slack Report / Send results to webhook
Trainer/FSDP CI  /  ...  /  Send results to webhook
18s
Trainer/FSDP CI / Slack Report / Send results to webhook
DeepSpeed CI  /  ...  /  
DeepSpeed CI / Check new failures /
Example CI  /  ...  /  
Example CI / Check new failures /
Model CI  /  ...  /  
Model CI / Check new failures /
Quantization CI  /  ...  /  
Quantization CI / Check new failures /
Torch pipeline CI  /  ...  /  
Torch pipeline CI / Check new failures /
Trainer/FSDP CI  /  ...  /  
Trainer/FSDP CI / Check new failures /
Fit to window
Zoom out
Zoom in

Annotations

18 errors and 1 warning
Model CI / Setup (aws-g5-4xlarge-cache)
Required runner group 'aws-g5-4xlarge-cache' not found in an org
Torch pipeline CI / PyTorch pipelines (aws-g5-4xlarge-cache)
Required runner group 'aws-g5-4xlarge-cache' not found in an org
Model CI / Setup (aws-g5-12xlarge-cache)
Required runner group 'aws-g5-12xlarge-cache' not found in an org
Quantization CI / Setup (aws-g5-12xlarge-cache)
Required runner group 'aws-g5-12xlarge-cache' not found in an org
Trainer/FSDP CI / Setup (aws-g5-4xlarge-cache)
Required runner group 'aws-g5-4xlarge-cache' not found in an org
Example CI / Examples directory (aws-g5-4xlarge-cache)
Required runner group 'aws-g5-4xlarge-cache' not found in an org
DeepSpeed CI / Torch CUDA extension tests (aws-g5-4xlarge-cache)
Required runner group 'aws-g5-4xlarge-cache' not found in an org
Trainer/FSDP CI / Setup (aws-g5-12xlarge-cache)
Required runner group 'aws-g5-12xlarge-cache' not found in an org
Torch pipeline CI / PyTorch pipelines (aws-g5-12xlarge-cache)
Required runner group 'aws-g5-12xlarge-cache' not found in an org
Quantization CI / Setup (aws-g5-4xlarge-cache)
Required runner group 'aws-g5-4xlarge-cache' not found in an org
DeepSpeed CI / Torch CUDA extension tests (aws-g5-12xlarge-cache)
Required runner group 'aws-g5-12xlarge-cache' not found in an org
Quantization CI / Slack Report / Send results to webhook
Process completed with exit code 1.
Example CI / Slack Report / Send results to webhook
Process completed with exit code 1.
Trainer/FSDP CI / Slack Report / Send results to webhook
Process completed with exit code 1.
Torch pipeline CI / Slack Report / Send results to webhook
Process completed with exit code 1.
DeepSpeed CI / Slack Report / Send results to webhook
Process completed with exit code 1.
Model CI / Extract warnings in CI artifacts
Process completed with exit code 2.
Model CI / Slack Report / Send results to webhook
Process completed with exit code 1.
Model CI / Extract warnings in CI artifacts
No files were found with the provided path: warnings_in_ci/selected_warnings.json. No artifacts will be uploaded.

Artifacts

Produced during runtime
Name Size Digest
setup_values Expired
310 Bytes
sha256:d8e8d410730e5a01c82a0c03e8f6ad1f436f12e80558c6d65efbb3e86fe1e791