ci: backport LLM container pipeline to release 1.3.1 by kcywinski0 · Pull Request #1823 · ai-dynamo/nixl

kcywinski0 · 2026-06-24T10:56:45Z

Summary

Backports the LLM container build/test pipeline changes from #1664 ("ci: add LLM container build pipeline for vllm/sglang images") onto release/1.3.1.
Adds the missing build/test matrix files and vLLM/SGLang container test scripts needed by the Jenkins LLM container jobs.
Includes the related Dockerfile and JJB updates from the original PR.

Test plan

Not run locally; this is a clean cherry-pick of #1664 ("ci: add LLM container build pipeline for vllm/sglang images") onto release/1.3.1.
Expected validation: the Jenkins LLM container pipeline should now find .ci/jenkins/lib/build-llm-container-matrix.yaml.
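The expected validation above can also be approximated before Jenkins runs. The following is a minimal sketch, assuming only the path quoted in the test plan; the function name `matrix_file_present` is hypothetical and not part of the PR.

```python
from pathlib import Path

# Path taken from the test plan; everything else in this sketch is illustrative.
MATRIX_RELATIVE_PATH = Path(".ci/jenkins/lib/build-llm-container-matrix.yaml")


def matrix_file_present(repo_root: Path) -> bool:
    """Return True when the build matrix file exists under repo_root."""
    return (repo_root / MATRIX_RELATIVE_PATH).is_file()
```

Calling `matrix_file_present(Path.cwd())` from a checkout of release/1.3.1 with this backport applied should return `True`. It only confirms the file exists, not that Jenkins can parse it.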

Adds a new Jenkins matrix pipeline that builds the three NIXL inference container variants (vllm-nixl, sglang-nixl, sglang-cu13-nixl) from a published NIXL wheel set, for x86_64 and aarch64, and publishes multi-arch manifests. Mirrors the manual procedure documented at README.md so that release-candidate inference images can be cut without running the build steps by hand. Uses native podman for build/push/manifest operations (no docker on the build pod). Cross-arch aarch64 builds run on x86_64 hosts via QEMU.

Signed-off-by: Iaroslav Sydoruk <isydoruk@nvidia.com>
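To make the shape of that matrix concrete, here is a rough sketch of how three variants and two architectures expand into per-arch builds plus one multi-arch manifest per variant. The variant and architecture names come from the commit message; the `<variant>:<tag>-<arch>` tagging scheme and both function names are assumptions for illustration, not the pipeline's actual naming.

```python
from itertools import product
from typing import Dict, List

# Names quoted from the commit message.
VARIANTS = ("vllm-nixl", "sglang-nixl", "sglang-cu13-nixl")
ARCHES = ("x86_64", "aarch64")


def build_jobs(tag: str) -> List[Dict[str, str]]:
    """One build job per (variant, arch) pair; the image naming is hypothetical."""
    return [
        {"variant": variant, "arch": arch, "image": f"{variant}:{tag}-{arch}"}
        for variant, arch in product(VARIANTS, ARCHES)
    ]


def manifest_plan(tag: str) -> Dict[str, List[str]]:
    """Map each multi-arch manifest name to the per-arch images it would reference."""
    return {
        f"{variant}:{tag}": [f"{variant}:{tag}-{arch}" for arch in ARCHES]
        for variant in VARIANTS
    }
```

Under these assumptions a single tag yields six per-arch images and three manifests.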

copy-pr-bot · 2026-06-24T10:56:48Z

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

github-actions · 2026-06-24T10:56:55Z

👋 Hi kcywinski0! Thank you for contributing to ai-dynamo/nixl.

Your PR reviewers will review your contribution then trigger the CI to test your changes.

🚀

isdrk and others added 4 commits June 24, 2026 12:56

ci: add vllm cu12 llm container target

1477d75

ci: use latest cu129 vllm base image

4938aed

Signed-off-by: Iaroslav Sydoruk <isydoruk@nvidia.com>

container: download models

5b360cf

TinyLlama/TinyLlama-1.1B-Chat-v1.0 (~550MB): smoke/perf for both vllm and sglang

Qwen/Qwen3-8B (~16GB): accuracy tests for sglang only (vllm accuracy is a no-op)

Signed-off-by: Iaroslav Sydoruk <isydoruk@nvidia.com>

kcywinski0 requested review from a team as code owners June 24, 2026 10:56

pull-request-size Bot added the size/XXL label Jun 24, 2026

github-actions Bot added the external-contribution label Jun 24, 2026

kcywinski0 added 3 commits June 24, 2026 13:21

ci: use hf CLI for model downloads

af77735

Replace the deprecated huggingface-cli invocation so LLM container builds can prefetch models with current Hugging Face images.

ci: download HF models via Python API

b17c32d

Use huggingface_hub.snapshot_download directly so LLM container builds do not depend on deprecated or missing Hugging Face CLI entrypoints.
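As a rough illustration of the approach this commit describes, the sketch below prefetches models through `huggingface_hub.snapshot_download` instead of a CLI entrypoint. The model IDs are the two named in this PR; the `prefetch_models` helper and its injectable `download` parameter are hypothetical and are not taken from the actual change.

```python
from typing import Callable, Dict, Iterable, Optional

# Model IDs named in this PR's commit messages.
TEST_MODELS = ("TinyLlama/TinyLlama-1.1B-Chat-v1.0", "Qwen/Qwen3-8B")


def prefetch_models(
    model_ids: Iterable[str] = TEST_MODELS,
    download: Optional[Callable[..., str]] = None,
) -> Dict[str, str]:
    """Download each model snapshot and return a mapping of model ID to local path.

    `download` defaults to huggingface_hub.snapshot_download; it is a parameter
    only so the helper can be exercised without network access (an assumption
    of this sketch, not something the PR does).
    """
    if download is None:
        # Imported lazily so the module loads even where the package is absent.
        from huggingface_hub import snapshot_download

        download = snapshot_download
    return {model_id: download(repo_id=model_id) for model_id in model_ids}
```

Calling `prefetch_models()` with no arguments would fetch both models into the default Hugging Face cache, which requires network access and the `huggingface_hub` package.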

ci: ensure HF hub package before model download

0cda3ee

Install or upgrade huggingface_hub before prefetching LLM test models so container builds do not depend on base image contents.
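One way to picture this step is a small helper that upgrades the package through pip before any download code imports it. This is a sketch under assumptions: the PR may well do this in a Dockerfile or shell step instead, and `pip_upgrade_command` and `ensure_package` are hypothetical names.

```python
import subprocess
import sys
from typing import Callable, List


def pip_upgrade_command(package: str) -> List[str]:
    """Build the pip command that installs or upgrades `package` for this interpreter."""
    return [sys.executable, "-m", "pip", "install", "--upgrade", package]


def ensure_package(
    package: str = "huggingface_hub",
    run: Callable[[List[str]], object] = subprocess.check_call,
) -> List[str]:
    """Install or upgrade `package`, then return the command that was run.

    `run` is injectable so the helper can be tested without invoking pip.
    """
    command = pip_upgrade_command(package)
    run(command)
    return command
```

Invoking pip as `sys.executable -m pip` targets the same interpreter that will later import the package, which avoids depending on whichever `pip` happens to be on the base image's PATH.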

kcywinski0 wants to merge 7 commits into ai-dynamo:release/1.3.1 from kcywinski0:kcywinski0/release-1.3.1-pr-1664
