Modify TheRock manylinux Dockerfile to take an arch build arg. by tsrw2048 · Pull Request #470 · ROCm/rocm-jax

tsrw2048 · 2026-06-19T18:27:20Z

Motivation

TheRock manylinux docker image currently builds only for the gfx950 architecture. This PR has the Dockerfile take an input argument instead and builds the image with the desired architecture. If no argument is provided it defaults to gfx950, preserving existing behavior.

Changes

All changes are contained to TheRock manylinux dockerfile: docker/manylinux/Dockerfile.jax-manylinux_2_28-therock

mminutoli

LGTM

ScottTodd · 2026-06-23T17:30:09Z


 ### Container Build Arguments:
 # TheRock nightly index URL (GPU-family-specific).
 ARG THEROCK_INDEX_URL=https://rocm.nightlies.amd.com/v2/gfx950-dcgpu/


This is still using a per-family index as the default, and that index does not include packages supporting any device-* extras (pip will warn but not error when you try to install an extra that does not exist).

I'd suggest switching to https://rocm.nightlies.amd.com/whl-multi-arch/.

This is just a default which is never used. The multi-arch index is passed in through CI so we never actually use the single-arch index anymore. This shouldn't affect functionality. I can clean this up though. You are right that this shouldn't be around, even as a leftover artifact.

ScottTodd · 2026-06-23T17:32:56Z

-        "rocm[libraries,devel,device-gfx950]${THEROCK_VERSION:+==}${THEROCK_VERSION}" && \
+        "rocm[libraries,devel,device-${GFX_ARCH}]${THEROCK_VERSION:+==}${THEROCK_VERSION}" && \


There are many cases where you may want to install support for multiple devices, but this only allows a single device-. At least this can take all as the extra, but another expected scenario is device-gfx942,device-gfx950 for MI300 + MI355.

See the docs at https://github.com/ROCm/TheRock/blob/main/RELEASES.md#installing-multi-arch-rocm-python-packages.

I guess you could technically set the gfx942,device-gfx950 to install multiple? Only the first device would need to omit the device- prefix...

@ScottTodd We usually only run tests in our CI with a single arch at a time. Would this be a blocker for TheRock CI? I understand that the workaround is a little unwieldy. I am making a PR to clean up the other comments but none of those will change current functionality.

Here's some of what we do with gfx arch lists:

CI builds on pull_request and push events use amdgpu_family_info_matrix_presubmit from https://github.com/ROCm/TheRock/blob/main/build_tools/github_actions/amdgpu_family_matrix.py. Currently that means building ROCm artifacts (e.g. hipBLASLt) for with a few GPU "families": [gfx942], [gfx1100,gfx1101,gfx1102,gfx1103], [gfx1151], [gfx1200,gfx1201] and then ROCm python packages and PyTorch python packages for those all individual targets together: [gfx942,gfx1100,gfx1101,gfx1102,gfx1103,gfx1151,gfx1200,gfx1201]

CI builds can also opt-in to building additional GPU families or "all archs" using opt-in github labels (https://github.com/ROCm/TheRock/blob/main/docs/development/ci_behavior_manipulation.md#pull-request)

Release builds use the full list of supported GPU families/targets. Like the CI builds, we build ROCm artifacts for GPU families then build ROCm python packages, PyTorch python packages, JAX python packages, etc. for the combined set of targets in a single build command (e.g. gfx942,gfx1100,gfx1101,gfx1102,gfx1103,gfx1151,gfx1200,gfx1201,gfx950,gfx900,gfx906,gfx908,gfx90a,gfx1010,gfx1011,gfx1012,gfx1030,gfx1031,gfx1032,gfx1033,gfx1034,gfx1035,gfx1036,gfx1150,gfx1152,gfx1153, see https://github.com/ROCm/rockrel/actions/runs/27930498942/job/82641334022#step:14:16)

Test workflow runs will sometimes build for a single GPU target to reduce the load on the build/test runners

Practically speaking, in the CI context when we build for a specific list of targets that list will be what those packages treat as "all" (in tarballs, in python package device extras, etc.), so jobs like the pytorch and jax builds could either use an explicit list of all targets that were built or they can look up the list from the provided rocm packages / use "all".

So I don't think this would necessarily be blocking for integration with TheRock's CI/CD as device-all may be enough (that's what @erman-gurses has on ROCm/TheRock#6054 right now),

@ScottTodd I have just opened a PR here: #473 to fix this just in case.

ScottTodd · 2026-06-23T17:33:30Z

 FROM quay.io/pypa/manylinux_2_28_x86_64

 ### Container Build Arguments:
 # TheRock nightly index URL (GPU-family-specific).


The multi-arch index is not GPU-family-specific.

I will fix this.

Modify TheRock manylinux Dockerfile to take an arch build arg.

0995015

mminutoli approved these changes Jun 19, 2026

View reviewed changes

tsrw2048 merged commit 1482406 into rocm-jax-infra Jun 19, 2026
13 checks passed

ScottTodd reviewed Jun 23, 2026

View reviewed changes

ScottTodd mentioned this pull request Jun 23, 2026

feat: Add JAX release support v0.10.0 ROCm/TheRock#6054

Open

tsrw2048 mentioned this pull request Jun 23, 2026

Clean up single-arch references and device arch arg in Rock manylinux. #473

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Modify TheRock manylinux Dockerfile to take an arch build arg.#470

Modify TheRock manylinux Dockerfile to take an arch build arg.#470
tsrw2048 merged 1 commit into
rocm-jax-infrafrom
add-arch-build-arg

tsrw2048 commented Jun 19, 2026

Uh oh!

mminutoli left a comment

Uh oh!

Uh oh!

ScottTodd Jun 23, 2026

Uh oh!

tsrw2048 Jun 23, 2026

Uh oh!

ScottTodd Jun 23, 2026

Uh oh!

ScottTodd Jun 23, 2026

Uh oh!

tsrw2048 Jun 23, 2026

Uh oh!

ScottTodd Jun 23, 2026

Uh oh!

tsrw2048 Jun 23, 2026

Uh oh!

ScottTodd Jun 23, 2026

Uh oh!

tsrw2048 Jun 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		"rocm[libraries,devel,device-gfx950]${THEROCK_VERSION:+==}${THEROCK_VERSION}" && \
		"rocm[libraries,devel,device-${GFX_ARCH}]${THEROCK_VERSION:+==}${THEROCK_VERSION}" && \

Uh oh!

Conversation

tsrw2048 commented Jun 19, 2026

Motivation

Changes

Uh oh!

mminutoli left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants