Clarify Open-H checkpoint embodiment metadata by beanscg · Pull Request #3 · NVIDIA-Medtech/GR00T-H

beanscg · 2026-06-08T18:31:44Z

Summary

- Separate the Open-H dataset/platform count from released-checkpoint inference readiness.
- Point users to the existing Hugging Face config alignment validator before trying a new embodiment.
- Clarify that the embodiment comparison table is a dataset/config reference, not a guarantee that every released checkpoint contains processor/statistics metadata for every tag.

Context

This addresses the ambiguity raised in #2 without changing model behavior or making new support claims. The README currently mentions 16 Open-H robot platforms, while direct inference depends on the metadata bundled with the selected checkpoint.

Refs #2.

Validation

- git diff --check
- Confirmed scripts/validate_hf_config_alignment.py exists.
- Checked the README contains the new clarification text.

greptile-apps · 2026-06-08T18:33:04Z

Greptile Summary

This PR clarifies the README to prevent user confusion about which Open-H robot platforms are inference-ready versus simply covered in the dataset. No code or config files are modified.

- The 16-platform count is moved from describing the model to describing the dataset, and a new paragraph directs users to run scripts/validate_hf_config_alignment.py against a specific checkpoint before trying a new embodiment.
- A trailing note on the embodiment table (§ Open-H Dataset) makes it explicit that the table is a dataset/config reference and that a released checkpoint may cover a smaller inference-ready set.

Confidence Score: 5/5

Documentation-only change; the referenced script exists and all three modified passages are internally consistent with the rest of the README.

All changes are prose clarifications in README.md. The validator script link resolves to a confirmed existing file, the 16-platform count is now correctly scoped to the dataset rather than the released checkpoints, and the new guidance paragraph does not contradict anything else in the repository.

No files require special attention.

Important Files Changed

Filename	Overview
README.md	Three targeted clarifications: sentence split separating dataset scope from model description, new paragraph pointing to the config-alignment validator, and a note that the embodiment table is a dataset reference not an inference guarantee.

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A[User wants to run a new embodiment] --> B{Check checkpoint metadata}
    B --> C[Run scripts/validate_hf_config_alignment.py]
    C --> D{Embodiment tag in\nprocessor_config.json\n& statistics.json?}
    D -->|Yes| E[Proceed with direct inference]
    D -->|No| F[Embodiment not inference-ready\nin this checkpoint]
    F --> G[Fine-tune or select\ndifferent checkpoint]
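
The decision step in the flowchart above can be sketched in Python. This is only an illustration, not the repository's scripts/validate_hf_config_alignment.py, whose interface and logic are not shown in this PR. The file names processor_config.json and statistics.json come from the flowchart; everything else is an assumption, in particular that an embodiment tag appears as a dictionary key somewhere inside each file. The function names contains_key and embodiment_ready are hypothetical.

```python
import json
from pathlib import Path


def contains_key(node, key):
    """Return True if `key` is a dict key anywhere in a nested JSON value."""
    if isinstance(node, dict):
        return key in node or any(contains_key(v, key) for v in node.values())
    if isinstance(node, list):
        return any(contains_key(v, key) for v in node)
    return False


def embodiment_ready(checkpoint_dir, tag):
    """Flowchart decision: is `tag` present in both metadata files?

    ASSUMPTION: the tag shows up as a key in each JSON file. The real
    checkpoint layout may differ; use the repository's validator for
    an authoritative answer.
    """
    for name in ("processor_config.json", "statistics.json"):
        path = Path(checkpoint_dir) / name
        if not path.is_file():
            return False  # metadata file missing from this checkpoint
        with path.open() as f:
            if not contains_key(json.load(f), tag):
                return False  # tag absent: not inference-ready here
    return True
```

A call such as `embodiment_ready("path/to/checkpoint", "some_tag")` (both arguments are placeholders) returns False when either file is missing or lacks the tag, which corresponds to the "fine-tune or select a different checkpoint" branch.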


Clarify Open-H checkpoint embodiment metadata

3c28b71

beanscg wants to merge 1 commit into NVIDIA-Medtech:main from beanscg:bean-labs-clarify-openh-checkpoint-embodiments
