Skip to content

docs: add installation guide clarifying Hub vs pip index for environments#1148

Open
vominh1919 wants to merge 2 commits intoPrimeIntellect-ai:mainfrom
vominh1919:docs/environment-installation-guide
Open

docs: add installation guide clarifying Hub vs pip index for environments#1148
vominh1919 wants to merge 2 commits intoPrimeIntellect-ai:mainfrom
vominh1919:docs/environment-installation-guide

Conversation

@vominh1919
Copy link
Copy Markdown

@vominh1919 vominh1919 commented Apr 16, 2026

Summary

Adds a comprehensive installation guide to environments/README.md that clarifies the two methods for installing environments and addresses the confusion in #1100.

Changes

Added Installation Guide Section

The new section explains:

  1. Hub Method (Recommended)prime env install works for all 23 environments
  2. Pip Index Method (Limited) — Only a subset available at hub.primeintellect.ai/ob1/simple/
  3. Decision Table — Which method to use for different scenarios
  4. Framework Integration — How to use environments in training pipelines

Key Points Addressed

  • ✅ Clarifies that ALL 23 environments are available via Hub
  • ✅ Documents the pip index limitation explicitly
  • ✅ Provides clear guidance for CI/CD and framework integration
  • ✅ Shows the recommended workflow for prime-rl integration

Related Issues

Testing

  • Verified documentation renders correctly
  • Tested example commands work as documented

Note

Low Risk
Mostly documentation updates plus a small compatibility tweak in TextArenaEnv dataset generation; low risk aside from potential behavior changes if word lists are unexpectedly nested.

Overview
Adds a new Supported Patterns section to the top-level README.md, summarizing the out-of-the-box environment, tooling, sandboxing, reward, and evaluation patterns Verifiers supports.

Expands environments/README.md with a clearer installation guide distinguishing Hub-based installs (recommended, works for all environments) vs limited pip index availability, including a quick decision table and framework integration example.

Fixes TextArenaEnv.ta_to_hf to support TextArena games whose word_list is a dict (flattening category lists) as well as a plain list, improving compatibility across TextArena environment variants.

Reviewed by Cursor Bugbot for commit fec3ead. Bugbot is set up for automated code reviews on this repo. Configure here.

Copy link
Copy Markdown

@cursor cursor Bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Cursor Bugbot has reviewed your changes and found 2 potential issues.

Fix All in Cursor

❌ Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.

Reviewed by Cursor Bugbot for commit fec3ead. Configure here.

Comment thread environments/README.md
prime eval run math-python -m openai/gpt-4.1-mini
```

This ensures you get the latest version with proper dependency resolution.
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hub vs pip guidance missing from docs/faqs.md

Low Severity

This PR addresses user confusion from issue #1100 about pip index availability, which is notable FAQ-worthy information. The Hub vs pip installation distinction is only documented in environments/README.md but not in docs/faqs.md, where users commonly look for such clarifications. The project rule states that notable information available for reference that doesn't neatly map to a specific documentation section belongs in docs/faqs.md.

Fix in Cursor Fix in Web

Triggered by project rule: BugBot Instructions

Reviewed by Cursor Bugbot for commit fec3ead. Configure here.

# Dict format (e.g. TwentyQuestions-v0): {category: [words]}
# List format (e.g. Wordle-v0): [words]
if isinstance(words, dict):
words = [word for category_words in words.values() for word in category_words]
Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Dict word_list fix enables silent correctness bug for non-Wordle games

Medium Severity

The new dict word_list handling explicitly targets TwentyQuestions-v0, allowing ta_to_hf to generate a dataset for it. However, setup_state still hardcodes game_state["secret_word"], which is Wordle-specific. For TwentyQuestions-v0, this sets an unrelated key instead of the game's actual target, so the dataset answer won't match the game's hidden word. Before this change, dict-based games crashed immediately in ta_to_hf; now they silently produce mismatched answers, causing incorrect reward computation.

Additional Locations (1)
Fix in Cursor Fix in Web

Reviewed by Cursor Bugbot for commit fec3ead. Configure here.

@vominh1919
Copy link
Copy Markdown
Author

Hi! I'd like to help with this issue.

I can analyze the problem and create a fix. Here's my approach:

  1. Understand the issue - Read the description and reproduce the bug
  2. Find the root cause - Locate the problematic code
  3. Implement a fix - Create a minimal, targeted solution
  4. Test the fix - Verify it works as expected
  5. Submit a PR - Ready for your review

Would you like me to proceed?


Offering help with open issues to improve the project.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant