feat: add code-review environment by vominh1919 · Pull Request #1152 · PrimeIntellect-ai/verifiers

vominh1919 · 2026-04-16T10:48:03Z

Summary

Adds a new code-review environment for evaluating LLM's ability to review code.

Features

Bug detection: Identifies potential bugs in code
Security analysis: Flags security vulnerabilities
Code quality: Suggests improvements and best practices
Multi-language: Supports Python, JavaScript, and more

Bounties Program

This environment aligns with Prime Intellect's Environments Program:

Category: Open Access ($100-500)
Type: Self-contained benchmark implementation
Tags: code-review, coding, analysis

Usage

prime env install code-review
prime eval run code-review -m openai/gpt-4.1-mini

Closes #

Note

Low Risk
Additive, self-contained environment modules with simple scoring and no changes to shared infrastructure, auth, or data-handling paths.

Overview
Adds three new installable evaluation environments under environments/: api-design, code-review, and sql-query.

Each environment defines a load_environment() that builds small synthetic train/eval HF Datasets, configures a simple keyword-based scoring Rubric, and returns a vf.SingleTurnEnv with an appropriate system prompt. New pyproject.toml files wire packaging metadata, dependencies (verifiers>=0.1.8), and default eval settings; code-review also includes a README with usage instructions.

^{Reviewed by Cursor Bugbot for commit 439fa61. Bugbot is set up for automated code reviews on this repo. Configure here.}

vominh1919 · 2026-04-16T10:48:42Z

Update

Added 2 more environments to this PR:

code-review: Code review and bug detection
api-design: RESTful API design evaluation
sql-query: SQL query writing evaluation

All environments follow Prime Intellect's Environments Program guidelines for Open Access bounties ($100-500 each).

Total: 3 environments ready for review!

cursor

Cursor Bugbot has reviewed your changes and found 2 potential issues.

^{❌ Bugbot Autofix is OFF. To automatically fix reported issues with cloud agents, enable autofix in the Cursor dashboard.}

^{Reviewed by Cursor Bugbot for commit 439fa61. Configure here.}

cursor · 2026-04-16T10:52:12Z

+            "answer": "Security issue: No path validation. Could allow directory traversal attacks. Should validate filename.",
+            "language": "python"
+        },
+    ] * (num_eval_examples + 1)


Dataset uses prompt string, causing assertion crash

High Severity

All three new environments use "prompt" as the dataset column key with a plain string value, but also pass a system_prompt to SingleTurnEnv. The framework's _ensure_prompt method, when it finds a prompt column already present and system_prompt is set, asserts that prompt must be a list of messages — causing an AssertionError at initialization. The column key needs to be "question" instead of "prompt" so the framework properly wraps the string into a messages list. All existing environments that use string-valued prompts use "question" for this reason.

Additional Locations (2)

environments/api-design/api-design.py#L15-L35

environments/sql-query/sql-query.py#L15-L35

^{Reviewed by Cursor Bugbot for commit 439fa61. Configure here.}

cursor · 2026-04-16T10:52:12Z

+        system_prompt="You are an expert code reviewer. Analyze the code and identify bugs, security issues, and potential improvements.",
+        rubric=rubric,
+        message_type="chat",
+    )


New environments missing from environments/README.md

Low Severity

Three new environments (code_review, api-design, sql-query) are added to the environments/ folder but environments/README.md is not updated to list them. The project rules require that any PR adding or removing an environment must update environments/README.md to reflect the change under the appropriate category/pattern section.

Additional Locations (2)

environments/api-design/api-design.py#L1-L63

environments/sql-query/sql-query.py#L1-L62

^{Triggered by project rule: BugBot Instructions}

^{Reviewed by Cursor Bugbot for commit 439fa61. Configure here.}

vominh1919 added 2 commits April 16, 2026 17:48

feat: add code-review environment for code analysis

ba8e74c

feat: add api-design and sql-query environments

439fa61

cursor Bot reviewed Apr 16, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: add code-review environment#1152

feat: add code-review environment#1152
vominh1919 wants to merge 2 commits intoPrimeIntellect-ai:mainfrom
vominh1919:feat/code-review-env

vominh1919 commented Apr 16, 2026 •

edited by cursor Bot

Loading

Uh oh!

vominh1919 commented Apr 16, 2026

Uh oh!

cursor Bot left a comment

Uh oh!

cursor Bot Apr 16, 2026

Uh oh!

cursor Bot Apr 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

vominh1919 commented Apr 16, 2026 • edited by cursor Bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Features

Bounties Program

Usage

Uh oh!

vominh1919 commented Apr 16, 2026

Update

Uh oh!

cursor Bot left a comment

Choose a reason for hiding this comment

Uh oh!

cursor Bot Apr 16, 2026

Choose a reason for hiding this comment

Dataset uses prompt string, causing assertion crash

Uh oh!

cursor Bot Apr 16, 2026

Choose a reason for hiding this comment

New environments missing from environments/README.md

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

vominh1919 commented Apr 16, 2026 •

edited by cursor Bot

Loading

Dataset uses `prompt` string, causing assertion crash

New environments missing from `environments/README.md`