Thank you for your interest in contributing to RepoScale.
The most impactful contribution right now is curating benchmark cases — real repositories that test specific aspects of repo continuation intelligence. See docs/case-authoring.md for the complete guide, docs/dataset_format.md for the case pack format, and cases/CORPUS.md for the current inventory.
Help define and refine evaluation criteria. See docs/scoring.md for the current scoring model.
Improve task prompts in prompts/ or the LLM judge protocol in prompts/judge.md.
Improve the CLI (src/reposcale/cli.py), scoring layers, or build new runners and scorers.
# Clone the repo
git clone https://github.com/YOUR_ORG/reposcale.git
cd reposcale
# Install in development mode
pip install -e .
# Run tests
python -m pytest tests/ -v
# Validate all case packs
reposcale validate cases/diagnose/diagnose-001/ cases/intent/intent-001/- Commits: use Conventional Commits (
feat:,fix:,docs:, etc.) - Schemas: all case packs must validate against
schemas/case.schema.json - Prompts: write in English, keep them auditable and version-controlled
- Fork the repo
- Read
docs/case-authoring.mdfor the complete guide - Create a new directory under
cases/<track>/(e.g.,cases/diagnose/diagnose-005/) - Include
case.yaml,hints.yaml,tree.txt, andrepo/with real code - Run
reposcale validate cases/<track>/diagnose-005/ - Submit a PR with a brief description of what the case tests
Be respectful, constructive, and evidence-driven — the same principles we ask of the models we evaluate.