agent-judge

Composable second-agent review for agentsys — scope creep detection, hallucination flagging, safety gates

A lightweight, self-hosted judge protocol that routes agent output through a configurable review before finalizing. No cloud platform required.

What It Does

Before an agent's output reaches the user (or triggers irreversible action), agent-judge runs it through a specialized judge that evaluates:

Scope creep: Did the agent do more than asked? Did it modify files outside its scope?
Hallucination: Does the output contain claims not grounded in the provided context?
Reversibility: Is the action reversible? Should it require explicit approval?
Safety: Does the output contain credentials, PII, or dangerous shell commands?

Commands

/judge - Run judge review on current agent output or a specific artifact
/judge-config - Configure judge thresholds and categories for this project

Quick Start

npm install -g agentsys
agentsys  # select agent-judge from the marketplace

Or install directly:

agentsys install agent-judge

Usage

/judge --category scope-creep --threshold warn
/judge --input path/to/diff.txt --category all
/judge-config --block-on safety --warn-on scope-creep,hallucination

The Protocol

Each judge run produces a verdict: PASS | FLAG | BLOCK

PASS: Output is clean, proceed
FLAG: Issue detected but not blocking - annotate and continue with warning
BLOCK: Critical issue - stop and require explicit human approval

Verdicts include a structured rationale explaining exactly what triggered the verdict.

Integration

Post-edit hook (Claude Code)

{
  "hooks": {
    "PostToolUse": [
      {
        "matcher": "Edit|Write|MultiEdit",
        "hooks": [{ "type": "command", "command": "agentsys judge --threshold warn --category scope-creep,safety" }]
      }
    ]
  }
}

Pre-PR gate

agentsys judge --input "$(git diff main...HEAD)" --task "$TASK_DESCRIPTION" --threshold block

Protocol Specification

See JUDGE.md for the full JUDGE protocol v1.0 specification.

References

Part of the agentsys ecosystem
https://agentskills.io

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
.github/workflows		.github/workflows
commands		commands
lib		lib
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
JUDGE.md		JUDGE.md
README.md		README.md
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

agent-judge

What It Does

Commands

Quick Start

Usage

The Protocol

Integration

Post-edit hook (Claude Code)

Pre-PR gate

Protocol Specification

References

About

Uh oh!

Releases 1

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

agent-judge

What It Does

Commands

Quick Start

Usage

The Protocol

Integration

Post-edit hook (Claude Code)

Pre-PR gate

Protocol Specification

References

About

Resources

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages