AgentGuard

Autonomous security scanner for AI agents. Detects prompt injection, tool abuse, data exfiltration, and OWASP ASI Top 10 vulnerabilities in agent code.

Why AgentGuard?

AI agents are being deployed at scale -- in coding tools, customer support, trading bots, and autonomous systems. Nobody is scanning their code for security vulnerabilities.

Existing tools (Bandit, Semgrep, CodeQL) scan for traditional vulnerabilities. AgentGuard scans for agent-specific attack vectors that traditional SAST tools miss.

Comparison

Feature	AgentGuard	Semgrep	CodeQL	Bandit
Prompt Injection (ASI01)	Yes	No	No	No
Tool Abuse (ASI02)	Yes	No	No	Partial
Data Exfiltration (ASI03)	Yes	No	No	No
Excessive Agency (ASI04)	Yes	No	No	No
Supply Chain (ASI05)	Yes	No	No	No
Insecure Output (ASI06)	Yes	No	No	No
Credential Exposure (ASI07)	Yes	Partial	Partial	Yes
Context Manipulation (ASI08)	Yes	No	No	No
Agent Loop Exploitation (ASI09)	Yes	No	No	No
Trust Boundary (ASI10)	Yes	No	No	No
OWASP ASI Top 10 Coverage	10/10	1/10	1/10	2/10
MCP Server Mode	Yes	No	No	No
SARIF Output	Yes	Yes	Yes	No
Pre-commit Hook	Yes	Yes	No	No
GitHub Action	Yes	Yes	Yes	No

Quick Start

pip install dfx-agentguard

# Scan a directory
agentguard .

# JSON output for CI/CD
agentguard src/ --format json

# SARIF for GitHub Code Scanning
agentguard . --format sarif > results.sarif

# Only show HIGH and above
agentguard . --min-severity HIGH

# Include test files in scan
agentguard . --include-tests

CLI Usage

agentguard [OPTIONS] [TARGET]

Arguments:
  TARGET                   Directory or file to scan (default: current directory)

Options:
  --format [text|json|sarif]   Output format (default: text)
  --exit-code / --no-exit-code  Exit non-zero if findings found (default: on)
  --min-severity [CRITICAL|HIGH|MEDIUM|LOW|INFO]  Minimum severity to report
  --include-tests               Include test files in scan (default: skip)
  --help                        Show help

OWASP ASI Top 10 Coverage

ID	Vulnerability	Status	Detection Method
ASI01	Prompt Injection	Detected	f-string, .format(), messages array, context stuffing, tool description poisoning
ASI02	Tool Abuse / Unintended Tool Use	Detected	os.system, subprocess, shell tools, unrestricted registration
ASI03	Data Exfiltration	Detected	External URLs, variable URL correlation, fetch/axios, subprocess curl, DNS exfil
ASI04	Unauthorized Actions / Excessive Agency	Detected	Auto-execute, no confirmation, autonomous actions
ASI05	Supply Chain / Untrusted Components	Detected	Dynamic import, unpinned deps, untrusted pip install
ASI06	Insecure Output Handling	Detected	LLM output in HTML/JSX/DOM, innerHTML, document.write, markdown.render
ASI07	Credential / Secret Exposure	Detected	API keys (sk-, ghp_, AKIA, AIza, xox), private keys, passwords, connection strings
ASI08	Context Window Manipulation	Detected	Unbounded context, token stuffing, missing limits
ASI09	Agent Loop Exploitation	Detected	Recursive calls without depth limit, while True, no max iterations
ASI10	Trust Boundary Violation	Detected	Root access, host filesystem mounts, no sandbox, self-modification

CI/CD Integration

GitHub Action

name: Security Scan
on: [push, pull_request]

jobs:
  agentguard:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - uses: actions/setup-python@v5
        with:
          python-version: '3.12'
      - run: pip install dfx-agentguard
      - run: agentguard . --format sarif > results.sarif
      - uses: github/codeql-action/upload-sarif@v3
        with:
          sarif_file: results.sarif

Drop-in GitHub Action

- uses: dockfixlabs/agentguard@v0.4.0
  with:
    path: src/
    format: sarif

Pre-commit Hook

repos:
  - repo: https://github.com/dockfixlabs/agentguard
    rev: v0.4.0
    hooks:
      - id: agentguard
        args: ["--min-severity", "HIGH"]

Programmatic Usage

from agentguard.scanner import scan_directory

result = scan_directory("src/")

print(f"Found {len(result.findings)} issues")
print(f"Critical: {result.critical_count}")
print(f"High: {result.high_count}")

for finding in result.findings:
    print(f"  [{finding.severity}] {finding.rule_name} at {finding.file}:{finding.line}")

MCP Server Mode

Scan agent code directly from Claude Code, Cursor, or any MCP-compatible client:

{
  "mcpServers": {
    "agentguard": {
      "command": "python3",
      "args": ["-m", "agentguard.mcp_server"]
    }
  }
}

Then ask Claude: "Scan my agent code for security vulnerabilities"

Benchmark Results

Tested against 28 vulnerable code samples + 8 real-world attack patterns:

Category      Total   Detected     Rate    FP
ASI01             6          6     100%     0
ASI02             5          5     100%     0
ASI03             4          4     100%     0
ASI07             6          6     100%     0
ASI10             5          5     100%     0
clean             2          0       -      0
TOTAL            28         26    100%     0

100% detection rate, 0% false positives.

Project Ecosystem

Repository	Description
agentguard	Core scanner + CLI + MCP server
mcp-scanner	MCP server configuration scanner
agentguard-app	GitHub App for automated PR reviews
agentguard-vscode	VS Code extension
agentguard-benchmark	Benchmark suite (28 samples)

Roadmap

See the full ROADMAP.md.

Contributing

See CONTRIBUTING.md. Bug reports and feature requests welcome.

Security

See SECURITY.md. Report vulnerabilities privately -- do not open public issues.

License

MIT -- see LICENSE.

Built by Dockfix Labs. Built for the AI agent era.

Name		Name	Last commit message	Last commit date
Latest commit History 47 Commits
.github		.github
agentguard		agentguard
tests		tests
.gitignore		.gitignore
.pre-commit-hooks.yaml		.pre-commit-hooks.yaml
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md
ROADMAP.md		ROADMAP.md
SECURITY.md		SECURITY.md
action.yml		action.yml
pyproject.toml		pyproject.toml
release_notes_v021.md		release_notes_v021.md
release_notes_v030.md		release_notes_v030.md
release_notes_v040.md		release_notes_v040.md
release_notes_v050.md		release_notes_v050.md
setup.py		setup.py
trusted_publishing.md		trusted_publishing.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AgentGuard

Why AgentGuard?

Comparison

Quick Start

CLI Usage

OWASP ASI Top 10 Coverage

CI/CD Integration

GitHub Action

Drop-in GitHub Action

Pre-commit Hook

Programmatic Usage

MCP Server Mode

Benchmark Results

Project Ecosystem

Roadmap

Contributing

Security

License

About

Uh oh!

Releases 10

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AgentGuard

Why AgentGuard?

Comparison

Quick Start

CLI Usage

OWASP ASI Top 10 Coverage

CI/CD Integration

GitHub Action

Drop-in GitHub Action

Pre-commit Hook

Programmatic Usage

MCP Server Mode

Benchmark Results

Project Ecosystem

Roadmap

Contributing

Security

License

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 10

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages