PandaFilter

Cut your AI agent's token bill by 60–95% — by removing the noise from what it reads.



Install

brew tap AssafWoo/pandafilter
brew install pandafilter

Linux / any platform:

curl -fsSL https://raw.githubusercontent.com/AssafWoo/homebrew-pandafilter/main/install.sh | bash

First run: PandaFilter downloads the BERT model (~90 MB, all-MiniLM-L6-v2) from HuggingFace and caches it at ~/.cache/huggingface/. Subsequent runs are instant.

Then wire it into your agent:

panda init                              # Claude Code (default)
panda init --agent cursor               # Cursor
panda init --agent gemini               # Gemini CLI
panda init --agent cline                # Cline
panda init --agent copilot              # VS Code Copilot

How it works

When your AI agent runs a command — pip install, cargo build, npm install — PandaFilter intercepts the output and removes everything the model doesn't need: download progress, module graphs, passing test lines, spinners. The agent sees a clean summary with errors, warnings, and results. Nothing useful is dropped.

No config changes. No workflow changes. Runs 100% locally.
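The kind of filtering involved can be approximated with a plain grep. This is an illustrative sketch with made-up patterns, not PandaFilter's actual rule set:

```shell
# Illustrative only: drop common progress/noise lines the way a filter might,
# keeping errors, warnings, and results. These patterns are hypothetical,
# not PandaFilter's real rules.
filter_noise() {
  grep -vE '^ *(Downloading|Compiling|Receiving objects:|Resolving deltas:)'
}
```

PandaFilter's real pipeline is far richer (per-command handlers, BERT summarization), but the principle is the same: noise lines out, signal lines through.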


See it in action

Run panda gain after a session to see your cumulative savings:

[screenshot: panda gain output]


Token savings

Numbers from ccr/tests/handler_benchmarks.rs. Run panda gain to see your own live data.

| Operation | Without PandaFilter | With PandaFilter | Savings |
|---|---:|---:|---:|
| pip install | 1,787 | 9 | −99% |
| uv sync | 1,574 | 15 | −99% |
| playwright test | 1,367 | 19 | −99% |
| docker build | 1,801 | 24 | −99% |
| swift build | 1,218 | 9 | −99% |
| dotnet build | 438 | 3 | −99% |
| cmake | 850 | 5 | −99% |
| gradle build | 803 | 17 | −98% |
| go test | 4,507 | 148 | −97% |
| git merge | 164 | 5 | −97% |
| pytest | 3,818 | 162 | −96% |
| terraform plan | 3,926 | 163 | −96% |
| npm install | 648 | 25 | −96% |
| ember build | 3,377 | 139 | −96% |
| cargo build | 1,923 | 93 | −95% |
| cargo test | 2,782 | 174 | −94% |
| git clone | 139 | 8 | −94% |
| bazel build | 150 | 12 | −92% |
| next build | 549 | 53 | −90% |
| cargo clippy | 786 | 93 | −88% |
| make | 545 | 72 | −87% |
| git diff | 6,370 | 861 | −86% |
| git push | 173 | 24 | −86% |
| ls | 691 | 102 | −85% |
| webpack | 882 | 143 | −84% |
| vitest | 625 | 103 | −84% |
| nx run-many | 1,541 | 273 | −82% |
| turbo run build | 597 | 115 | −81% |
| ruff check | 2,035 | 435 | −79% |
| eslint | 4,393 | 974 | −78% |
| grep | 2,925 | 691 | −76% |
| helm install | 224 | 54 | −76% |
| docker ps | 1,057 | 266 | −75% |
| golangci-lint | 3,678 | 960 | −74% |
| git log | 1,573 | 431 | −73% |
| git status | 650 | 184 | −72% |
| kubectl get pods | 2,306 | 689 | −70% |
| vite build | 526 | 182 | −65% |
| jest | 330 | 114 | −65% |
| env | 1,155 | 399 | −65% |
| mvn install | 4,585 | 1,613 | −65% |
| brew install | 368 | 148 | −60% |
| gh pr list | 774 | 321 | −59% |
| biome lint | 1,503 | 753 | −50% |
| tsc | 2,598 | 1,320 | −49% |
| mypy | 2,053 | 1,088 | −47% |
| stylelint | 1,100 | 845 | −23% |
| **Total** | **81,882** | **14,347** | **−82%** |
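The Savings column is just the relative token reduction, which can be checked by hand; for example, the Total row:

```shell
# Relative token reduction: (without - with) / without, rounded to a whole percent.
savings_pct() {
  awk -v a="$1" -v b="$2" 'BEGIN { printf "%d", (a - b) / a * 100 + 0.5 }'
}
# savings_pct 81882 14347  → 82 (the table's −82% total)
```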

Commands

panda gain — see your token savings:

panda gain                    # overall summary
panda gain --breakdown        # per-command table
panda gain --history          # last 14 days
panda gain --insight          # categorized savings + top saves

panda doctor — diagnose the full installation in one command.

panda init --uninstall — remove hooks:

panda init --uninstall                  # Claude Code
panda init --agent cursor --uninstall   # Cursor

panda focus — opt-in: tells the agent which files matter for the current prompt, preventing unnecessary reads:

panda focus --enable             # enable for this repo
panda focus --disable            # disable (keeps index data)
panda focus --status             # show status + index age
panda focus --dry-run            # preview guidance without enabling

panda index — manually rebuild the file-relationship index:

panda index                      # full/incremental build for current repo

Other commands:

panda verify                            # check hook integrity
panda discover                          # scan history for unfiltered commands
panda run git status                    # run a command through PandaFilter manually
panda proxy git status                  # run raw (no filtering), record baseline
panda read-file src/main.rs --level auto  # preview read filtering
panda expand ZI_3                       # restore a collapsed block
panda noise                             # show learned noise patterns; --reset to clear
panda compress --scan-session           # compress current conversation context

Handlers

59 handlers (70+ command aliases) in ccr/src/handlers/. Lookup cascade:

  1. User filters — .panda/filters.toml or ~/.config/panda/filters.toml
  2. Exact match — direct command name
  3. Static alias table — versioned binaries, wrappers, common aliases
  4. BERT routing — unknown commands matched by embedding similarity
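As a sketch, the cascade is a fall-through lookup. The handler names and alias entries below are a tiny illustrative subset, and the BERT step is stubbed out:

```shell
# Hypothetical sketch of the handler lookup cascade. Alias table is illustrative.
resolve_handler() {
  cmd="$1"
  # 1. user filter files win if they define this command
  for f in .panda/filters.toml "$HOME/.config/panda/filters.toml"; do
    [ -f "$f" ] && grep -q "^\[commands\.$cmd\]" "$f" && { echo "user:$cmd"; return; }
  done
  # 2. exact match against known handlers (subset)
  case "$cmd" in
    cargo|git|pytest|kubectl) echo "$cmd"; return ;;
  esac
  # 3. static alias table: versioned binaries, wrappers (subset)
  case "$cmd" in
    cmake3) echo "cmake"; return ;;
    bazelisk) echo "bazel"; return ;;
    k) echo "kubectl"; return ;;
  esac
  # 4. real implementation falls through to BERT embedding similarity
  echo "unknown"
}
```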
| Handler | Keys | Key behavior |
|---|---|---|
| cargo | cargo | build/clippy: errors (capped at 15) + warning count. test: failures + summary. nextest run: FAIL lines + Summary. |
| git | git | status: counts. log: --oneline, cap 50 with total. diff: 2 context lines, 200-line cap. clone/merge/checkout/rebase: compressed success or full conflict output. |
| go | go | test: NDJSON streaming, FAIL blocks + summary. build: errors only. |
| ember | ember | build: errors + summary; drops fingerprint/asset spam. test: failures + summary. serve: serving URL only. |
| tsc | tsc | Errors grouped by file; deduplicates repeated TS codes. Build OK on clean. Injects --noEmit. |
| vitest | vitest | FAIL blocks + summary; drops passing lines. |
| jest | jest, bun, deno | Failure blocks + summary; drops PASS lines. |
| pytest | pytest | FAILED node IDs + AssertionError + short summary. Injects --tb=short. |
| rspec | rspec | Injects --format json; example-level failures with message + location. |
| rubocop | rubocop | Injects --format json; offenses grouped by severity, capped. |
| rake | rake, bundle | Failure/error blocks + summary; drops passing test lines. |
| mypy | mypy | Errors grouped by file, capped at 10 per file. Injects --no-color. |
| ruff | ruff | Violations grouped by error code. format: summary line only. |
| uv | uv, uvx | Strips Downloading/Fetching/Preparing noise; keeps errors + summary. |
| pip | pip, poetry, pdm, conda | install: [complete — N packages] or already-satisfied short-circuit. |
| python | python | Traceback: keep block + final error. Detects and compresses tabular/CSV, pandas DataFrames, Word (.docx), Excel (.xlsx), and PowerPoint (.pptx) output. Long output: BERT. |
| eslint | eslint | Errors grouped by file, caps at 20 + [+N more]. |
| next | next | build: route table collapsed. dev: errors + ready line. |
| playwright | playwright | Failing test names + error messages; passing tests dropped. Injects --reporter=list. |
| prettier | prettier | --check: files needing formatting + count. |
| vite | vite | Asset chunk table collapsed, HMR deduplication. |
| webpack | webpack | Module resolution graph dropped; keeps assets, errors, build result. |
| turbo | turbo | Inner task output stripped; cache hit/miss per package + final summary. |
| nx | nx, npx nx | Passing tasks collapsed to [N tasks passed]; failing task output kept. |
| stylelint | stylelint | Issues grouped by file, caps at 40 + [+N more]. |
| biome | biome | Code context snippets stripped; keeps file:line, rule, message. |
| kubectl | kubectl, k | get pods: aggregates to [N pods, all running] or problem-pods table with counts. Smart column selection, log anomaly scoring, describe key sections. events: warning-only, capped at 20. |
| terraform | terraform, tofu | plan: +/-/~ + summary. validate: short-circuits on success. output: compact key=value. state list: capped at 50. |
| aws | aws, gcloud, az | Resource extraction; --output json injected for read-only actions. |
| gh | gh | Compact tables for list commands; strips HTML from pr view. |
| helm | helm | list: compact table. status/diff/template: structured. |
| docker | docker | logs: ANSI strip + BERT. ps/images: formatted tables + total size. build: errors + final image ID. |
| make | make, ninja | "Nothing to be done" short-circuit; keeps errors. Injects --no-print-directory. |
| golangci-lint | golangci-lint | Diagnostics grouped by file; runner noise dropped. Detects v1 text and v2 JSON formats. |
| prisma | prisma | generate/migrate/db push: structured summaries. |
| mvn | mvn | Drops [INFO] noise; keeps errors + reactor summary. |
| gradle | gradle | UP-TO-DATE tasks collapsed; FAILED tasks and errors kept. |
| npm/yarn | npm, yarn | install: package count; strips boilerplate. |
| pnpm | pnpm | install: summary; drops progress bars. |
| brew | brew | install/update: status lines + Caveats. |
| curl | curl | JSON → type schema. Non-JSON: cap 30 lines. |
| grep / rg | grep, rg | Compact paths, per-file 100-match cap, line numbers preserved, [N matches in M files] summary. Injects --no-heading --with-filename. Match-centered line truncation. |
| find | find | Groups by directory, caps at 50. Injects -maxdepth 8 if unset. |
| journalctl | journalctl | Injects --no-pager -n 200. BERT anomaly scoring. |
| psql | psql | Strips borders, caps at 20 rows. |
| tree | tree | Auto-injects -I "node_modules\|.git\|target\|...". |
| diff | diff | +/-/@@ + 2 context lines, max 5 hunks. |
| jq | jq | Array: schema of first element + [N items]. |
| env | env | Categorized sections; sensitive values redacted. |
| ls | ls | Drops noise dirs; top-3 extension summary. |
| log | log | Timestamp/UUID normalization, dedup [×N], error summary block. |
| rsync | rsync | Drops per-file transfer progress lines (to-chk=, MB/s); keeps file list and final summary. |
| ffmpeg | ffmpeg, ffprobe | Drops frame= and size= real-time progress lines; keeps input/output codec info and final size line. |
| wget | wget | Injects --quiet if no verbosity flag set. |
| swift | swift, swift-build, swift-test | build: errors/warnings + Build complete. test: failures + summary. package resolve: strips progress. |
| dotnet | dotnet, dotnet-cli | build: errors grouped by CS code + summary. Short-circuits on clean build. test: failures + summary. restore: package count. |
| cmake | cmake, cmake3 | configure: errors + final written-to line. --build: errors + [N targets built]. Auto-detects mode from args/output. |
| bazel | bazel, bazelisk, bzl | build: errors + completion summary [N actions, build OK (Xs)]. test: failures + [N passed, N failed]. query: cap at 30 targets. |
Pipeline architecture
0. Hard input ceiling (200k chars — truncates before any stage)
1. Strip ANSI codes
2. Normalize whitespace
3. Global regex pre-filter (progress bars, spinners, download lines, decorators)
4. NDJSON streaming compaction (go test -json, jest JSON reporter)
5. Command-specific pattern filter
6. If over summarize_threshold_lines:
   6a. BERT noise pre-filter
   6b. Entropy-adaptive BERT summarization (up to 7 passes)
7. Hard output cap (50k chars)

Outputs under 15 tokens skip the pipeline entirely. Step 6b falls back to head+tail if BERT is unavailable.
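The deterministic early stages translate naturally to a shell pipeline. A toy sketch (the BERT stages are not reproducible this way, and the filter patterns here are illustrative, not PandaFilter's):

```shell
ESC=$(printf '\033')
pipeline_sketch() {
  sed "s/${ESC}\[[0-9;]*m//g" |          # 1. strip ANSI color codes
    sed 's/[[:space:]]*$//' |            # 2. normalize trailing whitespace
    grep -vE '^ *(Downloading|Fetching)' | # 3. toy regex pre-filter
    head -c 50000                        # 7. hard output cap (50k chars)
}
```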

Pre-run cache (fires before execution): git, kubectl, docker, and terraform commands are hashed against live state. A hit skips execution entirely and returns the cached output with a [PC: cached from Xm ago] marker.
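The cache key idea reduces to hashing the command together with a fingerprint of the relevant live state; a minimal sketch (how PandaFilter actually fingerprints state and stores results is internal):

```shell
# Hypothetical pre-run cache key: same command + same state fingerprint → same key,
# so a hit can return the stored output without executing. Assumes sha256sum.
cache_key() {
  printf '%s|%s' "$1" "$2" | sha256sum | cut -d' ' -f1
}
```

With this shape, `git status` against an unchanged HEAD hashes to the same key and can be served from cache; any state change produces a new key and forces a real run.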

Configuration

Config loaded from: ./panda.toml → ~/.config/panda/config.toml → embedded default.

[global]
summarize_threshold_lines = 50
head_lines = 30
tail_lines = 30
strip_ansi = true
normalize_whitespace = true
deduplicate_lines = true
input_char_ceiling = 200000
output_char_cap = 50000
# cost_per_million_tokens = 15.0

[tee]
enabled = true
mode = "aggressive"   # "aggressive" | "always" | "never"

[read]
mode = "auto"   # "passthrough" | "auto" | "strip" | "aggressive"

[focus]
enabled = false       # disabled by default — enable with `panda focus --enable` after testing
min_files = 25        # skip for repos smaller than this
min_lines = 2000      # skip for repos with fewer source lines

[commands.git]
patterns = [
  { regex = "^(Counting|Compressing|Receiving|Resolving) objects:.*", action = "Remove" },
]

[commands.cargo]
patterns = [
  { regex = "^\\s+Compiling \\S+ v[\\d.]+", action = "Collapse" },
  { regex = "^\\s+Downloaded \\S+ v[\\d.]+", action = "Remove"   },
]

Pattern actions: Remove, Collapse, ReplaceWith = "text", TruncateLinesAt = N, HeadLines = N, TailLines = N, MatchOutput = "msg", OnEmpty = "msg".

Pricing uses cost_per_million_tokens from panda.toml if set; otherwise it is inferred from the ANTHROPIC_MODEL env var (Opus 4.6: $15, Sonnet 4.6: $3, Haiku 4.5: $0.80); otherwise it falls back to $3.00.
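Dollar savings then follow as tokens × rate ÷ 1,000,000. For example, the 67,535 tokens saved in the benchmark total (81,882 − 14,347) at the default $3/M:

```shell
# Cost of N tokens at a given $/million-tokens rate.
token_cost() {
  awk -v t="$1" -v rate="$2" 'BEGIN { printf "%.2f", t * rate / 1000000 }'
}
# token_cost 67535 3  → 0.20 (dollars)
```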

User-defined filters

Place filters.toml at .panda/filters.toml (project-local) or ~/.config/panda/filters.toml (global). Project-local overrides global for the same key. Runs before any built-in handler.

[commands.myapp]
patterns = [
  { regex = "^DEBUG:",            action = "Remove" },
  { regex = "^\\S+\\.ts\\(",     action = "TruncateLinesAt", max_chars = 120 },
]
on_empty = "(no relevant output)"

[commands.myapp.match_output]
pattern        = "Server started"
message        = "ok — server ready"
unless_pattern = "error"
Session intelligence

State tracked via PANDA_SESSION_ID=$PPID, stored at ~/.local/share/panda/sessions/<id>.json.

  • Result cache — post-pipeline bytes frozen per input hash; returned identically on repeat calls to prevent prompt cache busts.
  • Semantic delta — repeated commands emit only new/changed lines: [Δ from turn N: +M new, K repeated — ~T tokens saved].
  • Cross-turn dedup — identical outputs (cosine > 0.92) collapse to [same output as turn 4 (3m ago) — 1.2k tokens saved].
  • Elastic context — pipeline pressure scales with session size. At >80% pressure: [⚠ context near full — run panda compress --scan-session].
  • Intent-aware query — reads the agent's last message from the live session JSONL and uses it as the BERT query.
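In its exact-match form, cross-turn dedup amounts to remembering a hash per turn. This bash sketch only catches byte-identical repeats, whereas the real feature also collapses near-duplicates by embedding cosine similarity and persists state in the session JSON:

```shell
# Exact-match variant of cross-turn dedup: remember a hash per turn and
# collapse byte-identical outputs. State lives in a shell array here,
# not in a session file. Assumes sha256sum (coreutils).
declare -a SEEN_HASHES=()
dedup_turn() {
  local h i
  h=$(printf '%s' "$1" | sha256sum | cut -d' ' -f1)
  for i in "${!SEEN_HASHES[@]}"; do
    if [ "${SEEN_HASHES[$i]}" = "$h" ]; then
      echo "[same output as turn $((i + 1))]"
      return
    fi
  done
  SEEN_HASHES+=("$h")
  printf '%s\n' "$1"
}
```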
Hook architecture

| Agent | Config | Script |
|---|---|---|
| Claude Code | ~/.claude/settings.json | ~/.claude/hooks/panda-rewrite.sh |
| Cursor | ~/.cursor/hooks.json | ~/.cursor/hooks/panda-rewrite.sh |
| Gemini CLI | ~/.gemini/hooks.json | ~/.gemini/panda-rewrite.sh |
| Cline | .clinerules (project dir) | — (rules-based) |
| VS Code Copilot | .github/hooks/panda-rewrite.json | .github/hooks/panda-rewrite.sh |

All agents share the same binary and filtering pipeline.

PreToolUse: known handler → rewrites to panda run <cmd>; unknown → no-op; already wrapped → no double-wrap; compound commands → each segment rewritten independently.
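For compound commands, the rewrite wraps each && segment separately and skips already-wrapped segments; a hedged sketch (the handler lookup behind the unknown-command no-op is omitted here):

```shell
# Toy PreToolUse rewrite: wrap each '&&' segment in `panda run`, skipping
# segments already wrapped (no double-wrap). Real hook also checks whether
# a handler exists for the segment; that step is omitted.
rewrite_compound() {
  printf '%s' "$1" | awk -F' && ' '{
    for (i = 1; i <= NF; i++) {
      seg = $i
      if (seg !~ /^panda (run|proxy) /) seg = "panda run " seg
      printf "%s%s", (i > 1 ? " && " : ""), seg
    }
  }'
}
```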

PostToolUse: Bash → full pipeline; Read → BERT + session dedup; Glob → grouped by directory; Grep → compact paths.

UserPromptSubmit: Context Focusing module → queries file graph → injects guidance (recommended + excluded files).

Hook integrity: panda init writes SHA-256 baselines (chmod 0o444). PandaFilter verifies at every invocation and exits 1 with a warning if tampered. panda verify checks all installed agents.
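The baseline-and-verify scheme maps onto standard checksum tooling. A sketch with GNU sha256sum and illustrative file names (the real baselines and paths are managed by panda init):

```shell
# Sketch of hook-integrity checking: record a SHA-256 baseline once, then
# verify the hook script against it before trusting it.
write_baseline() {        # $1 = hook script, $2 = baseline file
  sha256sum "$1" > "$2"
  chmod 444 "$2"          # read-only, like panda init's 0o444 baselines
}
verify_hook() {           # $1 = baseline file; non-zero exit on tampering
  sha256sum --status -c "$1"
}
```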

Crate overview
ccr/        CLI binary (panda) — handlers, hooks, session state, commands
ccr-core/   Core library (no I/O) — pipeline, BERT summarizer, config, analytics
ccr-sdk/    Conversation compression — tiered compressor, deduplicator, Ollama
ccr-eval/   Evaluation suite — fixtures against Claude API
config/     Embedded default filter patterns
Uninstall
panda init --uninstall                        # Claude Code
panda init --agent cursor --uninstall         # Cursor
panda init --agent gemini --uninstall         # Gemini CLI
panda init --agent cline --uninstall          # Cline
panda init --agent copilot --uninstall        # VS Code Copilot

brew uninstall pandafilter && brew untap AssafWoo/pandafilter   # Homebrew
# or: cargo uninstall panda

rm -rf ~/.local/share/panda                   # analytics + sessions
rm -rf ~/.cache/huggingface/hub/models--sentence-transformers--all-MiniLM-L6-v2

FAQ

Does PandaFilter change what the agent can see? It removes noise — build progress, passing test lines, module download logs. Errors, file paths, and results are always kept.

What if I don't want a specific command filtered? Add a rule to .panda/filters.toml to customize or override any handler. See the User-defined filters section. You can also use panda proxy <cmd> to run a command raw with no filtering.

What about commands PandaFilter doesn't know? Output passes through unchanged. PandaFilter never silently drops output from unknown commands.

How do I verify it's working? Run panda gain after a session. To see exactly what the agent received from a specific command: panda run git log --oneline -20.

Does PandaFilter send any data outside my machine? No. All processing is fully local. BERT runs on-device.

What is Context Focusing? An opt-in feature that tells the agent which files are relevant for the current prompt, preventing it from reading unrelated files. Enable with panda focus --enable after running panda doctor to confirm the index is ready.


Why PandaFilter? Why Panda?

AI coding sessions are expensive — not because of what you ask, but because of what the agent reads back. Every cargo build or npm install dumps thousands of tokens of noise into the context window. I built PandaFilter to strip that out automatically.

The name comes from how a panda eats: it consumes enormous amounts of raw material and extracts only what it needs.

Assaf Petronio · github.com/AssafWoo


Contributing

Open an issue or PR on GitHub. To add a handler: implement the Handler trait and register it in ccr/src/handlers/mod.rs — see git.rs as a template.


License

MIT — see LICENSE.
