Investigate: can Auto mode's permission classifier be routed through the proxy (and how)? #290

Description

@cryptiklemur

Goal

I'd like Claude Code's Auto mode to work through better-ccflare — including
its permission classifier traffic. I'm not sure that's possible, so this is to
investigate whether it can be done and, if so, how.

Background

Auto mode (released 2026-03-24) runs without routine permission prompts, but per
Anthropic's docs "every shell command or network action first passes through a
separate safety classifier — a second model that vets each action." That
classifier is an additional model call.

For the proxy to fully support Auto mode, that classifier call ideally routes
through better-ccflare too, so it benefits from the same load balancing, account
fallback, and analytics as normal turns. Right now I don't know whether that's
even where the call goes.

What to figure out

  • Does it work at all today? With Claude Code pointed at the proxy via
    ANTHROPIC_BASE_URL, does Auto mode function, and does the classifier
    call go through the proxy or bypass it straight to Anthropic?
  • If it bypasses — can we make it route through? Is the classifier
    endpoint controlled by the same base URL / env config, or is it hardcoded
    / separate? Is there any knob (env var, header, config) that points it at
    the proxy?
  • If it routes through: does better-ccflare handle it correctly? What
    model is it, does it pick up account selection (session / leastUsed /
    session-affinity), and does its usage show up in analytics rather than
    getting dropped or mis-attributed?
  • Rate-limit impact: if classifier calls do flow through, how much extra
    volume does Auto mode add per account, and does that change which strategy
    is advisable?
  • What (if anything) would the proxy need to change to support routing
    the classifier call?
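
One rough way to approach the first and fourth questions is to compare what the proxy sees for the same task with Auto mode off and on. The sketch below is a minimal one, assuming the proxy can dump each incoming request's path and raw JSON body. The `LoggedRequest` record shape and the helper names are hypothetical and not part of better-ccflare; the only thing taken from the Anthropic API is that Messages requests are sent to `/v1/messages` with a top-level `model` field.

```python
import json
from collections import Counter
from dataclasses import dataclass


@dataclass
class LoggedRequest:
    """One request as seen by the proxy (hypothetical log record shape)."""
    path: str
    body: str  # raw JSON request body as received


def model_of(req: LoggedRequest) -> str:
    """Return the top-level "model" field of the body, or "unknown"."""
    try:
        payload = json.loads(req.body)
    except json.JSONDecodeError:
        return "unknown"
    if isinstance(payload, dict):
        return str(payload.get("model", "unknown"))
    return "unknown"


def calls_per_model(requests: list[LoggedRequest]) -> Counter:
    """Count Messages API calls per model, ignoring other endpoints."""
    return Counter(
        model_of(r)
        for r in requests
        if r.path.split("?")[0] == "/v1/messages"
    )


def extra_calls(baseline: Counter, auto_mode: Counter) -> dict[str, int]:
    """Per-model increase in call count between the two runs."""
    return {
        model: count - baseline.get(model, 0)
        for model, count in auto_mode.items()
        if count > baseline.get(model, 0)
    }


if __name__ == "__main__":
    # Placeholder model names; the real classifier model is not known here.
    normal_run = [LoggedRequest("/v1/messages", '{"model": "main-model"}')]
    auto_run = [
        LoggedRequest("/v1/messages", '{"model": "main-model"}'),
        LoggedRequest("/v1/messages", '{"model": "classifier-model"}'),
        LoggedRequest("/v1/messages", '{"model": "classifier-model"}'),
    ]
    print(extra_calls(calls_per_model(normal_run), calls_per_model(auto_run)))
```

If the Auto mode run shows no additional calls at all, that is consistent with the classifier bypassing the proxy or running server-side, but it does not prove either. If the classifier reuses the main model, the increase would show up under that model's name rather than a separate one.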

Notes

I haven't confirmed where the classifier call actually goes — that's the first
thing to pin down, since everything else depends on it. If it turns out the
classifier is server-side / out-of-band and genuinely can't be pointed at a
proxy, that's a valid (if disappointing) answer to capture here.
