Show session token usage / context window size in chat toolbar

## Summary

Surface a session's token usage directly in the chat toolbar, so you can eyeball context-window occupancy and cost without opening the hidden `SessionDebugPanel`.

<img alt="Image" src="https://github.com/user-attachments/assets/f396e09a-3a97-4b83-8e6a-1082c0b1befc" />
<img alt="Image" src="https://github.com/user-attachments/assets/8740ab4c-7444-4f3d-ab5a-0b1ab205acdb" />

## Problem

Token usage is already captured per run (`RunEntry.usage`) for every backend (Claude, Codex, OpenCode, Cursor), but it's only visible inside the debug panel. While working a session you have no quick signal for:

- How full the model's context window is right now.
- How many input/output tokens the session has burned so far.

## Proposal

Add a compact **usage chip** to `ChatToolbar` (next to the Send button):

- **Chip body:** current context-window size — the last turn's full prompt (`input + cache_read + cache_creation`), e.g. `180k ctx`.
- **Tooltip:** last-turn context size plus session totals (input, output, cache read, cache creation).
- Hidden when the session has no usage yet (new/archived sessions) and on narrow viewports.

## Cross-backend correctness

Usage shape differs per backend, so the implementation normalizes everything to one convention (Anthropic-style: `input_tokens` excludes cached tokens, cache counters are additive):

- **Claude** — `result.usage` is already Anthropic-style. Context must sum all input-side counters (bare `input_tokens` is just the *uncached* slice and badly understates real size).
- **Codex** — `thread/tokenUsage/updated` payload is `ThreadTokenUsage { last, total, … }`; the per-turn breakdown lives under `.last`. `inputTokens` is OpenAI-style (includes cached), so it's normalized to `input = inputTokens − cachedInputTokens`.
- **Cursor** — already Anthropic-style field names; no change.
- **OpenCode** — `step-finish.tokens` separates `cache.read`/`cache.write` from `input`; Anthropic-style, no change.

This is backend-specific, not model-specific, so it works for all LLMs each backend exposes.

## Implementation notes

- Rust: `Session` gains `total_usage` (session billing total) and `latest_usage` (last run's breakdown), both derived in `SessionMetadata::to_session()` from `runs[].usage` — no new write path, no migration (existing runs already carry `usage`).
- Frontend: new `SessionUsageChip` component wired through `ChatToolbar` ← `ChatWindow`.
- No new Tauri commands — only schema additions on the existing `Session` payload.

## Screenshots



## Acceptance criteria

- [ ] Chip shows realistic context size on Claude, Codex, OpenCode, and Cursor sessions.
- [ ] Tooltip shows last-turn context + session input/output/cache totals.
- [ ] Chip hidden for sessions with no usage and on narrow viewports.
- [ ] Values persist across app restart.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Show session token usage / context window size in chat toolbar #383

Summary

Problem

Proposal

Cross-backend correctness

Implementation notes

Screenshots

Acceptance criteria

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Uh oh!

Show session token usage / context window size in chat toolbar #383

Description

Summary

Problem

Proposal

Cross-backend correctness

Implementation notes

Screenshots

Acceptance criteria

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions