Added
- Extractive backend: auto-detected support for KRLabsOrg/verbatim-rag-modern-bert-v2 and other Verbatim-RAG ModernBERT span models. Same CLI, just point SQUEEZ_LOCAL_MODEL at the HF id.
- Session stats: --stats prints token/line savings per call; --summary shows accumulated savings. Logged to ~/.cache/squeez/session_stats.jsonl.
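
The session log is a JSONL file, so it can also be inspected outside the CLI. The sketch below is not part of squeez: it assumes only that each line is a JSON object, and because the record schema is not documented in these notes, it sums every top-level numeric field rather than guessing field names. The helper name sum_numeric_fields is hypothetical.

```python
import json
from collections import Counter
from pathlib import Path

# Path taken from the release note; the record schema is an unknown here.
STATS_PATH = Path.home() / ".cache" / "squeez" / "session_stats.jsonl"


def sum_numeric_fields(lines):
    """Sum every top-level numeric field across JSONL records.

    Returns (record_count, totals). Field names are not assumed;
    whatever numeric keys appear in the log are accumulated.
    """
    totals = Counter()
    records = 0
    for line in lines:
        line = line.strip()
        if not line:
            continue
        try:
            record = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip partially written or corrupt lines
        if not isinstance(record, dict):
            continue
        records += 1
        for key, value in record.items():
            # bool is a subclass of int, so exclude it explicitly
            if isinstance(value, (int, float)) and not isinstance(value, bool):
                totals[key] += value
    return records, dict(totals)


if __name__ == "__main__":
    if STATS_PATH.exists():
        with STATS_PATH.open(encoding="utf-8") as fh:
            count, totals = sum_numeric_fields(fh)
        print(f"{count} records")
        for key, total in sorted(totals.items()):
            print(f"{key}: {total}")
    else:
        print(f"No stats log found at {STATS_PATH}")
```

Fields that are not counters (for example a timestamp, if the log has one) will be summed too, so treat the output as a quick look and prefer --summary for the tool's own accumulated figures.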
Changed
- Docs reframed around two model types (generative vs extractive) instead of internal backend names.
Install
pip install --upgrade squeez