Skip to content

feat(candle-nn): QuantizedKvCache — INT8 KV cache with attention sinks (TurboQuant)#3577

Open
aryanputta wants to merge 2 commits into
huggingface:mainfrom
aryanputta:feat/turbo-quant-int8-kv-cache
Open

feat(candle-nn): QuantizedKvCache — INT8 KV cache with attention sinks (TurboQuant)#3577
aryanputta wants to merge 2 commits into
huggingface:mainfrom
aryanputta:feat/turbo-quant-int8-kv-cache

Commits

Commits on Jun 3, 2026