Pull requests: huggingface/candle
Fix causal attention in candle-flash-attn-v3: NULL tile_count_semaphore crash (dense) + silently non-causal varlen
#3606
opened Jun 11, 2026 by
Xueying-VirtueAI
fix(transformers): adds support for llamas head_dim config
#3602
opened Jun 11, 2026 by
suniastar
fix(kernels): Compilation errors on Turing Cards (Compute Cap 7.5)
#3601
opened Jun 11, 2026 by
suniastar
Fix 128x softmax_lse over-allocation in the dense flash-attn-v3 path
#3597
opened Jun 10, 2026 by
Xueying-VirtueAI
flash-attn: launch kernels on the caller's CUDA stream
#3596
opened Jun 10, 2026 by
jnises
Contributor
fix(gguf): bound tensor size against file before allocating
#3588
opened Jun 7, 2026 by
pjdurden
fix(qwen3): build causal mask batch-independently (#3582)
#3586
opened Jun 6, 2026 by
pjdurden
Remove redundant creations and operations on zero tensors in Tensor::backward()
#3578
opened Jun 4, 2026 by
TeunVerstraaten
feat(candle-nn): QuantizedKvCache — INT8 KV cache with attention sinks (TurboQuant)
#3577
opened Jun 3, 2026 by
aryanputta
fix(flash-attn): bump vendored FA2 kernels from Dec 2024 to v2.8.3
#3576
opened Jun 3, 2026 by
aryanputta
fix(core): validate shape element count in ShapeWithOneHole blanket impl (#3534)
#3572
opened May 30, 2026 by
oyoyo4556
feat: add candle-flash-attn-v4 crate for FlashAttention-4 support
#3561
opened May 22, 2026 by
PhoenixCPH
Improve Sequential based on ModuleT to make it more versatile.
#3560
opened May 22, 2026 by
Taswen
Add Mamba-3 GPU operator with SISO and MIMO inference support
#3557
opened May 22, 2026 by
PhoenixCPH
[Metal] Expose kernel_mul_mm_id wrapper and parallelize rowids fan-out
#3555
opened May 20, 2026 by
fiorelorenzo