Skip to content

Pull requests: huggingface/candle

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Optimize CUDA kernels
#3600 opened Jun 11, 2026 by guoqingbao Contributor Loading…
Metal: fix SDPA storage offsets
#3599 opened Jun 11, 2026 by EricLBuehler Member Loading…
Parameter cache for CUDA kernel launch
#3598 opened Jun 11, 2026 by guoqingbao Contributor Loading…
flash-attn: launch kernels on the caller's CUDA stream
#3596 opened Jun 10, 2026 by jnises Contributor Loading…
ONNX: Dequantize Linear operator implementation
#3592 opened Jun 7, 2026 by Kwasus33 Loading…
Add softmax_last_dim backward pass
#3591 opened Jun 7, 2026 by HueCodes Contributor Loading…
Support effective Gemma 4 text weights
#3587 opened Jun 6, 2026 by HueCodes Contributor Loading…
Harden GGUF loader bounds checks
#3583 opened Jun 5, 2026 by ryzhov-artem Loading…
Onnx nonzero implementation
#3571 opened May 28, 2026 by Kwasus33 Loading…
metal: implement upsample_nearest1d kernel support
#3563 opened May 24, 2026 by oglego Loading…
Add QTensor row dequantization helper
#3562 opened May 23, 2026 by rabbitson87 Loading…
ProTip! Mix and match filters to narrow down what you’re looking for.