Pull requests: huggingface/candle
Pull requests list
Optimization for CPU Causal Flash Attention (integrated into Qwen3)
#3254 opened Dec 20, 2025 by DrJesseGlass
add the momentum and Nesterov(NAG) support for SGD optimizer like pytorch
#3251 opened Dec 20, 2025 by donjuanplatinum
feat: implementation varlen-flash-attention on cpu
#3250 opened Dec 18, 2025 by michaelfeil
Add Mistral3 vision-language model support (For Flux2 Migration)
#3246 opened Dec 16, 2025 by SpenserCai · 5 tasks done
Add bilinear interpolation support (upsample_bilinear2d)
#3237 opened Dec 9, 2025 by SpenserCai
feat(qwen2): add KV cache management and selective attention
#3236 opened Dec 9, 2025 by danielclough
replace cutlass submodule references with explicit build step
#3234 opened Dec 8, 2025 by jacobgorm
Fix cudarc deprecation warnings for memcpy methods
#3228 opened Dec 5, 2025 by DrJesseGlass
feat: Add BertForTokenClassification for Named Entity Recognition in Rust
#3212 opened Nov 24, 2025 by MSkill1
Metal: bound temporary buffer cache and prevent runaway memory usage on large softmax/broadcast/matmul workloads
#3197 opened Nov 17, 2025 by TimmyOVO