Pull requests: hiyouga/LlamaFactory

[model] add gemma4_text template for text-only SFT/DPO training
#10362 opened Apr 7, 2026 by leivy-dev
1 of 2 tasks
replace chat.py pop(0) with deque.popleft()
#10356 opened Apr 5, 2026 by nameearly
1 task
[fix] parser json.load PosixPath bug
#10354 opened Apr 5, 2026 by Belle0918
2 tasks done
Add workflow for building ROCm image
#10330 opened Mar 30, 2026 by ErikJiang
fix: pin 12 unpinned action(s)
#10325 opened Mar 26, 2026 by dagecko
[WIP] Support huggingface/kernels
#10319 opened Mar 25, 2026 by zheliuyu Draft
2 tasks
fix: add qwen3_5_moe to MoE configuration in moe.py (label: invalid)
#10307 opened Mar 21, 2026 by majiayu000
[v1] add deepspeed zero3 trigger for low memory usage weight loading
#10300 opened Mar 19, 2026 by jiaqiw09
1 of 2 tasks
feat: clearer train_result metrics log through calculate_tps function
#10288 opened Mar 17, 2026 by UmeanNever
1 of 2 tasks
[V1]support resume training from checkpoint
#10280 opened Mar 13, 2026 by frozenleaves
feat: add LightOnOCR-2 integration for LoRA/QLoRA fine-tuning
#10192 opened Feb 16, 2026 by johnlockejrr
2 tasks
Fix memory leak on MPS by explicitly clearing cache in trainer step
#10190 opened Feb 14, 2026 by asebaq
1 of 2 tasks
[v1] Add hyperparams and training docs
#10188 opened Feb 13, 2026 by frozenleaves
[deps] Add libibverbs for RDMA support
#10185 opened Feb 12, 2026 by RossCZ
1 of 2 tasks
Feature: experimental fine-tuning comparison
#10172 opened Feb 6, 2026 by caterina0718
[feat] Add DeepSpeed ZeRO-3 LoRA checkpoint save support
#10124 opened Jan 22, 2026 by kimberlykang
2 tasks done
[model] support NVIDIA's Audio-Flamingo-3 audio model
#9740 opened Jan 9, 2026 by vovanphuc
4 tasks done
Add entropy logging for SFT training path
#9717 opened Jan 5, 2026 by pankd
Support loss_mask in dataset to control loss calculation for specific turns (label: solved)
#9630 opened Dec 18, 2025 by CjangCjengh
2 tasks