hiyouga / LlamaFactory Public

Notifications You must be signed in to change notification settings
Fork 8.5k
Star 69.7k

Code
Issues 923
Pull requests 28
Discussions
Actions
Security and quality 4
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Security and quality
Insights

Pull requests: hiyouga/LlamaFactory

Labels 13 Milestones 0

New pull request New

28 Open 1,224 Closed

Author

Filter by author

Uh oh!

There was an error while loading. Please reload this page.

Label

Filter by label

Uh oh!

There was an error while loading. Please reload this page.

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Uh oh!

There was an error while loading. Please reload this page.

Milestones

Filter by milestone

Uh oh!

There was an error while loading. Please reload this page.

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Uh oh!

There was an error while loading. Please reload this page.

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

[model] add gemma4_text template for text-only SFT/DPO training

#10362 opened Apr 7, 2026 by leivy-dev

Loading…

1 of 2 tasks

fix: use json.loads with Path.read_text() instead of json.load with Path

#10361 opened Apr 6, 2026 by satishkc7

Loading…

replace chat.py pop(0) with deque.popleft()

#10356 opened Apr 5, 2026 by nameearly

Loading…

1 task

[fix] parser json.load PosixPath bug

#10354 opened Apr 5, 2026 by Belle0918

Loading…

2 tasks done

[perf] Skip unused lm_head projection and hidden state storage in RM trainer

#10353 opened Apr 5, 2026 by tonywang1990

Loading…

4 tasks done

[ray] fix placement group over-allocation and NCCL hang on GPU-less head node

#10349 opened Apr 3, 2026 by ilover311

Loading…

2 tasks done

Add workflow for building ROCm image

#10330 opened Mar 30, 2026 by ErikJiang

Loading…

fix: pin 12 unpinned action(s)

#10325 opened Mar 26, 2026 by dagecko

Loading…

[WIP] Support huggingface/kernels

#10319 opened Mar 25, 2026 by zheliuyu • Draft

2 tasks

fix: add qwen3_5_moe to MoE configuration in moe.py invalid

This doesn't seem right

#10307 opened Mar 21, 2026 by majiayu000

Loading…

[v1] add deepspeed zero3 trigger for low memory usage weight loading

#10300 opened Mar 19, 2026 by jiaqiw09

Loading…

1 of 2 tasks

fix: mutable default arg and bool comparison

#10297 opened Mar 18, 2026 by LincolnBurrows2017

Loading…

feat: clearer train_result metrics log through calculate_tps function

#10288 opened Mar 17, 2026 by UmeanNever

Loading…

1 of 2 tasks

[V1]support resume training from checkpoint

#10280 opened Mar 13, 2026 by frozenleaves

Loading…

fix qwen3vl moe fuse on transformers 5.x and update docs about timeout

#10274 opened Mar 12, 2026 by addsubmuldiv

Loading…

2 tasks

feat: add LightOnOCR-2 integration for LoRA/QLoRA fine-tuning

#10192 opened Feb 16, 2026 by johnlockejrr

Loading…

2 tasks

Fix memory leak on MPS by explicitly clearing cache in trainer step

#10190 opened Feb 14, 2026 by asebaq

Loading…

1 of 2 tasks

[v1] Add hyperparams and training docs

#10188 opened Feb 13, 2026 by frozenleaves

Loading…

[deps] Add libibverbs for RDMA support

#10185 opened Feb 12, 2026 by RossCZ

Loading…

1 of 2 tasks

Feature: experimental fine-tuning comparison

#10172 opened Feb 6, 2026 by caterina0718

Loading…

[feat] Add DeepSpeed ZeRO-3 LoRA checkpoint save support

#10124 opened Jan 22, 2026 by kimberlykang

Loading…

2 tasks done

[model] support NVIDIA's Audio-Flamingo-3 audio model

#9740 opened Jan 9, 2026 by vovanphuc

Loading…

4 tasks done

[model] support LFM2.5-Audio with liquid_audio integration

#9733 opened Jan 8, 2026 by vovanphuc

Loading…

Add entropy logging for SFT training path

#9717 opened Jan 5, 2026 by pankd

Loading…

Support loss_mask in dataset to control loss calculation for specific turns solved

This problem has been already solved

#9630 opened Dec 18, 2025 by CjangCjengh

Loading…

2 tasks

Previous 1 2 Next

Previous Next

ProTip! Adding no:label will show everything without a label.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!