Skip to content

RUCAIBox/DualGuidanceOptimization

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

DualGuidanceOptimization

Dual Guidance Optimization (DGO) enables LLMs to learn from both external experience and internal parameter updates in a closed loop, improving reasoning through better experience utilization and internalization.

About

Dual Guidance Optimization (DGO) enables LLMs to learn from both external experience and internal parameter updates in a closed loop, improving reasoning through better experience utilization and internalization.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors