feat(pt): optimize HybridMuon by borrowing some ideas from deepseek v4 paper #5424
+133
−50
Codecov / codecov/project/Python
succeeded
Apr 27, 2026 in 1s
86.46% (+0.04%) compared to 9d63816