Has anyone tried implementing the LiMuon Optimizer?

DoktorMike · September 29, 2025, 8:41am

Has anyone tried implementing the LiMuon optimizer?

I opened an issue here in the Optimisers.jl package, but thought I’d ask here as well: has anyone implemented it elsewhere? Any hands-on experience with the method itself would also be interesting to hear about.

According to the paper, it beats AdamW on a few benchmarks, both in training/testing error and in convergence speed.

txenakis · November 6, 2025, 10:17pm

It would be interesting to implement it. “Normal” Muon hasn’t been implemented either, right?

DoktorMike · November 7, 2025, 8:33pm

I have not seen it, at least.
