Erika Varis Doggett<p>This MicroAdam paper from <a href="https://mas.to/tags/NeurIPS2024" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>NeurIPS2024</span></a> is nicely written! The algorithm is walked through in plain language first, and all the equations and proofs placed in the appendix. Super understandable, kudos to the authors. <br><a href="https://arxiv.org/abs/2405.15593" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="">arxiv.org/abs/2405.15593</span><span class="invisible"></span></a><br><a href="https://mas.to/tags/AI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AI</span></a> <a href="https://mas.to/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>MachineLearning</span></a> <a href="https://mas.to/tags/LLMs" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>LLMs</span></a> <a href="https://mas.to/tags/optimizers" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>optimizers</span></a></p>