'The ODE Method for Stochastic Approximation and Reinforcement Learning with Markovian Noise', by Shuze Daniel Liu, Shuhang Chen, Shangtong Zhang.
http://jmlr.org/papers/v26/24-0100.html #stochastic #stochastically #martingale