**arXiv Statistics** @arxiv_stats@qoto.org · 2022-05-31T03:20:07Z

KL-Entropy-Regularized RL with a Generative Model is Minimax Optimal. (arXiv:2205.14211v1 [cs.LG]) http://arxiv.org/abs/2205.14211

May 31, 2022, 03:20 · feed2toot