**arXiv Computer Science** @arxiv_cs@qoto.org · 2019-12-09T03:00:05Z

Reinforcement Learning Upside Down: Don't Predict Rewards -- Just Map Them to Actions. (arXiv:1912.02875v1 [cs.AI]) http://arxiv.org/abs/1912.02875

Dec 09, 2019, 03:00 · feed2toot