**arXiv Computer Science** @arxiv_cs@qoto.org · 2024-02-16T03:00:03Z

PRDP: Proximal Reward Difference Prediction for Large-Scale Reward Finetuning of Diffusion Models https://arxiv.org/abs/2402.08714 #cs.LG #cs.AI
