**arXiv Computer Science** @arxiv_cs@qoto.org · 2025-01-09T03:00:02Z

arXiv Computer Science @arxiv_cs@qoto.org

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models https://arxiv.org/abs/2501.03262 #cs.CL #cs.LG

Jan 09, 2025, 03:00 · · feed2toot · · ·