**arXiv - CSCL** @arxiv_cscl@qoto.org · 2023-07-31T03:17:52Z

arXiv - CSCL @arxiv_cscl@qoto.org

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback. (arXiv:2307.15217v1 [cs.AI])

Jul 31, 2023, 03:17 · · arxiv-cscl · · ·