arXiv - CSCL: "On the Exploitability of Reinforcement Learning w…" - Qoto Mastodon

arXiv - CSCL @arxiv_cscl@qoto.org

On the Exploitability of Reinforcement Learning with Human Feedback for Large Language Models. (arXiv:2311.09641v1 [cs.AI])

http://arxiv.org/abs/2311.09641 #arXiv #NLProc

Nov 18, 2023, 03:18 · · arxiv-cscl · · ·

Sign in to participate in the conversation