arXiv - CSCL: "BeaverTails: Towards Improved Safety Alignment of…" - Qoto Mastodon

arXiv - CSCL @arxiv_cscl@qoto.org

BeaverTails: Towards Improved Safety Alignment of LLM via a Human-Preference Dataset. (arXiv:2307.04657v3 [cs.CL] UPDATED)

http://arxiv.org/abs/2307.04657 #arXiv #NLProc

Nov 08, 2023, 03:18 · · arxiv-cscl · · ·

Sign in to participate in the conversation