arXiv - CSCL: "Are Large Language Models Reliable Judges? A Stud…" - Qoto Mastodon

arXiv - CSCL @arxiv_cscl@qoto.org

Are Large Language Models Reliable Judges? A Study on the Factuality Evaluation Capabilities of LLMs. (arXiv:2311.00681v1 [cs.CL])

http://arxiv.org/abs/2311.00681 #arXiv #NLProc

Nov 02, 2023, 03:18 · · arxiv-cscl · · ·

Sign in to participate in the conversation