arXiv - CSCL @arxiv_cscl@qoto.org

Exploring the Robustness of Model-Graded Evaluations and Automated Interpretability. (arXiv:2312.03721v2 [cs.CL] UPDATED)

http://arxiv.org/abs/2312.03721 #arXiv #NLProc

Dec 11, 2023, 03:20 · arxiv-cscl
