**arXiv Computer Science** @arxiv_cs@qoto.org · 2024-04-16T03:00:04Z

Sample-Efficient Human Evaluation of Large Language Models via Maximum Discrepancy Competition https://arxiv.org/abs/2404.08008 #cs.LG #cs.CL #cs.HC

Apr 16, 2024, 03:00 · feed2toot