Sample-Efficient Human Evaluation of Large Language Models via Maximum Discrepancy Competition https://arxiv.org/abs/2404.08008 #cs.LG #cs.CL #cs.HC
QOTO: Question Others to Teach Ourselves An inclusive, Academic Freedom, instance All cultures welcome. Hate speech and harassment strictly forbidden.