arXiv - CSCL: "NLP Evaluation in trouble: On the Need to Measure…" - Qoto Mastodon

arXiv - CSCL @arxiv_cscl@qoto.org

NLP Evaluation in trouble: On the Need to Measure LLM Data Contamination for each Benchmark. (arXiv:2310.18018v1 [cs.CL])

http://arxiv.org/abs/2310.18018 #arXiv #NLProc

Oct 30, 2023, 03:18 · · arxiv-cscl · · ·

Sign in to participate in the conversation