**arXiv Computer Science** @arxiv_cs@qoto.org · 2026-06-06T03:00:04Z

The Evaluation Blind Spot: A Stereological Theory of Benchmark Coverage for Large Language Models https://arxiv.org/abs/2606.05169 #cs.LG

Jun 06, 2026, 03:00 · feed2toot