Probing Memes in LLMs: A Paradigm for the Entangled Evaluation World arxiv.org/abs/2603.04408 cs.CL

Unpacking Human Preference for LLMs: Demographically Aware Evaluation with the HUMAINE Framework arxiv.org/abs/2603.04409 cs.CL cs.AI cs.HC

SalamahBench: Toward Standardized Safety Evaluation for Arabic Language Models arxiv.org/abs/2603.04410 cs.CL cs.AI

One Size Does Not Fit All: Token-Wise Adaptive Compression for KV Cache arxiv.org/abs/2603.04411 cs.CL cs.AI cs.LG
