Show newer

Enhancing Safety of Large Language Models via Embedding Space Separation arxiv.org/abs/2603.20206 .CL .AI

Children's Intelligence Tests Pose Challenges for MLLMs? KidGym: A 2D Grid-Based Reasoning Benchmark for MLLMs arxiv.org/abs/2603.20209 .CL .AI

CRoCoDiL: Continuous and Robust Conditioned Diffusion for Language arxiv.org/abs/2603.20210 .CL .AI

Exploring Teacher-Chatbot Interaction and Affect in Block-Based Programming arxiv.org/abs/2603.20211 .CY .AI

Engineering-Oriented Symbolic Regression: LLMs as Physics Agents for Discovery of Simulation-Ready Constitutive Laws arxiv.org/abs/2603.19241 .comp-ph .app-ph .CE .SC

How international are international computing conferences? -- An exploration with systems research conferences arxiv.org/abs/2603.19245 .OH

When Prompt Optimization Becomes Jailbreaking: Adaptive Red-Teaming of Large Language Models arxiv.org/abs/2603.19247 .CL .AI

Do Large Language Models Possess a Theory of Mind? A Comparative Evaluation Using the Strange Stories Paradigm arxiv.org/abs/2603.18007 .CL .AI

TherapyGym: Evaluating and Aligning Clinical Fidelity and Safety in Therapy Chatbots arxiv.org/abs/2603.18008 .CL .AI .CY

How Confident Is the First Token? An Uncertainty-Calibrated Prompt Optimization Framework for Large Language Model Classification and Understanding arxiv.org/abs/2603.18009 .CL .AI

Controllable Evidence Selection in Retrieval-Augmented Question Answering via Deterministic Utility Gating arxiv.org/abs/2603.18011 .CL .IR

DynaRAG: Bridging Static and Dynamic Knowledge in Retrieval-Augmented Generation arxiv.org/abs/2603.18012 .CL .AI .IR

Learned but Not Expressed: Capability-Expression Dissociation in Large Language Models arxiv.org/abs/2603.18013 .CL

Real-Time Trustworthiness Scoring for LLM Structured Outputs and Data Extraction arxiv.org/abs/2603.18014 .CL .LG

Beyond Accuracy: An Explainability-Driven Analysis of Harmful Content Detection arxiv.org/abs/2603.18015 .CL .AI

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.