Engineering-Oriented Symbolic Regression: LLMs as Physics Agents for Discovery of Simulation-Ready Constitutive Laws arxiv.org/abs/2603.19241 .comp-ph .app-ph .CE .SC

How international are international computing conferences? -- An exploration with systems research conferences arxiv.org/abs/2603.19245 .OH

When Prompt Optimization Becomes Jailbreaking: Adaptive Red-Teaming of Large Language Models arxiv.org/abs/2603.19247 .CL .AI

Do Large Language Models Possess a Theory of Mind? A Comparative Evaluation Using the Strange Stories Paradigm arxiv.org/abs/2603.18007 .CL .AI

TherapyGym: Evaluating and Aligning Clinical Fidelity and Safety in Therapy Chatbots arxiv.org/abs/2603.18008 .CL .AI .CY

How Confident Is the First Token? An Uncertainty-Calibrated Prompt Optimization Framework for Large Language Model Classification and Understanding arxiv.org/abs/2603.18009 .CL .AI

Controllable Evidence Selection in Retrieval-Augmented Question Answering via Deterministic Utility Gating arxiv.org/abs/2603.18011 .CL .IR

DynaRAG: Bridging Static and Dynamic Knowledge in Retrieval-Augmented Generation arxiv.org/abs/2603.18012 .CL .AI .IR

Learned but Not Expressed: Capability-Expression Dissociation in Large Language Models arxiv.org/abs/2603.18013 .CL

Real-Time Trustworthiness Scoring for LLM Structured Outputs and Data Extraction arxiv.org/abs/2603.18014 .CL .LG

Beyond Accuracy: An Explainability-Driven Analysis of Harmful Content Detection arxiv.org/abs/2603.18015 .CL .AI

Trust, Safety, and Accuracy: Assessing LLMs for Routine Maternity Advice arxiv.org/abs/2603.16872 .CL .CY

The Truth, the Whole Truth, and Nothing but the Truth: Automatic Visualization Evaluation from Reconstruction Quality arxiv.org/abs/2603.16873 .HC .CV

Disclosure By Design: Identity Transparency as a Behavioural Property of Conversational AI Models arxiv.org/abs/2603.16874 .HC .AI

Attention Guidance through Video Script: A Case Study of Object Focusing on 360° VR Video Tours arxiv.org/abs/2603.16875 .HC .AI

Multi-Modal Multi-Agent Reinforcement Learning for Radiology Report Generation: Radiologist-Like Workflow with Clinically Verifiable Rewards arxiv.org/abs/2603.16876 .CV .AI .LG

