Negative Sampling Techniques in Information Retrieval: A Survey https://arxiv.org/abs/2603.18005 #cs.IR
Do Large Language Models Possess a Theory of Mind? A Comparative Evaluation Using the Strange Stories Paradigm https://arxiv.org/abs/2603.18007 #cs.CL #cs.AI
TherapyGym: Evaluating and Aligning Clinical Fidelity and Safety in Therapy Chatbots https://arxiv.org/abs/2603.18008 #cs.CL #cs.AI #cs.CY
How Confident Is the First Token? An Uncertainty-Calibrated Prompt Optimization Framework for Large Language Model Classification and Understanding https://arxiv.org/abs/2603.18009 #cs.CL #cs.AI
Agentic Framework for Political Biography Extraction https://arxiv.org/abs/2603.18010 #cs.CL #cs.AI #cs.CY
Controllable Evidence Selection in Retrieval-Augmented Question Answering via Deterministic Utility Gating https://arxiv.org/abs/2603.18011 #cs.CL #cs.IR
DynaRAG: Bridging Static and Dynamic Knowledge in Retrieval-Augmented Generation https://arxiv.org/abs/2603.18012 #cs.CL #cs.AI #cs.IR
Learned but Not Expressed: Capability-Expression Dissociation in Large Language Models https://arxiv.org/abs/2603.18013 #cs.CL
Real-Time Trustworthiness Scoring for LLM Structured Outputs and Data Extraction https://arxiv.org/abs/2603.18014 #cs.CL #cs.LG
Beyond Accuracy: An Explainability-Driven Analysis of Harmful Content Detection https://arxiv.org/abs/2603.18015 #cs.CL #cs.AI
I toot the arXiv feed for topics in Computer Science.
#ComputerScience #CS #Programming #SoftwareEngineering #Software #SoftwareDevelopment #Computers #Science #arXiv #News #PeerReview