HuatuoGPT-II, One-stage Training for Medical Adaption of LLMs. (arXiv:2311.09774v1 [cs.CL])
To be or not to be? an exploration of continuously controllable prompt engineering. (arXiv:2311.09773v1 [cs.CL])
LLMs as Narcissistic Evaluators: When Ego Inflates Evaluation Scores. (arXiv:2311.09766v1 [cs.CL])
Test-time Backdoor Mitigation for Black-Box Large Language Models with Defensive Demonstrations. (arXiv:2311.09763v1 [cs.CL])
Graph-Guided Reasoning for Multi-Hop Question Answering in Large Language Models. (arXiv:2311.09762v1 [cs.CL])
MAFALDA: A Benchmark and Comprehensive Study of Fallacy Detection and Classification. (arXiv:2311.09761v1 [cs.CL])
OrchestraLLM: Efficient Orchestration of Language Models for Dialogue State Tracking. (arXiv:2311.09758v1 [cs.CL])
FairytaleCQA: Integrating a Commonsense Knowledge Graph into Children's Storybook Narratives. (arXiv:2311.09756v1 [cs.CL])
How Does Calibration Data Affect the Post-training Pruning and Quantization of Large Language Models?. (arXiv:2311.09755v1 [cs.CL])
Translation Aligned Sentence Embeddings for Turkish Language. (arXiv:2311.09748v1 [cs.CL])
Capturing Perspectives of Crowdsourced Annotators in Subjective Learning Tasks. (arXiv:2311.09743v1 [cs.CL])
What Constitutes a Faithful Summary? Preserving Author Perspectives in News Summarization. (arXiv:2311.09741v1 [cs.CL])
CARE: Extracting Experimental Findings From Clinical Literature. (arXiv:2311.09736v1 [cs.CL])
Tracking the Newsworthiness of Public Documents. (arXiv:2311.09734v1 [cs.CL])
MOKA: Moral Knowledge Augmentation for Moral Event Extraction. (arXiv:2311.09733v1 [cs.CL])
Source Prompt: Coordinated Pre-training of Language Models on Diverse Corpora from Multiple Sources. (arXiv:2311.09732v1 [cs.CL])
Prudent Silence or Foolish Babble? Examining Large Language Models' Responses to the Unknown. (arXiv:2311.09731v1 [cs.CL])
Aligning with Whom? Large Language Models Have Gender and Racial Biases in Subjective NLP Tasks. (arXiv:2311.09730v1 [cs.CL])
Outcome-supervised Verifiers for Planning in Mathematical Reasoning. (arXiv:2311.09724v1 [cs.AI])
On Evaluating the Integration of Reasoning and Action in LLM Agents with Database Question Answering. (arXiv:2311.09721v1 [cs.CL])
All recent Computation and Language articles on arXiv.org for the Fediverse
Inspired by https://twitter.com/arxiv_cscl