Graph-Guided Reasoning for Multi-Hop Question Answering in Large Language Models. (arXiv:2311.09762v1 [cs.CL])
MAFALDA: A Benchmark and Comprehensive Study of Fallacy Detection and Classification. (arXiv:2311.09761v1 [cs.CL])
OrchestraLLM: Efficient Orchestration of Language Models for Dialogue State Tracking. (arXiv:2311.09758v1 [cs.CL])
FairytaleCQA: Integrating a Commonsense Knowledge Graph into Children's Storybook Narratives. (arXiv:2311.09756v1 [cs.CL])
How Does Calibration Data Affect the Post-training Pruning and Quantization of Large Language Models?. (arXiv:2311.09755v1 [cs.CL])
Translation Aligned Sentence Embeddings for Turkish Language. (arXiv:2311.09748v1 [cs.CL])
Capturing Perspectives of Crowdsourced Annotators in Subjective Learning Tasks. (arXiv:2311.09743v1 [cs.CL])
What Constitutes a Faithful Summary? Preserving Author Perspectives in News Summarization. (arXiv:2311.09741v1 [cs.CL])
CARE: Extracting Experimental Findings From Clinical Literature. (arXiv:2311.09736v1 [cs.CL])
Tracking the Newsworthiness of Public Documents. (arXiv:2311.09734v1 [cs.CL])
MOKA: Moral Knowledge Augmentation for Moral Event Extraction. (arXiv:2311.09733v1 [cs.CL])
Source Prompt: Coordinated Pre-training of Language Models on Diverse Corpora from Multiple Sources. (arXiv:2311.09732v1 [cs.CL])
Prudent Silence or Foolish Babble? Examining Large Language Models' Responses to the Unknown. (arXiv:2311.09731v1 [cs.CL])
Aligning with Whom? Large Language Models Have Gender and Racial Biases in Subjective NLP Tasks. (arXiv:2311.09730v1 [cs.CL])
Outcome-supervised Verifiers for Planning in Mathematical Reasoning. (arXiv:2311.09724v1 [cs.AI])
On Evaluating the Integration of Reasoning and Action in LLM Agents with Database Question Answering. (arXiv:2311.09721v1 [cs.CL])
You don't need a personality test to know these models are unreliable: Assessing the Reliability of Large Language Models on Psychometric Instruments. (arXiv:2311.09718v1 [cs.CL])
Regularized Conventions: Equilibrium Computation as a Model of Pragmatic Reasoning. (arXiv:2311.09712v1 [cs.CL])
Large Language Model Inference with Lexical Shortlisting. (arXiv:2311.09709v1 [cs.CL])
A Self-enhancement Multitask Framework for Unsupervised Aspect Category Detection. (arXiv:2311.09708v1 [cs.CL])
All recent Computation and Language articles on arXiv.org for the Fediverse
Inspired by https://twitter.com/arxiv_cscl