Inference-Time Intervention: Eliciting Truthful Answers from a Language Model. (arXiv:2306.03341v5 [cs.LG] UPDATED)
Handling Realistic Label Noise in BERT Text Classification. (arXiv:2305.16337v2 [cs.CL] UPDATED)
Language Model Tokenizers Introduce Unfairness Between Languages. (arXiv:2305.15425v2 [cs.CL] UPDATED)
Testing the General Deductive Reasoning Capacity of Large Language Models Using OOD Examples. (arXiv:2305.15269v2 [cs.CL] UPDATED)
Spoken Question Answering and Speech Continuation Using Spectrogram-Powered LLM. (arXiv:2305.15255v3 [cs.CL] UPDATED)
A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis. (arXiv:2305.15054v2 [cs.CL] UPDATED)
CAR: Conceptualization-Augmented Reasoner for Zero-Shot Commonsense Question Answering. (arXiv:2305.14869v2 [cs.CL] UPDATED)
Getting MoRE out of Mixture of Language Model Reasoning Experts. (arXiv:2305.14628v2 [cs.CL] UPDATED)
Learning Semantic Role Labeling from Compatible Label Sequences. (arXiv:2305.14600v3 [cs.CL] UPDATED)
Query Rewriting for Retrieval-Augmented Large Language Models. (arXiv:2305.14283v2 [cs.CL] UPDATED)
Question Answering as Programming for Solving Time-Sensitive Questions. (arXiv:2305.14221v3 [cs.CL] UPDATED)
Enhancing Black-Box Few-Shot Text Classification with Prompt-Based Data Augmentation. (arXiv:2305.13785v2 [cs.CL] UPDATED)
PIEClass: Weakly-Supervised Text Classification with Prompting and Noise-Robust Iterative Ensemble Training. (arXiv:2305.13723v2 [cs.CL] UPDATED)
Prompt-Based Monte-Carlo Tree Search for Goal-Oriented Dialogue Policy Planning. (arXiv:2305.13660v2 [cs.CL] UPDATED)
ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue. (arXiv:2305.13602v2 [cs.CL] UPDATED)
BioDEX: Large-Scale Biomedical Adverse Drug Event Extraction for Real-World Pharmacovigilance. (arXiv:2305.13395v2 [cs.CL] UPDATED)
Can LLMs facilitate interpretation of pre-trained language models? (arXiv:2305.13386v2 [cs.CL] UPDATED)
Towards Unsupervised Recognition of Token-level Semantic Differences in Related Documents. (arXiv:2305.13303v3 [cs.CL] UPDATED)
SimCSE++: Improving Contrastive Learning for Sentence Embeddings from Two Perspectives. (arXiv:2305.13192v2 [cs.CL] UPDATED)
SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables. (arXiv:2305.13186v2 [cs.CL] UPDATED)
All recent Computation and Language articles on arXiv.org for the Fediverse
Inspired by https://twitter.com/arxiv_cscl