Inference-Time Intervention: Eliciting Truthful Answers from a Language Model. (arXiv:2306.03341v5 [cs.LG] UPDATED) 

Handling Realistic Label Noise in BERT Text Classification. (arXiv:2305.16337v2 [cs.CL] UPDATED) 

Language Model Tokenizers Introduce Unfairness Between Languages. (arXiv:2305.15425v2 [cs.CL] UPDATED) 

Testing the General Deductive Reasoning Capacity of Large Language Models Using OOD Examples. (arXiv:2305.15269v2 [cs.CL] UPDATED) 

Spoken Question Answering and Speech Continuation Using Spectrogram-Powered LLM. (arXiv:2305.15255v3 [cs.CL] UPDATED) 

A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis. (arXiv:2305.15054v2 [cs.CL] UPDATED) 

CAR: Conceptualization-Augmented Reasoner for Zero-Shot Commonsense Question Answering. (arXiv:2305.14869v2 [cs.CL] UPDATED) 

Getting MoRE out of Mixture of Language Model Reasoning Experts. (arXiv:2305.14628v2 [cs.CL] UPDATED) 

Learning Semantic Role Labeling from Compatible Label Sequences. (arXiv:2305.14600v3 [cs.CL] UPDATED) 

Query Rewriting for Retrieval-Augmented Large Language Models. (arXiv:2305.14283v2 [cs.CL] UPDATED) 

Question Answering as Programming for Solving Time-Sensitive Questions. (arXiv:2305.14221v3 [cs.CL] UPDATED) 

Enhancing Black-Box Few-Shot Text Classification with Prompt-Based Data Augmentation. (arXiv:2305.13785v2 [cs.CL] UPDATED) 

PIEClass: Weakly-Supervised Text Classification with Prompting and Noise-Robust Iterative Ensemble Training. (arXiv:2305.13723v2 [cs.CL] UPDATED) 

Prompt-Based Monte-Carlo Tree Search for Goal-Oriented Dialogue Policy Planning. (arXiv:2305.13660v2 [cs.CL] UPDATED) 

ReSee: Responding through Seeing Fine-grained Visual Knowledge in Open-domain Dialogue. (arXiv:2305.13602v2 [cs.CL] UPDATED) 

BioDEX: Large-Scale Biomedical Adverse Drug Event Extraction for Real-World Pharmacovigilance. (arXiv:2305.13395v2 [cs.CL] UPDATED) 

Can LLMs facilitate interpretation of pre-trained language models? (arXiv:2305.13386v2 [cs.CL] UPDATED)

Towards Unsupervised Recognition of Token-level Semantic Differences in Related Documents. (arXiv:2305.13303v3 [cs.CL] UPDATED) 

SimCSE++: Improving Contrastive Learning for Sentence Embeddings from Two Perspectives. (arXiv:2305.13192v2 [cs.CL] UPDATED) 

SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables. (arXiv:2305.13186v2 [cs.CL] UPDATED) 
