Think-in-Memory: Recalling and Post-thinking Enable LLMs with Long-Term Memory. (arXiv:2311.08719v1 [cs.CL])
Decomposing Uncertainty for Large Language Models through Input Clarification Ensembling. (arXiv:2311.08718v1 [cs.CL])
PLUG: Leveraging Pivot Language in Cross-Lingual Instruction Tuning. (arXiv:2311.08711v1 [cs.CL])
Evaluating Robustness of Dialogue Summarization Models in the Presence of Naturally Occurring Variations. (arXiv:2311.08705v1 [cs.CL])
Can Large Language Models Follow Concept Annotation Guidelines? A Case Study on Scientific and Financial Domains. (arXiv:2311.08704v1 [cs.CL])
Debate Helps Supervise Unreliable Experts. (arXiv:2311.08702v1 [cs.AI])
Attribute Diversity Determines the Systematicity Gap in VQA. (arXiv:2311.08695v1 [cs.LG])
Routing to the Expert: Efficient Reward-guided Ensemble of Large Language Models. (arXiv:2311.08692v1 [cs.CL])
An Eye on Clinical BERT: Investigating Language Model Generalization for Diabetic Eye Disease Phenotyping. (arXiv:2311.08687v1 [cs.CL])
Safer-Instruct: Aligning Language Models with Automated Preference Data. (arXiv:2311.08685v1 [cs.CL])
Understanding Calibration for Multilingual Question Answering Models. (arXiv:2311.08669v1 [cs.CL])
It Takes Two to Negotiate: Modeling Social Exchange in Online Multiplayer Games. (arXiv:2311.08666v1 [cs.CL])
Multi-Set Inoculation: Assessing Model Robustness Across Multiple Challenge Sets. (arXiv:2311.08662v1 [cs.CL])
Explore Spurious Correlations at the Concept Level in Language Models for Text Classification. (arXiv:2311.08648v1 [cs.CL])
Multistage Collaborative Knowledge Distillation from Large Language Models. (arXiv:2311.08640v1 [cs.CL])
Formal Proofs as Structured Explanations: Proposing Several Tasks on Explainable Natural Language Inference. (arXiv:2311.08637v1 [cs.CL])
DEED: Dynamic Early Exit on Decoder for Accelerating Encoder-Decoder Transformer Models. (arXiv:2311.08623v1 [cs.CV])
Multiple-Question Multiple-Answer Text-VQA. (arXiv:2311.08622v1 [cs.CV])
Toucan: Token-Aware Character Level Language Modeling. (arXiv:2311.08620v1 [cs.CL])
XplainLLM: A QA Explanation Dataset for Understanding LLM Decision-Making. (arXiv:2311.08614v1 [cs.CL])
All recent Computation and Language articles on arXiv.org, for the Fediverse
Inspired by https://twitter.com/arxiv_cscl