LongQLoRA: Efficient and Effective Method to Extend Context Length of Large Language Models. (arXiv:2311.04879v2 [cs.CL] UPDATED)
Massive Editing for Large Language Models via Meta Learning. (arXiv:2311.04661v2 [cs.CL] UPDATED)
Aspects of human memory and Large Language Models. (arXiv:2311.03839v2 [cs.CL] UPDATED)
Multilingual Mathematical Autoformalization. (arXiv:2311.03755v2 [cs.CL] UPDATED)
Principles from Clinical Research for NLP Model Generalization. (arXiv:2311.03663v2 [cs.CL] UPDATED)
Citance-Contextualized Summarization of Scientific Papers. (arXiv:2311.02408v2 [cs.CL] UPDATED)
FaMeSumm: Investigating and Improving Faithfulness of Medical Summarization. (arXiv:2311.02271v2 [cs.CL] UPDATED)
Ensemble of Task-Specific Language Models for Brain Encoding. (arXiv:2310.15720v2 [cs.CL] UPDATED)
Is ChatGPT a game changer for geocoding -- a benchmark for geocoding address parsing techniques. (arXiv:2310.14360v2 [cs.CL] UPDATED)
Bridging Information-Theoretic and Geometric Compression in Language Models. (arXiv:2310.13620v2 [cs.CL] UPDATED)
Cross-Lingual Consistency of Factual Knowledge in Multilingual Language Models. (arXiv:2310.10378v4 [cs.CL] UPDATED)
Chameleon: a heterogeneous and disaggregated accelerator system for retrieval-augmented language models. (arXiv:2310.09949v2 [cs.LG] UPDATED)
Strahler Number of Natural Language Sentences in Comparison with Random Trees. (arXiv:2307.02697v3 [cs.CL] UPDATED)
Surveying (Dis)Parities and Concerns of Compute Hungry NLP Research. (arXiv:2306.16900v2 [cs.CL] UPDATED)
Quantizable Transformers: Removing Outliers by Helping Attention Heads Do Nothing. (arXiv:2306.12929v2 [cs.LG] UPDATED)
Large Language Models are Fixated by Red Herrings: Exploring Creative Problem Solving and Einstellung Effect using the Only Connect Wall Dataset. (arXiv:2306.11167v4 [cs.CL] UPDATED)
Advancements in Arabic Grammatical Error Detection and Correction: An Empirical Investigation. (arXiv:2305.14734v2 [cs.CL] UPDATED)
Let's Think Frame by Frame with VIP: A Video Infilling and Prediction Dataset for Evaluating Video Chain-of-Thought. (arXiv:2305.13903v3 [cs.CL] UPDATED)
Leveraging Human Feedback to Scale Educational Datasets: Combining Crowdworkers and Comparative Judgement. (arXiv:2305.12894v2 [cs.CL] UPDATED)
Sabiá: Portuguese Large Language Models. (arXiv:2304.07880v4 [cs.CL] UPDATED)
All recent Computation and Language articles on arXiv.org for the Fediverse
Inspired by https://twitter.com/arxiv_cscl