Show newer

LongQLoRA: Efficient and Effective Method to Extend Context Length of Large Language Models. (arXiv:2311.04879v2 [cs.CL] UPDATED) 

Massive Editing for Large Language Models via Meta Learning. (arXiv:2311.04661v2 [cs.CL] UPDATED) 

Aspects of human memory and Large Language Models. (arXiv:2311.03839v2 [cs.CL] UPDATED) 

Multilingual Mathematical Autoformalization. (arXiv:2311.03755v2 [cs.CL] UPDATED) 

Principles from Clinical Research for NLP Model Generalization. (arXiv:2311.03663v2 [cs.CL] UPDATED) 

Citance-Contextualized Summarization of Scientific Papers. (arXiv:2311.02408v2 [cs.CL] UPDATED) 

FaMeSumm: Investigating and Improving Faithfulness of Medical Summarization. (arXiv:2311.02271v2 [cs.CL] UPDATED) 

Ensemble of Task-Specific Language Models for Brain Encoding. (arXiv:2310.15720v2 [cs.CL] UPDATED) 

Is ChatGPT a game changer for geocoding -- a benchmark for geocoding address parsing techniques. (arXiv:2310.14360v2 [cs.CL] UPDATED) 

Bridging Information-Theoretic and Geometric Compression in Language Models. (arXiv:2310.13620v2 [cs.CL] UPDATED) 

Cross-Lingual Consistency of Factual Knowledge in Multilingual Language Models. (arXiv:2310.10378v4 [cs.CL] UPDATED) 

Chameleon: a heterogeneous and disaggregated accelerator system for retrieval-augmented language models. (arXiv:2310.09949v2 [cs.LG] UPDATED) 

Strahler Number of Natural Language Sentences in Comparison with Random Trees. (arXiv:2307.02697v3 [cs.CL] UPDATED) 

Surveying (Dis)Parities and Concerns of Compute Hungry NLP Research. (arXiv:2306.16900v2 [cs.CL] UPDATED) 

Quantizable Transformers: Removing Outliers by Helping Attention Heads Do Nothing. (arXiv:2306.12929v2 [cs.LG] UPDATED) 

Large Language Models are Fixated by Red Herrings: Exploring Creative Problem Solving and Einstellung Effect using the Only Connect Wall Dataset. (arXiv:2306.11167v4 [cs.CL] UPDATED) 

Advancements in Arabic Grammatical Error Detection and Correction: An Empirical Investigation. (arXiv:2305.14734v2 [cs.CL] UPDATED) 

Let's Think Frame by Frame with VIP: A Video Infilling and Prediction Dataset for Evaluating Video Chain-of-Thought. (arXiv:2305.13903v3 [cs.CL] UPDATED) 

Leveraging Human Feedback to Scale Educational Datasets: Combining Crowdworkers and Comparative Judgement. (arXiv:2305.12894v2 [cs.CL] UPDATED) 

Sabi\'a: Portuguese Large Language Models. (arXiv:2304.07880v4 [cs.CL] UPDATED) 

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.