MathVista: Evaluating Math Reasoning in Visual Contexts with GPT-4V, Bard, and Other Large Multimodal Models. (arXiv:2310.02255v2 [cs.CV] UPDATED)
Multilingual Natural Language Processing Model for Radiology Reports -- The Summary is all you need!. (arXiv:2310.00100v3 [cs.CL] UPDATED)
What are Public Concerns about ChatGPT? A Novel Self-Supervised Neural Topic Model Tells You. (arXiv:2309.01522v2 [cs.CL] UPDATED)
Large Content And Behavior Models To Understand, Simulate, And Optimize Content And Behavior. (arXiv:2309.00359v3 [cs.CL] UPDATED)
Lexical Diversity in Kinship Across Languages and Dialects. (arXiv:2308.13056v2 [cs.CL] UPDATED)
A Knowledge-enhanced Two-stage Generative Framework for Medical Dialogue Information Extraction. (arXiv:2307.16200v3 [cs.CL] UPDATED)
AlpaGasus: Training A Better Alpaca with Fewer Data. (arXiv:2307.08701v3 [cs.CL] UPDATED)
Large Language Models as General Pattern Machines. (arXiv:2307.04721v2 [cs.AI] UPDATED)
CARE-MI: Chinese Benchmark for Misinformation Evaluation in Maternity and Infant Care. (arXiv:2307.01458v4 [cs.CL] UPDATED)
DocumentNet: Bridging the Data Gap in Document Pre-Training. (arXiv:2306.08937v3 [cs.CL] UPDATED)
Revisiting Out-of-distribution Robustness in NLP: Benchmark, Analysis, and LLMs Evaluations. (arXiv:2306.04618v2 [cs.CL] UPDATED)
Scaling Data-Constrained Language Models. (arXiv:2305.16264v4 [cs.CL] UPDATED)
Bhasha-Abhijnaanam: Native-script and romanized Language Identification for 22 Indic languages. (arXiv:2305.15814v3 [cs.CL] UPDATED)
Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models. (arXiv:2305.15080v2 [cs.CL] UPDATED)
AutoPlan: Automatic Planning of Interactive Decision-Making Tasks With Large Language Models. (arXiv:2305.15064v3 [cs.CL] UPDATED)
Editing Common Sense in Transformers. (arXiv:2305.14956v3 [cs.CL] UPDATED)
Coverage-based Example Selection for In-Context Learning. (arXiv:2305.14907v2 [cs.CL] UPDATED)
Estimating Large Language Model Capabilities without Labeled Test Data. (arXiv:2305.14802v2 [cs.CL] UPDATED)
DecipherPref: Analyzing Influential Factors in Human Preference Judgments via GPT-4. (arXiv:2305.14702v2 [cs.CL] UPDATED)
What Else Do I Need to Know? The Effect of Background Information on Users' Reliance on QA Systems. (arXiv:2305.14331v2 [cs.CL] UPDATED)
All recent Computation and Language articles on arXiv.org for the Fediverse
Inspired by https://twitter.com/arxiv_cscl