Show newer

MathVista: Evaluating Math Reasoning in Visual Contexts with GPT-4V, Bard, and Other Large Multimodal Models. (arXiv:2310.02255v2 [cs.CV] UPDATED) 

Multilingual Natural Language Processing Model for Radiology Reports -- The Summary is all you need!. (arXiv:2310.00100v3 [cs.CL] UPDATED) 

What are Public Concerns about ChatGPT? A Novel Self-Supervised Neural Topic Model Tells You. (arXiv:2309.01522v2 [cs.CL] UPDATED) 

Large Content And Behavior Models To Understand, Simulate, And Optimize Content And Behavior. (arXiv:2309.00359v3 [cs.CL] UPDATED) 

Lexical Diversity in Kinship Across Languages and Dialects. (arXiv:2308.13056v2 [cs.CL] UPDATED) 

A Knowledge-enhanced Two-stage Generative Framework for Medical Dialogue Information Extraction. (arXiv:2307.16200v3 [cs.CL] UPDATED) 

AlpaGasus: Training A Better Alpaca with Fewer Data. (arXiv:2307.08701v3 [cs.CL] UPDATED) 

Large Language Models as General Pattern Machines. (arXiv:2307.04721v2 [cs.AI] UPDATED) 

CARE-MI: Chinese Benchmark for Misinformation Evaluation in Maternity and Infant Care. (arXiv:2307.01458v4 [cs.CL] UPDATED) 

DocumentNet: Bridging the Data Gap in Document Pre-Training. (arXiv:2306.08937v3 [cs.CL] UPDATED) 

Revisiting Out-of-distribution Robustness in NLP: Benchmark, Analysis, and LLMs Evaluations. (arXiv:2306.04618v2 [cs.CL] UPDATED) 

Scaling Data-Constrained Language Models. (arXiv:2305.16264v4 [cs.CL] UPDATED) 

Bhasha-Abhijnaanam: Native-script and romanized Language Identification for 22 Indic languages. (arXiv:2305.15814v3 [cs.CL] UPDATED) 

Visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models. (arXiv:2305.15080v2 [cs.CL] UPDATED) 

AutoPlan: Automatic Planning of Interactive Decision-Making Tasks With Large Language Models. (arXiv:2305.15064v3 [cs.CL] UPDATED) 

Editing Common Sense in Transformers. (arXiv:2305.14956v3 [cs.CL] UPDATED) 

Coverage-based Example Selection for In-Context Learning. (arXiv:2305.14907v2 [cs.CL] UPDATED) 

Estimating Large Language Model Capabilities without Labeled Test Data. (arXiv:2305.14802v2 [cs.CL] UPDATED) 

DecipherPref: Analyzing Influential Factors in Human Preference Judgments via GPT-4. (arXiv:2305.14702v2 [cs.CL] UPDATED) 

What Else Do I Need to Know? The Effect of Background Information on Users' Reliance on QA Systems. (arXiv:2305.14331v2 [cs.CL] UPDATED) 

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.