Show newer

QUDEVAL: The Evaluation of Questions Under Discussion Discourse Parsing. (arXiv:2310.14520v2 [cs.CL] UPDATED) 

Language Agents for Detecting Implicit Stereotypes in Text-to-image Models at Scale. (arXiv:2310.11778v3 [cs.CY] UPDATED) 

EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling. (arXiv:2310.04691v3 [cs.CL] UPDATED) 

LLM and Infrastructure as a Code use case. (arXiv:2309.01456v2 [cs.CL] UPDATED) 

A Comprehensive Overview of Large Language Models. (arXiv:2307.06435v5 [cs.CL] UPDATED) 

VoxPoser: Composable 3D Value Maps for Robotic Manipulation with Language Models. (arXiv:2307.05973v2 [cs.RO] UPDATED) 

Text Alignment Is An Efficient Unified Model for Massive NLP Tasks. (arXiv:2307.02729v2 [cs.CL] UPDATED) 

EHRSHOT: An EHR Benchmark for Few-Shot Evaluation of Foundation Models. (arXiv:2307.02028v2 [cs.LG] UPDATED) 

Meta-Reasoning: Semantics-Symbol Deconstruction For Large Language Models. (arXiv:2306.17820v2 [cs.CL] UPDATED) 

Iterated Piecewise Affine (IPA) Approximation for Language Modeling. (arXiv:2306.12317v3 [cs.CL] UPDATED) 

AVIS: Autonomous Visual Information Seeking with Large Language Model Agent. (arXiv:2306.08129v3 [cs.CV] UPDATED) 

Diable: Efficient Dialogue State Tracking as Operations on Tables. (arXiv:2305.17020v3 [cs.CL] UPDATED) 

The Art of SOCRATIC QUESTIONING: Recursive Thinking with Large Language Models. (arXiv:2305.14999v2 [cs.CL] UPDATED) 

Towards Legally Enforceable Hate Speech Detection for Public Forums. (arXiv:2305.13677v2 [cs.CL] UPDATED) 

Open-world Semi-supervised Generalized Relation Discovery Aligned in a Real-world Setting. (arXiv:2305.13533v2 [cs.CL] UPDATED) 

SEAHORSE: A Multilingual, Multifaceted Dataset for Summarization Evaluation. (arXiv:2305.13194v2 [cs.CL] UPDATED) 

Discovering Universal Geometry in Embeddings with ICA. (arXiv:2305.13175v2 [cs.CL] UPDATED) 

Textually Pretrained Speech Language Models. (arXiv:2305.13009v2 [cs.CL] UPDATED) 

Can LMs Generalize to Future Data? An Empirical Analysis on Text Summarization. (arXiv:2305.01951v3 [cs.CL] UPDATED) 

How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained language model. (arXiv:2305.00586v5 [cs.CL] UPDATED) 

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.