Universal Self-Adaptive Prompting. (arXiv:2305.14926v2 [cs.CL] UPDATED)
Evaluating Evaluation Metrics: A Framework for Analyzing NLG Evaluation Metrics using Measurement Theory. (arXiv:2305.14889v2 [cs.CL] UPDATED)
Debiasing Made State-of-the-art: Revisiting the Simple Seed-based Weak Supervision for Text Classification. (arXiv:2305.14794v2 [cs.CL] UPDATED)
Bi-Drop: Enhancing Fine-tuning Generalization via Synchronous sub-net Estimation and Optimization. (arXiv:2305.14760v2 [cs.CL] UPDATED)
Don't Take This Out of Context! On the Need for Contextual Models and Evaluations for Stylistic Rewriting. (arXiv:2305.14755v2 [cs.CL] UPDATED)
ECHo: A Visio-Linguistic Dataset for Event Causality Inference via Human-Centric Reasoning. (arXiv:2305.14740v2 [cs.AI] UPDATED)
Centering the Margins: Outlier-Based Identification of Harmed Populations in Toxicity Detection. (arXiv:2305.14735v2 [cs.CL] UPDATED)
Gender Biases in Automatic Evaluation Metrics for Image Captioning. (arXiv:2305.14711v2 [cs.CL] UPDATED)
You Are What You Annotate: Towards Better Models through Annotator Representations. (arXiv:2305.14663v2 [cs.CL] UPDATED)
COMET-M: Reasoning about Multiple Events in Complex Sentences. (arXiv:2305.14617v2 [cs.CL] UPDATED)
Parameter-Efficient Language Model Tuning with Active Learning in Low-Resource Settings. (arXiv:2305.14576v2 [cs.CL] UPDATED)
Sources of Hallucination by Large Language Models on Inference Tasks. (arXiv:2305.14552v2 [cs.CL] UPDATED)
MathDial: A Dialogue Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems. (arXiv:2305.14536v2 [cs.CL] UPDATED)
NAIL: Lexical Retrieval Indices with Efficient Non-Autoregressive Decoders. (arXiv:2305.14499v2 [cs.CL] UPDATED)
Sociocultural Norm Similarities and Differences via Situational Alignment and Explainable Textual Entailment. (arXiv:2305.14492v2 [cs.CL] UPDATED)
Dancing Between Success and Failure: Edit-level Simplification Evaluation using SALSA. (arXiv:2305.14458v2 [cs.CL] UPDATED)
Automatic Model Selection with Large Language Models for Reasoning. (arXiv:2305.14333v2 [cs.CL] UPDATED)
TalkUp: Paving the Way for Understanding Empowering Language. (arXiv:2305.14326v2 [cs.CL] UPDATED)
LLM-powered Data Augmentation for Enhanced Cross-lingual Performance. (arXiv:2305.14288v2 [cs.CL] UPDATED)
Query Rewriting for Retrieval-Augmented Large Language Models. (arXiv:2305.14283v3 [cs.CL] UPDATED)
All recent Computation and Language articles on arXiv.org for the Fediverse
Inspired by https://twitter.com/arxiv_cscl