Learning by Applying: A General Framework for Mathematical Reasoning via Enhancing Explicit Knowledge Learning. (arXiv:2302.05717v1 [cs.AI])
Fair Enough: Standardizing Evaluation and Model Selection for Fairness Research in NLP. (arXiv:2302.05711v1 [cs.CL])
MTTM: Metamorphic Testing for Textual Content Moderation Software. (arXiv:2302.05706v1 [cs.CL])
HateProof: Are Hateful Meme Detection Systems really Robust? (arXiv:2302.05703v1 [cs.CL])
Compositional Exemplars for In-context Learning. (arXiv:2302.05698v1 [cs.CL])
Counter-GAP: Counterfactual Bias Evaluation through Gendered Ambiguous Pronouns. (arXiv:2302.05674v1 [cs.CL])
DocILE Benchmark for Document Information Localization and Extraction. (arXiv:2302.05658v1 [cs.CL])
Dialectograms: Machine Learning Differences between Discursive Communities. (arXiv:2302.05657v1 [cs.CL])
Evaluating the Robustness of Discrete Prompts. (arXiv:2302.05619v1 [cs.CL])
Metaphor Detection with Effective Context Denoising. (arXiv:2302.05611v1 [cs.CL])
Emotion Detection From Social Media Posts. (arXiv:2302.05610v1 [cs.LG])
MatKB: Semantic Search for Polycrystalline Materials Synthesis Procedures. (arXiv:2302.05597v1 [cs.CL])
ASDF: A Differential Testing Framework for Automatic Speech Recognition Systems. (arXiv:2302.05582v1 [eess.AS])
Characterizing Attribution and Fluency Tradeoffs for Retrieval-Augmented Large Language Models. (arXiv:2302.05578v1 [cs.CL])
NapSS: Paragraph-level Medical Text Simplification via Narrative Prompting and Sentence-matching Summarization. (arXiv:2302.05574v1 [cs.CL])
FairPy: A Toolkit for Evaluation of Social Biases and their Mitigation in Large Language Models. (arXiv:2302.05508v1 [cs.CL])
Long-Context Language Decision Transformers and Exponential Tilt for Interactive Text Environments. (arXiv:2302.05507v1 [cs.CL])
Distillation of encoder-decoder transformers for sequence labelling. (arXiv:2302.05454v1 [cs.CL])
Towards Inferential Reproducibility of Machine Learning Research. (arXiv:2302.04054v2 [cs.LG] UPDATED)
Reliable Natural Language Understanding with Large Language Models and Answer Set Programming. (arXiv:2302.03780v2 [cs.CL] UPDATED)
All recent Computation and Language articles on arXiv.org for the Fediverse
Inspired by https://twitter.com/arxiv_cscl