Show newer

Learning by Applying: A General Framework for Mathematical Reasoning via Enhancing Explicit Knowledge Learning. (arXiv:2302.05717v1 [cs.AI]) 

Fair Enough: Standardizing Evaluation and Model Selection for Fairness Research in NLP. (arXiv:2302.05711v1 [cs.CL]) 

MTTM: Metamorphic Testing for Textual Content Moderation Software. (arXiv:2302.05706v1 [cs.CL]) 

HateProof: Are Hateful Meme Detection Systems really Robust?. (arXiv:2302.05703v1 [cs.CL]) 

Compositional Exemplars for In-context Learning. (arXiv:2302.05698v1 [cs.CL]) 

Counter-GAP: Counterfactual Bias Evaluation through Gendered Ambiguous Pronouns. (arXiv:2302.05674v1 [cs.CL]) 

DocILE Benchmark for Document Information Localization and Extraction. (arXiv:2302.05658v1 [cs.CL]) 

Dialectograms: Machine Learning Differences between Discursive Communities. (arXiv:2302.05657v1 [cs.CL]) 

Evaluating the Robustness of Discrete Prompts. (arXiv:2302.05619v1 [cs.CL]) 

Metaphor Detection with Effective Context Denoising. (arXiv:2302.05611v1 [cs.CL]) 

Emotion Detection From Social Media Posts. (arXiv:2302.05610v1 [cs.LG]) 

MatKB: Semantic Search for Polycrystalline Materials Synthesis Procedures. (arXiv:2302.05597v1 [cs.CL]) 

ASDF: A Differential Testing Framework for Automatic Speech Recognition Systems. (arXiv:2302.05582v1 [eess.AS]) 

Characterizing Attribution and Fluency Tradeoffs for Retrieval-Augmented Large Language Models. (arXiv:2302.05578v1 [cs.CL]) 

NapSS: Paragraph-level Medical Text Simplification via Narrative Prompting and Sentence-matching Summarization. (arXiv:2302.05574v1 [cs.CL]) 

FairPy: A Toolkit for Evaluation of Social Biases and their Mitigation in Large Language Models. (arXiv:2302.05508v1 [cs.CL]) 

Long-Context Language Decision Transformers and Exponential Tilt for Interactive Text Environments. (arXiv:2302.05507v1 [cs.CL]) 

Distillation of encoder-decoder transformers for sequence labelling. (arXiv:2302.05454v1 [cs.CL]) 

Towards Inferential Reproducibility of Machine Learning Research. (arXiv:2302.04054v2 [cs.LG] UPDATED) 

Reliable Natural Language Understanding with Large Language Models and Answer Set Programming. (arXiv:2302.03780v2 [cs.CL] UPDATED) 

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.