Show newer

Dataless Knowledge Fusion by Merging Weights of Language Models. (arXiv:2212.09849v1 [cs.CL]) 

(Psycho-)Linguistic Features Meet Transformer Models for Improved Explainable and Controllable Text Simplification. (arXiv:2212.09848v1 [cs.CL]) 

Exploring Hybrid and Ensemble Models for Multiclass Prediction of Mental Health Status on Social Media. (arXiv:2212.09839v1 [cs.CL]) 

What to Read in a Contract? Party-Specific Summarization of Important Obligations, Entitlements, and Prohibitions in Legal Documents. (arXiv:2212.09825v1 [cs.CL]) 

Memory-efficient NLLB-200: Language-specific Expert Pruning of a Massively Multilingual Machine Translation Model. (arXiv:2212.09811v1 [cs.CL]) 

Training Trajectories of Language Models Across Scales. (arXiv:2212.09803v1 [cs.CL]) 

Human-in-the-loop Abstractive Dialogue Summarization. (arXiv:2212.09750v1 [cs.CL]) 

Swing Distillation: A Privacy-Preserving Knowledge Distillation Framework. (arXiv:2212.08349v1 [cs.LG] CROSS LISTED) 

UNIREX: A Unified Learning Framework for Language Model Rationale Extraction. (arXiv:2112.08802v2 [cs.CL] CROSS LISTED) 

SalKG: Learning From Knowledge Graph Explanations for Commonsense Reasoning. (arXiv:2104.08793v5 [cs.CL] CROSS LISTED) 

Teaching Small Language Models to Reason. (arXiv:2212.08410v2 [cs.CL] UPDATED) 

Azimuth: Systematic Error Analysis for Text Classification. (arXiv:2212.08216v2 [cs.LG] UPDATED) 

Lisan: Yemeni, Iraqi, Libyan, and Sudanese Arabic Dialect Copora with Morphological Annotations. (arXiv:2212.06468v2 [cs.CL] UPDATED) 

Improving Generalization of Pre-trained Language Models via Stochastic Weight Averaging. (arXiv:2212.05956v2 [cs.CL] UPDATED) 

Feature-Level Debiased Natural Language Understanding. (arXiv:2212.05421v2 [cs.CL] UPDATED) 

Investigating Glyph Phonetic Information for Chinese Spell Checking: What Works and What's Next. (arXiv:2212.04068v2 [cs.CL] UPDATED) 

AIONER: All-in-one scheme-based biomedical named entity recognition using deep learning. (arXiv:2211.16944v2 [cs.CL] UPDATED) 

Refined Semantic Enhancement towards Frequency Diffusion for Video Captioning. (arXiv:2211.15076v2 [cs.CV] UPDATED) 

Context Variance Evaluation of Pretrained Language Models for Prompt-based Biomedical Knowledge Probing. (arXiv:2211.10265v2 [cs.CL] UPDATED) 

UniSumm: Unified Few-shot Summarization with Multi-Task Pre-Training and Prefix-Tuning. (arXiv:2211.09783v5 [cs.CL] UPDATED) 

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.