Show newer

Recurrent Memory Transformer. (arXiv:2207.06881v2 [cs.CL] UPDATED) 

Know Where You're Going: Meta-Learning for Parameter-Efficient Fine-Tuning. (arXiv:2205.12453v2 [cs.CL] UPDATED) 

Measuring Context-Word Biases in Lexical Semantic Datasets. (arXiv:2112.06733v4 [cs.CL] UPDATED) 

Reinforcement Learning for Few-Shot Text Generation Adaptation. (arXiv:2111.11030v3 [cs.CL] UPDATED) 

Confounds and Overestimations in Fake Review Detection: Experimentally Controlling for Product-Ownership and Data-Origin. (arXiv:2110.15130v2 [cs.CL] UPDATED) 

Improving Named Entity Recognition by External Context Retrieving and Cooperative Learning. (arXiv:2105.03654v3 [cs.CL] UPDATED) 

OFASys: A Multi-Modal Multi-Task Learning System for Building Generalist Models. (arXiv:2212.04408v1 [cs.CV]) 

BEVBert: Topo-Metric Map Pre-training for Language-guided Navigation. (arXiv:2212.04385v1 [cs.CV]) 

Robust Speech Recognition via Large-Scale Weak Supervision. (arXiv:2212.04356v1 [eess.AS]) 

Implicit causality in GPT-2: a case study. (arXiv:2212.04348v1 [cs.CL]) 

Lattice-Free Sequence Discriminative Training for Phoneme-Based Neural Transducers. (arXiv:2212.04325v1 [eess.AS]) 

Montague semantics and modifier consistency measurement in neural language models. (arXiv:2212.04310v1 [cs.CL]) 

Lie detection algorithms attract few users but vastly increase accusation rates. (arXiv:2212.04277v1 [econ.GN]) 

A Modality-level Explainable Framework for Misinformation Checking in Social Networks. (arXiv:2212.04272v1 [cs.LG]) 

ConsistTL: Modeling Consistency in Transfer Learning for Low-Resource Neural Machine Translation. (arXiv:2212.04262v1 [cs.CL]) 

Momentum Calibration for Text Generation. (arXiv:2212.04257v1 [cs.CL]) 

Harnessing the Power of Multi-Task Pretraining for Ground-Truth Level Natural Language Explanations. (arXiv:2212.04231v1 [cs.CV]) 

The Neural Correlates of Linguistic Structure Building: Comments on Kazanina & Tavano (2022). (arXiv:2212.04219v1 [cs.CL]) 

Scientific Paper Extractive Summarization Enhanced by Citation Graphs. (arXiv:2212.04214v1 [cs.CL]) 

DC-MBR: Distributional Cooling for Minimum Bayesian Risk Decoding. (arXiv:2212.04205v1 [cs.CL]) 

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.