Recurrent Memory Transformer. (arXiv:2207.06881v2 [cs.CL] UPDATED)
Know Where You're Going: Meta-Learning for Parameter-Efficient Fine-Tuning. (arXiv:2205.12453v2 [cs.CL] UPDATED)
Measuring Context-Word Biases in Lexical Semantic Datasets. (arXiv:2112.06733v4 [cs.CL] UPDATED)
Reinforcement Learning for Few-Shot Text Generation Adaptation. (arXiv:2111.11030v3 [cs.CL] UPDATED)
Confounds and Overestimations in Fake Review Detection: Experimentally Controlling for Product-Ownership and Data-Origin. (arXiv:2110.15130v2 [cs.CL] UPDATED)
Improving Named Entity Recognition by External Context Retrieving and Cooperative Learning. (arXiv:2105.03654v3 [cs.CL] UPDATED)
OFASys: A Multi-Modal Multi-Task Learning System for Building Generalist Models. (arXiv:2212.04408v1 [cs.CV])
BEVBert: Topo-Metric Map Pre-training for Language-guided Navigation. (arXiv:2212.04385v1 [cs.CV])
Robust Speech Recognition via Large-Scale Weak Supervision. (arXiv:2212.04356v1 [eess.AS])
Implicit causality in GPT-2: a case study. (arXiv:2212.04348v1 [cs.CL])
Lattice-Free Sequence Discriminative Training for Phoneme-Based Neural Transducers. (arXiv:2212.04325v1 [eess.AS])
Montague semantics and modifier consistency measurement in neural language models. (arXiv:2212.04310v1 [cs.CL])
Lie detection algorithms attract few users but vastly increase accusation rates. (arXiv:2212.04277v1 [econ.GN])
A Modality-level Explainable Framework for Misinformation Checking in Social Networks. (arXiv:2212.04272v1 [cs.LG])
ConsistTL: Modeling Consistency in Transfer Learning for Low-Resource Neural Machine Translation. (arXiv:2212.04262v1 [cs.CL])
Momentum Calibration for Text Generation. (arXiv:2212.04257v1 [cs.CL])
Harnessing the Power of Multi-Task Pretraining for Ground-Truth Level Natural Language Explanations. (arXiv:2212.04231v1 [cs.CV])
The Neural Correlates of Linguistic Structure Building: Comments on Kazanina & Tavano (2022). (arXiv:2212.04219v1 [cs.CL])
Scientific Paper Extractive Summarization Enhanced by Citation Graphs. (arXiv:2212.04214v1 [cs.CL])
DC-MBR: Distributional Cooling for Minimum Bayesian Risk Decoding. (arXiv:2212.04205v1 [cs.CL])
All recent Computation and Language articles on arXiv.org for the Fediverse
Inspired by https://twitter.com/arxiv_cscl