Sparse Upcycling: Training Mixture-of-Experts from Dense Checkpoints. (arXiv:2212.05055v2 [cs.LG] UPDATED)
Towards Building Text-To-Speech Systems for the Next Billion Users. (arXiv:2211.09536v3 [cs.CL] UPDATED)
Modular Hybrid Autoregressive Transducer. (arXiv:2210.17049v2 [cs.CL] UPDATED)
Alibaba-Translate China's Submission for WMT 2022 Quality Estimation Shared Task. (arXiv:2210.10049v2 [cs.CL] UPDATED)
Alibaba-Translate China's Submission for WMT 2022 Metrics Shared Task. (arXiv:2210.09683v2 [cs.CL] UPDATED)
A Kernel-Based View of Language Model Fine-Tuning. (arXiv:2210.05643v2 [cs.LG] UPDATED)
FRMT: A Benchmark for Few-Shot Region-Aware Machine Translation. (arXiv:2210.00193v2 [cs.CL] UPDATED)
Write and Paint: Generative Vision-Language Models are Unified Modal Learners. (arXiv:2206.07699v3 [cs.CV] UPDATED)
Descartes: Generating Short Descriptions of Wikipedia Articles. (arXiv:2205.10012v3 [cs.CL] UPDATED)
Unsupervised Keyphrase Extraction via Interpretable Neural Networks. (arXiv:2203.07640v2 [cs.CL] UPDATED)
Aligning AI With Shared Human Values. (arXiv:2008.02275v6 [cs.CY] UPDATED)
Complex QA and language models hybrid architectures, Survey. (arXiv:2302.09051v1 [cs.CL])
CK-Transformer: Commonsense Knowledge Enhanced Transformers for Referring Expression Comprehension. (arXiv:2302.09027v1 [cs.CV])
Designing and Evaluating Interfaces that Highlight News Coverage Diversity Using Discord Questions. (arXiv:2302.08997v1 [cs.HC])
Towards Fine-Grained Information: Identifying the Type and Location of Translation Errors. (arXiv:2302.08975v1 [cs.CL])
Grimm in Wonderland: Prompt Engineering with Midjourney to Illustrate Fairytales. (arXiv:2302.08961v1 [cs.CL])
Like a Good Nearest Neighbor: Practical Content Moderation with Sentence Transformers. (arXiv:2302.08957v1 [cs.CL])
AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages. (arXiv:2302.08956v1 [cs.CL])
Handling the Alignment for Wake Word Detection: A Comparison Between Alignment-Based, Alignment-Free and Hybrid Approaches. (arXiv:2302.08950v1 [cs.CL])
Entry Separation using a Mixed Visual and Textual Language Model: Application to 19th century French Trade Directories. (arXiv:2302.08948v1 [cs.CL])
All recent Computation and Language articles on arXiv.org for the Fediverse
Inspired by https://twitter.com/arxiv_cscl