Task-Optimized Adapters for an End-to-End Task-Oriented Dialogue System. (arXiv:2305.02468v3 [cs.CL] UPDATED)
Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling. (arXiv:2304.01373v2 [cs.CL] UPDATED)
Aligning a medium-size GPT model in English to a small closed domain in Spanish. (arXiv:2303.17649v3 [cs.CL] UPDATED)
cTBLS: Augmenting Large Language Models with Conversational Tables. (arXiv:2303.12024v3 [cs.CL] UPDATED)
Exploring Partial Knowledge Base Inference in Biomedical Entity Linking. (arXiv:2303.10330v2 [cs.CL] UPDATED)
Query-Utterance Attention with Joint modeling for Query-Focused Meeting Summarization. (arXiv:2303.04487v2 [cs.CL] UPDATED)
Attend-and-Excite: Attention-Based Semantic Guidance for Text-to-Image Diffusion Models. (arXiv:2301.13826v2 [cs.CV] UPDATED)
UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers. (arXiv:2301.13741v2 [cs.CV] UPDATED)
Understanding INT4 Quantization for Transformer Models: Latency Speedup, Composability, and Failure Cases. (arXiv:2301.12017v2 [cs.CL] UPDATED)
Matching Exemplar as Next Sentence Prediction (MeNSP): Zero-shot Prompt Learning for Automatic Scoring in Science Education. (arXiv:2301.08771v4 [cs.CL] UPDATED)
Continual Contrastive Finetuning Improves Low-Resource Relation Extraction. (arXiv:2212.10823v3 [cs.CL] UPDATED)
Generic Temporal Reasoning with Differential Analysis and Explanation. (arXiv:2212.10467v2 [cs.CL] UPDATED)
ClarifyDelphi: Reinforced Clarification Questions with Defeasibility Rewards for Social and Moral Situations. (arXiv:2212.10409v3 [cs.CL] UPDATED)
I Cast Detect Thoughts: Learning to Converse and Guide with Intents and Theory-of-Mind in Dungeons and Dragons. (arXiv:2212.10060v2 [cs.CL] UPDATED)
Synthetic Pre-Training Tasks for Neural Machine Translation. (arXiv:2212.09864v2 [cs.CL] UPDATED)
Cross-Lingual Retrieval Augmented Prompt for Low-Resource Languages. (arXiv:2212.09651v3 [cs.CL] UPDATED)
DuNST: Dual Noisy Self Training for Semi-Supervised Controllable Text Generation. (arXiv:2212.08724v2 [cs.CL] UPDATED)
Transformers learn in-context by gradient descent. (arXiv:2212.07677v2 [cs.LG] UPDATED)
MT4SSL: Boosting Self-Supervised Speech Representation Learning by Integrating Multiple Targets. (arXiv:2211.07321v3 [cs.CL] UPDATED)
Evaluating context-invariance in unsupervised speech representations. (arXiv:2210.15775v2 [cs.CL] UPDATED)
All recent Computation and Language articles on arXiv.org for the Fediverse
Inspired by https://twitter.com/arxiv_cscl