Show newer

Training Trajectories of Language Models Across Scales. (arXiv:2212.09803v3 [cs.CL] UPDATED) 

One Embedder, Any Task: Instruction-Finetuned Text Embeddings. (arXiv:2212.09741v3 [cs.CL] UPDATED) 

LENS: A Learnable Evaluation Metric for Text Simplification. (arXiv:2212.09739v2 [cs.CL] UPDATED) 

Multi-VALUE: A Framework for Cross-Dialectal English NLP. (arXiv:2212.08011v3 [cs.CL] UPDATED) 

Prompting Is Programming: A Query Language for Large Language Models. (arXiv:2212.06094v3 [cs.CL] UPDATED) 

GPT-3-driven pedagogical agents for training children's curious question-asking skills. (arXiv:2211.14228v6 [cs.CL] UPDATED) 

World Knowledge in Multiple Choice Reading Comprehension. (arXiv:2211.07040v2 [cs.CL] UPDATED) 

Using contradictions improves question answering systems. (arXiv:2211.05598v2 [cs.CL] UPDATED) 

LAMASSU: A Streaming Language-Agnostic Multilingual Speech Recognition and Translation Model Using Neural Transducers. (arXiv:2211.02809v2 [cs.CL] UPDATED) 

Saliency Map Verbalization: Comparing Feature Importance Representations from Model-free and Instruction-based Methods. (arXiv:2210.07222v2 [cs.CL] UPDATED) 

Physical computation and compositionality. (arXiv:2210.00392v3 [quant-ph] UPDATED) 

Extractive is not Faithful: An Investigation of Broad Unfaithfulness Problems in Extractive Summarization. (arXiv:2209.03549v2 [cs.CL] UPDATED) 

Diversity Enhanced Table-to-Text Generation via Type Control. (arXiv:2205.10938v2 [cs.CL] UPDATED) 

Multi-armed bandits for resource efficient, online optimization of language model pre-training: the use case of dynamic masking. (arXiv:2203.13151v2 [cs.CL] UPDATED) 

C2-CRS: Coarse-to-Fine Contrastive Learning for Conversational Recommender System. (arXiv:2201.02732v3 [cs.CL] UPDATED) 

Detecting Inspiring Content on Social Media. (arXiv:2109.02734v2 [cs.CL] UPDATED) 

Enhanced Chart Understanding in Vision and Language Task via Cross-modal Pre-training on Plot Table Pairs. (arXiv:2305.18641v1 [cs.CL]) 

Short Answer Grading Using One-shot Prompting and Text Similarity Scoring Model. (arXiv:2305.18638v1 [cs.CL]) 

W-procer: Weighted Prototypical Contrastive Learning for Medical Few-Shot Named Entity Recognition. (arXiv:2305.18624v1 [cs.CL]) 

Alfred: A System for Prompted Weak Supervision. (arXiv:2305.18623v1 [cs.LG]) 

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.