Show newer

Hard Prompts Made Easy: Gradient-Based Discrete Optimization for Prompt Tuning and Discovery. (arXiv:2302.03668v2 [cs.LG] UPDATED) 

Grounding Language Models to Images for Multimodal Inputs and Outputs. (arXiv:2301.13823v3 [cs.CL] UPDATED) 

Communication Drives the Emergence of Language Universals in Neural Agents: Evidence from the Word-order/Case-marking Trade-off. (arXiv:2301.13083v2 [cs.CL] UPDATED) 

A Survey on In-context Learning. (arXiv:2301.00234v3 [cs.CL] UPDATED) 

DISCO: Distilling Phrasal Counterfactuals with Large Language Models. (arXiv:2212.10534v2 [cs.CL] UPDATED) 

Towards Understanding Chain-of-Thought Prompting: An Empirical Study of What Matters. (arXiv:2212.10001v2 [cs.CL] UPDATED) 

Reasoning with Language Model Prompting: A Survey. (arXiv:2212.09597v5 [cs.CL] UPDATED) 

Teaching Small Language Models to Reason. (arXiv:2212.08410v3 [cs.CL] UPDATED) 

Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning. (arXiv:2212.00259v2 [cs.CV] UPDATED) 

Reward Gaming in Conditional Text Generation. (arXiv:2211.08714v3 [cs.CL] UPDATED) 

MT Metrics Correlate with Human Ratings of Simultaneous Speech Translation. (arXiv:2211.08633v2 [cs.CL] UPDATED) 

Speaking Multiple Languages Affects the Moral Bias of Language Models. (arXiv:2211.07733v2 [cs.CL] UPDATED) 

Emergent Linguistic Structures in Neural Networks are Fragile. (arXiv:2210.17406v8 [cs.LG] UPDATED) 

Arithmetic Sampling: Parallel Diverse Decoding for Large Language Models. (arXiv:2210.15458v2 [cs.CL] UPDATED) 

Automatic Creation of Named Entity Recognition Datasets by Querying Phrase Representations. (arXiv:2210.07586v4 [cs.CL] UPDATED) 

SQuId: Measuring Speech Naturalness in Many Languages. (arXiv:2210.06324v2 [cs.CL] UPDATED) 

Best Prompts for Text-to-Image Models and How to Find Them. (arXiv:2209.11711v3 [cs.HC] UPDATED) 

Do Large Language Models know what humans know?. (arXiv:2209.01515v3 [cs.CL] UPDATED) 

Claim-Dissector: An Interpretable Fact-Checking System with Joint Re-ranking and Veracity Prediction. (arXiv:2207.14116v3 [cs.CL] UPDATED) 

GENEVA: Benchmarking Generalizability for Event Argument Extraction with Hundreds of Event Types and Argument Roles. (arXiv:2205.12505v5 [cs.CL] UPDATED) 

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.