Show newer

Distilling Script Knowledge from Large Language Models for Constrained Language Planning. (arXiv:2305.05252v4 [cs.CL] UPDATED) 

MGR: Multi-generator based Rationalization. (arXiv:2305.04492v3 [cs.LG] UPDATED) 

X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages. (arXiv:2305.04160v3 [cs.CL] UPDATED) 

Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models. (arXiv:2305.04091v2 [cs.CL] UPDATED) 

Sensitive Data Detection with High-Throughput Machine Learning Models in Electrical Health Records. (arXiv:2305.03169v2 [cs.CR] UPDATED) 

Are Human Explanations Always Helpful? Towards Objective Evaluation of Human Natural Language Explanations. (arXiv:2305.03117v2 [cs.CL] UPDATED) 

Task-Optimized Adapters for an End-to-End Task-Oriented Dialogue System. (arXiv:2305.02468v2 [cs.CL] UPDATED) 

Improving Contrastive Learning of Sentence Embeddings from AI Feedback. (arXiv:2305.01918v3 [cs.CL] UPDATED) 

SCOTT: Self-Consistent Chain-of-Thought Distillation. (arXiv:2305.01879v2 [cs.CL] UPDATED) 

Search-in-the-Chain: Towards Accurate, Credible and Traceable Large Language Models for Knowledge-intensive Tasks. (arXiv:2304.14732v4 [cs.CL] UPDATED) 

PMC-LLaMA: Further Finetuning LLaMA on Medical Papers. (arXiv:2304.14454v2 [cs.CL] UPDATED) 

LaMP: When Large Language Models Meet Personalization. (arXiv:2304.11406v2 [cs.CL] UPDATED) 

RRHF: Rank Responses to Align Language Models with Human Feedback without tears. (arXiv:2304.05302v2 [cs.CL] UPDATED) 

Why think step by step? Reasoning emerges from the locality of experience. (arXiv:2304.03843v2 [cs.AI] UPDATED) 

Inspecting and Editing Knowledge Representations in Language Models. (arXiv:2304.00740v2 [cs.CL] UPDATED) 

MEGA: Multilingual Evaluation of Generative AI. (arXiv:2303.12528v3 [cs.CL] UPDATED) 

Self-supervised Meta-Prompt Learning with Meta-Gradient Regularization for Few-shot Generalization. (arXiv:2303.12314v3 [cs.CL] UPDATED) 

Reflexion: Language Agents with Verbal Reinforcement Learning. (arXiv:2303.11366v2 [cs.AI] UPDATED) 

UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers. (arXiv:2303.00807v2 [cs.IR] UPDATED) 

Handling the Alignment for Wake Word Detection: A Comparison Between Alignment-Based, Alignment-Free and Hybrid Approaches. (arXiv:2302.08950v2 [cs.CL] UPDATED) 

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.