Enhancing Long-form Text Generation Efficacy with Task-adaptive Tokenization. (arXiv:2310.05317v4 [cs.CL] UPDATED)
Are Personalized Stochastic Parrots More Dangerous? Evaluating Persona Biases in Dialogue Systems. (arXiv:2310.05280v4 [cs.CL] UPDATED)
DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models. (arXiv:2310.05074v3 [cs.CL] UPDATED)
Guideline Learning for In-context Information Extraction. (arXiv:2310.05066v2 [cs.CL] UPDATED)
Large Language Models Only Pass Primary School Exams in Indonesia: A Comprehensive Test on IndoMMLU. (arXiv:2310.04928v2 [cs.CL] UPDATED)
LoFT: Local Proxy Fine-tuning For Improving Transferability Of Adversarial Attacks Against Large Language Model. (arXiv:2310.04445v2 [cs.CL] UPDATED)
Evaluating Hallucinations in Chinese Large Language Models. (arXiv:2310.03368v2 [cs.CL] UPDATED)
Conversational Health Agents: A Personalized LLM-Powered Agent Framework. (arXiv:2310.02374v2 [cs.CL] UPDATED)
Improving Dialogue Management: Quality Datasets vs Models. (arXiv:2310.01339v2 [cs.CL] UPDATED)
GPT-Fathom: Benchmarking Large Language Models to Decipher the Evolutionary Path towards GPT-4 and Beyond. (arXiv:2309.16583v3 [cs.CL] UPDATED)
Prompt Tuned Embedding Classification for Multi-Label Industry Sector Allocation. (arXiv:2309.12075v2 [cs.CL] UPDATED)
Towards Effective Disambiguation for Machine Translation with Large Language Models. (arXiv:2309.11668v2 [cs.CL] UPDATED)
Clinical Text Summarization: Adapting Large Language Models Can Outperform Human Experts. (arXiv:2309.07430v2 [cs.CL] UPDATED)
Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL. (arXiv:2309.06553v3 [cs.CL] UPDATED)
One Wide Feedforward is All You Need. (arXiv:2309.01826v2 [cs.CL] UPDATED)
OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models. (arXiv:2308.13137v2 [cs.LG] UPDATED)
AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors. (arXiv:2308.10848v3 [cs.CL] UPDATED)
End-to-End Evaluation for Low-Latency Simultaneous Speech Translation. (arXiv:2308.03415v2 [cs.CL] UPDATED)
Baby's CoThought: Leveraging Large Language Models for Enhanced Reasoning in Compact Models. (arXiv:2308.01684v2 [cs.CL] UPDATED)
Trie-NLG: Trie Context Augmentation to Improve Personalized Query Auto-Completion for Short and Unseen Prefixes. (arXiv:2307.15455v2 [cs.CL] UPDATED)
All recent Computation and Language articles on arXiv.org for the Fediverse
Inspired by https://twitter.com/arxiv_cscl