Show newer

Enhancing Long-form Text Generation Efficacy with Task-adaptive Tokenization. (arXiv:2310.05317v4 [cs.CL] UPDATED) 

Are Personalized Stochastic Parrots More Dangerous? Evaluating Persona Biases in Dialogue Systems. (arXiv:2310.05280v4 [cs.CL] UPDATED) 

DialCoT Meets PPO: Decomposing and Exploring Reasoning Paths in Smaller Language Models. (arXiv:2310.05074v3 [cs.CL] UPDATED) 

Guideline Learning for In-context Information Extraction. (arXiv:2310.05066v2 [cs.CL] UPDATED) 

Large Language Models Only Pass Primary School Exams in Indonesia: A Comprehensive Test on IndoMMLU. (arXiv:2310.04928v2 [cs.CL] UPDATED) 

LoFT: Local Proxy Fine-tuning For Improving Transferability Of Adversarial Attacks Against Large Language Model. (arXiv:2310.04445v2 [cs.CL] UPDATED) 

Evaluating Hallucinations in Chinese Large Language Models. (arXiv:2310.03368v2 [cs.CL] UPDATED) 

Conversational Health Agents: A Personalized LLM-Powered Agent Framework. (arXiv:2310.02374v2 [cs.CL] UPDATED) 

Improving Dialogue Management: Quality Datasets vs Models. (arXiv:2310.01339v2 [cs.CL] UPDATED) 

GPT-Fathom: Benchmarking Large Language Models to Decipher the Evolutionary Path towards GPT-4 and Beyond. (arXiv:2309.16583v3 [cs.CL] UPDATED) 

Prompt Tuned Embedding Classification for Multi-Label Industry Sector Allocation. (arXiv:2309.12075v2 [cs.CL] UPDATED) 

Towards Effective Disambiguation for Machine Translation with Large Language Models. (arXiv:2309.11668v2 [cs.CL] UPDATED) 

Clinical Text Summarization: Adapting Large Language Models Can Outperform Human Experts. (arXiv:2309.07430v2 [cs.CL] UPDATED) 

Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL. (arXiv:2309.06553v3 [cs.CL] UPDATED) 

One Wide Feedforward is All You Need. (arXiv:2309.01826v2 [cs.CL] UPDATED) 

OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models. (arXiv:2308.13137v2 [cs.LG] UPDATED) 

AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors. (arXiv:2308.10848v3 [cs.CL] UPDATED) 

End-to-End Evaluation for Low-Latency Simultaneous Speech Translation. (arXiv:2308.03415v2 [cs.CL] UPDATED) 

Baby's CoThought: Leveraging Large Language Models for Enhanced Reasoning in Compact Models. (arXiv:2308.01684v2 [cs.CL] UPDATED) 

Trie-NLG: Trie Context Augmentation to Improve Personalized Query Auto-Completion for Short and Unseen Prefixes. (arXiv:2307.15455v2 [cs.CL] UPDATED) 

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.