AV2Wav: Diffusion-Based Re-synthesis from Continuous Self-supervised Features for Audio-Visual Speech Enhancement. (arXiv:2309.08030v2 [eess.AS] UPDATED) 

Clinical Text Summarization: Adapting Large Language Models Can Outperform Human Experts. (arXiv:2309.07430v3 [cs.CL] UPDATED) 

Measuring vagueness and subjectivity in texts: from symbolic to neural VAGO. (arXiv:2309.06132v2 [cs.CL] UPDATED) 

nanoT5: A PyTorch Framework for Pre-training and Fine-tuning T5-style Models with Limited Resources. (arXiv:2309.02373v2 [cs.CL] UPDATED) 

LLM Self Defense: By Self Examination, LLMs Know They Are Being Tricked. (arXiv:2308.07308v3 [cs.CL] UPDATED) 

TARJAMAT: Evaluation of Bard and ChatGPT on Machine Translation of Ten Arabic Varieties. (arXiv:2308.03051v2 [cs.CL] UPDATED) 

MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities. (arXiv:2308.02490v3 [cs.AI] UPDATED) 

Baby Llama: knowledge distillation from an ensemble of teachers trained on a small dataset with no performance penalty. (arXiv:2308.02019v2 [cs.CL] UPDATED) 

WebArena: A Realistic Web Environment for Building Autonomous Agents. (arXiv:2307.13854v2 [cs.AI] UPDATED) 

RADAR: Robust AI-Text Detection via Adversarial Learning. (arXiv:2307.03838v2 [cs.CL] UPDATED) 

Don't Trust ChatGPT when Your Question is not in English: A Study of Multilingual Abilities and Types of LLMs. (arXiv:2305.16339v2 [cs.CL] UPDATED) 

Contrastive Learning of Sentence Embeddings from Scratch. (arXiv:2305.15077v2 [cs.CL] UPDATED) 

ToMChallenges: A Principle-Guided Dataset and Diverse Evaluation Tasks for Exploring Theory of Mind. (arXiv:2305.15068v2 [cs.CL] UPDATED) 

Dior-CVAE: Pre-trained Language Models and Diffusion Priors for Variational Dialog Generation. (arXiv:2305.15025v2 [cs.CL] UPDATED) 

The ACL OCL Corpus: Advancing Open Science in Computational Linguistics. (arXiv:2305.14996v2 [cs.CL] UPDATED) 

Dolphin: A Challenging and Diverse Benchmark for Arabic NLG. (arXiv:2305.14989v2 [cs.CL] UPDATED) 

Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback. (arXiv:2305.14975v2 [cs.CL] UPDATED) 

GRACE: Discriminator-Guided Chain-of-Thought Reasoning. (arXiv:2305.14934v2 [cs.CL] UPDATED) 

ByteSized32: A Corpus and Challenge Task for Generating Task-Specific World Models Expressed as Text Games. (arXiv:2305.14879v2 [cs.CL] UPDATED) 

Leveraging GPT-4 for Automatic Translation Post-Editing. (arXiv:2305.14878v2 [cs.CL] UPDATED) 
