MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use. (arXiv:2310.03128v2 [cs.SE] UPDATED)
Hierarchical Evaluation Framework: Best Practices for Human Evaluation. (arXiv:2310.01917v2 [cs.CL] UPDATED)
Ring Attention with Blockwise Transformers for Near-Infinite Context. (arXiv:2310.01889v3 [cs.CL] UPDATED)
GenAI Against Humanity: Nefarious Applications of Generative Artificial Intelligence and Large Language Models. (arXiv:2310.00737v2 [cs.CY] UPDATED)
A Comprehensive Survey of Document-level Relation Extraction (2016-2023). (arXiv:2309.16396v3 [cs.CL] UPDATED)
DiLu: A Knowledge-Driven Approach to Autonomous Driving with Large Language Models. (arXiv:2309.16292v2 [cs.RO] UPDATED)
AceGPT, Localizing Large Language Models in Arabic. (arXiv:2309.12053v3 [cs.CL] UPDATED)
MBR and QE Finetuning: Training-time Distillation of the Best and Most Expensive Decoding Methods. (arXiv:2309.10966v5 [cs.CL] UPDATED)
MINT: Evaluating LLMs in Multi-turn Interaction with Tools and Language Feedback. (arXiv:2309.10691v2 [cs.CL] UPDATED)
DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning. (arXiv:2309.05173v2 [cs.CL] UPDATED)
PromptTTS 2: Describing and Generating Voices with Text Prompt. (arXiv:2309.02285v2 [eess.AS] UPDATED)
Where are We in Event-centric Emotion Analysis? Bridging Emotion Role Labeling and Appraisal-based Approaches. (arXiv:2309.02092v3 [cs.CL] UPDATED)
StoryBench: A Multifaceted Benchmark for Continuous Story Visualization. (arXiv:2308.11606v2 [cs.CV] UPDATED)
PlatoLM: Teaching LLMs via a Socratic Questioning User Simulator. (arXiv:2308.11534v4 [cs.CL] UPDATED)
Exploring zero-shot capability of large language models in inferences from medical oncology notes. (arXiv:2308.03853v2 [cs.CL] UPDATED)
CIDER: Context sensitive sentiment analysis for short-form text. (arXiv:2307.07864v2 [cs.CL] UPDATED)
Distilling Large Vision-Language Model with Out-of-Distribution Generalizability. (arXiv:2307.03135v3 [cs.CV] UPDATED)
Questioning the Survey Responses of Large Language Models. (arXiv:2306.07951v2 [cs.CL] UPDATED)
An Efficient Multilingual Language Model Compression through Vocabulary Trimming. (arXiv:2305.15020v2 [cs.CL] UPDATED)
Learning to Generate Novel Scientific Directions with Contextualized Literature-based Discovery. (arXiv:2305.14259v3 [cs.CL] UPDATED)
All recent Computation and Language articles on arXiv.org for the Fediverse
Inspired by https://twitter.com/arxiv_cscl