FAIR Enough: How Can We Develop and Assess a FAIR-Compliant Dataset for Large Language Models' Training? (arXiv:2401.11033v2 [cs.CL] UPDATED)
Multilingual acoustic word embeddings for zero-resource languages. (arXiv:2401.10543v2 [eess.AS] UPDATED)
ChatQA: Building GPT-4 Level Conversational QA Models. (arXiv:2401.10225v2 [cs.CL] UPDATED)
Spatial-Temporal Large Language Model for Traffic Prediction. (arXiv:2401.10134v2 [cs.LG] UPDATED)
Partial Diacritization: A Context-Contrastive Inference Approach. (arXiv:2401.08919v2 [cs.CL] UPDATED)
Supporting Student Decisions on Learning Recommendations: An LLM-Based Chatbot with Knowledge Graph Contextualization for Conversational Explainability and Mentoring. (arXiv:2401.08517v2 [cs.AI] UPDATED)
APLe: Token-Wise Adaptive for Multi-Modal Prompt Learning. (arXiv:2401.06827v2 [cs.CV] UPDATED)
An Analysis of User Behaviors for Objectively Evaluating Spoken Dialogue Systems. (arXiv:2401.04867v2 [cs.CL] UPDATED)
Blending Is All You Need: Cheaper, Better Alternative to Trillion-Parameters LLM. (arXiv:2401.02994v3 [cs.CL] UPDATED)
AgentCoder: Multi-Agent-based Code Generation with Iterative Testing and Optimisation. (arXiv:2312.13010v2 [cs.CL] UPDATED)
A Survey of Text Watermarking in the Era of Large Language Models. (arXiv:2312.07913v4 [cs.CL] UPDATED)
A ripple in time: a discontinuity in American history. (arXiv:2312.01185v3 [cs.CL] UPDATED)
Beyond Turing: A Comparative Analysis of Approaches for Detecting Machine-Generated Text. (arXiv:2311.12373v2 [cs.CL] UPDATED)
Outlier Dimensions Encode Task-Specific Knowledge. (arXiv:2310.17715v2 [cs.CL] UPDATED)
Formally Specifying the High-Level Behavior of LLM-Based Agents. (arXiv:2310.08535v2 [cs.AI] UPDATED)
EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling. (arXiv:2310.04691v5 [cs.CL] UPDATED)
Retrieval meets Long Context Large Language Models. (arXiv:2310.03025v2 [cs.CL] UPDATED)
GenAI Against Humanity: Nefarious Applications of Generative Artificial Intelligence and Large Language Models. (arXiv:2310.00737v3 [cs.CY] UPDATED)
AutomaTikZ: Text-Guided Synthesis of Scientific Vector Graphics with TikZ. (arXiv:2310.00367v2 [cs.CL] UPDATED)
A Multitask Training Approach to Enhance Whisper with Contextual Biasing and Open-Vocabulary Keyword Spotting. (arXiv:2309.09552v3 [cs.AI] UPDATED)
All recent Computation and Language articles on arXiv.org for the Fediverse
Inspired by https://twitter.com/arxiv_cscl