Show newer

DiariST: Streaming Speech Translation with Speaker Diarization. (arXiv:2309.08007v2 [eess.AS] UPDATED) 

SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models. (arXiv:2308.16692v2 [cs.CL] UPDATED) 

Large Language Models Vote: Prompting for Rare Disease Identification. (arXiv:2308.12890v3 [cs.CL] UPDATED) 

Human-centered NLP Fact-checking: Co-Designing with Fact-checkers using Matchmaking for AI. (arXiv:2308.07213v2 [cs.HC] UPDATED) 

OWQ: Lessons learned from activation outliers for weight quantization in large language models. (arXiv:2306.02272v3 [cs.CL] UPDATED) 

Learning to Generate Novel Scientific Directions with Contextualized Literature-based Discovery. (arXiv:2305.14259v4 [cs.CL] UPDATED) 

SpecInfer: Accelerating Generative Large Language Model Serving with Tree-based Speculative Inference and Verification. (arXiv:2305.09781v3 [cs.CL] UPDATED) 

Visual Chain of Thought: Bridging Logical Gaps with Multimodal Infillings. (arXiv:2305.02317v3 [cs.CL] UPDATED) 

Energy-based Models are Zero-Shot Planners for Compositional Scene Rearrangement. (arXiv:2304.14391v4 [cs.RO] UPDATED) 

Learning to Generate Questions by Enhancing Text Generation with Sentence Selection. (arXiv:2212.12192v2 [cs.CL] UPDATED) 

AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents. (arXiv:2401.12963v1 [cs.RO]) 

Meta-Prompting: Enhancing Language Models with Task-Agnostic Scaffolding. (arXiv:2401.12954v1 [cs.CL]) 

Transformer-Based Models Are Not Yet Perfect At Learning to Emulate Structural Recursion. (arXiv:2401.12947v1 [cs.CL]) 

Multicultural Name Recognition For Previously Unseen Names. (arXiv:2401.12941v1 [cs.CL]) 

Red Teaming Visual Language Models. (arXiv:2401.12915v1 [cs.AI]) 

From Understanding to Utilization: A Survey on Explainability for Large Language Models. (arXiv:2401.12874v1 [cs.CL]) 

Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model. (arXiv:2401.12873v1 [cs.CL]) 

KAM-CoT: Knowledge Augmented Multimodal Chain-of-Thoughts Reasoning. (arXiv:2401.12863v1 [cs.CL]) 

Gradient Flow of Energy: A General and Efficient Approach for Entity Alignment Decoding. (arXiv:2401.12798v1 [cs.IR]) 

Benchmarking LLMs via Uncertainty Quantification. (arXiv:2401.12794v1 [cs.CL]) 

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.