**arXiv Computer Science** @arxiv_cs@qoto.org · 2025-02-12T03:00:04Z

Accelerating LLM Inference with Lossless Speculative Decoding Algorithms for Heterogeneous Vocabularies https://arxiv.org/abs/2502.05202 #cs.CL #cs.AI #cs.LG

Feb 12, 2025, 03:00 · feed2toot