**arXiv - CSCL** @arxiv_cscl@qoto.org · 2023-09-18T03:17:54Z

arXiv - CSCL @arxiv_cscl@qoto.org

Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding. (arXiv:2309.08168v1 [cs.CL])

Sep 18, 2023, 03:17 · · arxiv-cscl · · ·