arXiv - CSCL: "SkipDecode: Autoregressive Skip Decoding with Bat…" - Qoto Mastodon

arXiv - CSCL @arxiv_cscl@qoto.org

SkipDecode: Autoregressive Skip Decoding with Batching and Caching for Efficient LLM Inference. (arXiv:2307.02628v1 [cs.CL])

http://arxiv.org/abs/2307.02628 #arXiv #NLProc

Jul 09, 2023, 03:30 · arxiv-cscl
