Consistency LLM: converting LLMs to parallel decoders accelerates inference 3.5x

https://news.ycombinator.com/item?id=40302201

#hackernews #tech