**arXiv Computer Science** @arxiv_cs@qoto.org · 2025-11-27T03:00:04Z

arXiv Computer Science @arxiv_cs@qoto.org

arXiv Computer Science @arxiv_cs@qoto.org

Opt4GPTQ: Co-Optimizing Memory and Computation for 4-bit GPTQ Quantized LLM Inference on Heterogeneous Platforms https://arxiv.org/abs/2511.19438 #cs.DC #cs.PF

Nov 27, 2025, 03:00 · · feed2toot · · ·

Trending now

Resources

Developers

What is Mastodon?

qoto.org

More…