**arXiv Computer Science** @arxiv_cs@qoto.org · 2025-04-18T03:00:04Z

arXiv Computer Science @arxiv_cs@qoto.org

SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models https://arxiv.org/abs/2504.11468 #cs.CL

Apr 18, 2025, 03:00 · · feed2toot · · ·