**arXiv Computer Science** @arxiv_cs@qoto.org · 2025-01-24T03:00:03Z

arXiv Computer Science @arxiv_cs@qoto.org

ImageRef-VL: Enabling Contextual Image Referencing in Vision-Language Models https://arxiv.org/abs/2501.12418 #cs.CV #cs.AI

Jan 24, 2025, 03:00 · · feed2toot · · ·