arXiv - CSCL: "Re-ViLM: Retrieval-Augmented Visual Language Mode…" - Qoto Mastodon

arXiv - CSCL @arxiv_cscl@qoto.org

Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning. (arXiv:2302.04858v2 [cs.CV] UPDATED)

http://arxiv.org/abs/2302.04858 #arXiv #NLProc

Oct 24, 2023, 03:19 · · arxiv-cscl · · ·

Sign in to participate in the conversation