arXiv - CSCL: "MMICL: Empowering Vision-language Model with Mult…" - Qoto Mastodon

arXiv - CSCL @arxiv_cscl@qoto.org

MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning. (arXiv:2309.07915v2 [cs.CL] UPDATED)

http://arxiv.org/abs/2309.07915 #arXiv #NLProc

Oct 03, 2023, 03:18 · · arxiv-cscl · · ·

Sign in to participate in the conversation