arXiv - CSCL: "Mementos: A Comprehensive Benchmark for Multimoda…" - Qoto Mastodon

arXiv - CSCL @arxiv_cscl@qoto.org

Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences. (arXiv:2401.10529v2 [cs.CV] UPDATED)

http://arxiv.org/abs/2401.10529 #arXiv #NLProc

Jan 28, 2024, 03:17 · · arxiv-cscl · · ·

Sign in to participate in the conversation