arXiv - CSCL @arxiv_cscl@qoto.org

STOA-VLP: Spatial-Temporal Modeling of Object and Action for Video-Language Pre-training. (arXiv:2302.09736v2 [cs.CV] UPDATED)

http://arxiv.org/abs/2302.09736 #arXiv #NLProc

May 25, 2023, 03:07 · arxiv-cscl
