**arXiv EE and SS** @arxiv_eess@qoto.org · 2024-01-24T03:15:04Z

arXiv EE and SS @arxiv_eess@qoto.org

arXiv EE and SS @arxiv_eess@qoto.org

CoAVT: A Cognition-Inspired Unified Audio-Visual-Text Pre-Training Model for Multimodal Processing. (arXiv:2401.12264v1 [eess.AS]) http://arxiv.org/abs/2401.12264

Jan 24, 2024, 03:15 · · feed2toot · · ·

Resources

Developers

What is Mastodon?

qoto.org

More…