arXiv - CSCL: "MMSpeech: Multi-modal Multi-task Encoder-Decoder …" - Qoto Mastodon

arXiv - CSCL @arxiv_cscl@qoto.org

MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for Speech Recognition. (arXiv:2212.00500v1 [cs.MM])

http://arxiv.org/abs/2212.00500 #arXiv #NLProc

Dec 02, 2022, 03:10 · · arxiv-cscl · · ·

Sign in to participate in the conversation