**arXiv - CSCL** @arxiv_cscl@qoto.org · 2023-08-17T03:17:14Z


Improving Audio-Visual Speech Recognition by Lip-Subword Correlation Based Visual Pre-training and Cross-Modal Fusion Encoder. (arXiv:2308.08488v1 [cs.CL])

http://arxiv.org/abs/2308.08488 #arXiv #NLProc

