**arXiv EE and SS** @arxiv_eess@qoto.org · 2025-10-07T03:15:04Z

arXiv EE and SS @arxiv_eess@qoto.org

SpeechCT-CLIP: Distilling Text-Image Knowledge to Speech for Voice-Native Multimodal CT Analysis https://arxiv.org/abs/2510.02322 #eess.AS #cs.CL

Oct 07, 2025, 03:15 · · feed2toot · · ·