Show newer

Improving Speech Recognition for African American English With Audio Classification. (arXiv:2309.09996v1 [eess.AS]) arxiv.org/abs/2309.09996

Rely-guarantee Reasoning about Concurrent Memory Management: Correctness, Safety and Security. (arXiv:2309.09997v1 [cs.SE]) arxiv.org/abs/2309.09997

Monitoring Urban Changes in Mariupol/Ukraine in 2022/23. (arXiv:2309.08607v1 [cs.CY]) arxiv.org/abs/2309.08607

Do the Frankenstein, or how to achieve better out-of-distribution performance with manifold mixing model soup. (arXiv:2309.08610v1 [cs.LG]) arxiv.org/abs/2309.08610

Maneuver Decision-Making Through Proximal Policy Optimization And Monte Carlo Tree Search. (arXiv:2309.08611v1 [cs.AI]) arxiv.org/abs/2309.08611

Explaining Vision and Language through Graphs of Events in Space and Time. (arXiv:2309.08612v1 [cs.AI]) arxiv.org/abs/2309.08612

Multimodal Recommender Systems in the Prediction of Disease Comorbidity. (arXiv:2309.08613v1 [cs.IR]) arxiv.org/abs/2309.08613

Analyzing Character and Consciousness in AI-Generated Social Content: A Case Study of Chirper, the AI Social Network. (arXiv:2309.08614v1 [cs.AI]) arxiv.org/abs/2309.08614

Energy Concerns with HPC Systems and Applications. (arXiv:2309.08615v1 [cs.CY]) arxiv.org/abs/2309.08615

Introduction of accelerated BOIN design and facilitation of its application. (arXiv:2309.08616v1 [q-bio.QM]) arxiv.org/abs/2309.08616

Drifter: Efficient Online Feature Monitoring for Improved Data Integrity in Large-Scale Recommendation Systems. (arXiv:2309.08617v1 [cs.IR]) arxiv.org/abs/2309.08617

An Automated and Efficient Aerodynamic Design and Analysis Framework Integrated to PANAIR. (arXiv:2309.07923v1 [cs.CE]) arxiv.org/abs/2309.07923

Hierarchical Audio-Visual Information Fusion with Multi-label Joint Decoding for MER 2023. (arXiv:2309.07925v1 [eess.AS]) arxiv.org/abs/2309.07925

COMPASS: High-Efficiency Deep Image Compression with Arbitrary-scale Spatial Scalability. (arXiv:2309.07926v1 [eess.IV]) arxiv.org/abs/2309.07926

Kid-Whisper: Towards Bridging the Performance Gap in Automatic Speech Recognition for Children VS. Adults. (arXiv:2309.07927v1 [eess.AS]) arxiv.org/abs/2309.07927

Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer. (arXiv:2309.07929v1 [cs.CV]) arxiv.org/abs/2309.07929

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.