Show newer

Computer Aided Detection and Classification of mammograms using Convolutional Neural Network arxiv.org/abs/2409.16290 .IV .CV

Efficient Training of Self-Supervised Speech Foundation Models on a Compute Budget arxiv.org/abs/2409.16295 .AS .CL .LG .SD

How Redundant Is the Transformer Stack in Speech Representation Models? arxiv.org/abs/2409.16302 .AS .CL .LG .SD

A Literature Review of Keyword Spotting Technologies for Urdu arxiv.org/abs/2409.16317 .AS .AI .CL .LG .SD

Towards Within-Class Variation in Alzheimer's Disease Detection from Spontaneous Speech arxiv.org/abs/2409.16322 -bio.NC .AS .AI .CL .LG .SD

Future-Proofing Medical Imaging with Privacy-Preserving Federated Learning and Uncertainty Quantification: A Review arxiv.org/abs/2409.16340 .IV .AI .CV

Transformer based time series prediction of the maximum power point for solar photovoltaic cells arxiv.org/abs/2409.16342 .SY .LG .SY

Active Perception with Initial-State Uncertainty: A Policy Gradient Method arxiv.org/abs/2409.16439 .SY .SY

A novel open-source ultrasound dataset with deep learning benchmarks for spinal cord injury localization and anatomical segmentation arxiv.org/abs/2409.16441 .IV .CV .LG

A Multi-Agent Multi-Environment Mixed Q-Learning for Partially Decentralized Wireless Network Optimization arxiv.org/abs/2409.16450 .SP .LG

Equivariance-based self-supervised learning for audio signal recovery from clipped measurements arxiv.org/abs/2409.15283 .AS .SP .IR .LG .SD

Joint LOS Identification and Data Association for 6G-Enabled Networked Device-Free Sensing arxiv.org/abs/2409.15309 .SP .IT .IT

WaveTransfer: A Flexible End-to-end Multi-instrument Timbre Transfer with Diffusion arxiv.org/abs/2409.15321 .AS .SD

A Lightweight GAN-Based Image Fusion Algorithm for Visible and Infrared Images arxiv.org/abs/2409.15332 .IV .CV

A Large Dataset of Spontaneous Speech with the Accent Spoken in S\~ao Paulo for Automatic Speech Recognition Evaluation arxiv.org/abs/2409.15350 .AS .CL

Contextualization of ASR with LLM using phonetic retrieval-based augmentation arxiv.org/abs/2409.15353 .AS .CL .LG .SD

TCG CREST System Description for the Second DISPLACE Challenge arxiv.org/abs/2409.15356 .AS .LG .SD

A Joint Spectro-Temporal Relational Thinking Based Acoustic Modeling Framework arxiv.org/abs/2409.15357 .AS .CL .LG .SD

Explainable AI for Autism Diagnosis: Identifying Critical Brain Regions Using fMRI Data arxiv.org/abs/2409.15374 .IV .AI .CV .LG

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.