Show newer

Comparative Analysis of Personalized Voice Activity Detection Systems: Assessing Real-World Effectiveness arxiv.org/abs/2406.09443 .AS .HC .LG

GenDistiller: Distilling Pre-trained Language Models based on an Autoregressive Generative Model arxiv.org/abs/2406.09444 .AS .CL .SD

Validation of human benchmark models for Automated Driving System approval: How competent and careful are they really? arxiv.org/abs/2406.09493 .SY .SY

The Second DISPLACE Challenge : DIarization of SPeaker and LAnguage in Conversational Environments arxiv.org/abs/2406.09494 .AS .LG

Multi-Channel Multi-Speaker ASR Using Target Speaker's Solo Segment arxiv.org/abs/2406.09589 .AS

Hierarchical Control for Vehicle Repositioning in Autonomous Mobility on Demand Systems arxiv.org/abs/2406.09609 .SY .SY

Efficient Personalization of Amplification in Hearing Aids via Multi-band Bayesian Machine Learning arxiv.org/abs/2406.09634 .AS .SP

Machine learning-based Near-field Emitter Localization via Grouped Hybrid Analog and Digital Massive MIMO Receive Array arxiv.org/abs/2406.09695 .SP

DB3V: A Dialect Dominated Dataset of Bird Vocalisation for Cross-corpus Bird Species Recognition arxiv.org/abs/2406.08517 .AS .SD

A Plug-and-Play Untrained Neural Network for Full Waveform Inversion in Reconstructing Sound Speed Images of Ultrasound Computed Tomography arxiv.org/abs/2406.08523 .IV

Safety-Driven Battery Charging: A Fisher Information-guided Adaptive MPC with Real-time Parameter Identification arxiv.org/abs/2406.08626 .SY .SY

Unveiling Incomplete Modality Brain Tumor Segmentation: Leveraging Masked Predicted Auto-Encoder and Divergence Learning arxiv.org/abs/2406.08634 .IV .CV .LG

Toward Fully-End-to-End Listened Speech Decoding from EEG Signals arxiv.org/abs/2406.08644 .SP .AS .AI .SD

Data-driven Thermal Modeling for Electrically Excited Synchronous Motors -- A Supervised Machine Learning Approach arxiv.org/abs/2406.08708 .SY .SY

Real-time Digital RF Emulation -- I: The Direct Path Computational Model arxiv.org/abs/2406.08710 .SP

Real-time Digital RF Emulation -- II: A Near Memory Custom Accelerator arxiv.org/abs/2406.08714 .SP

Towards objective and interpretable speech disorder assessment: a comparative analysis of CNN and transformer-based models arxiv.org/abs/2406.07576 .AS .LG .SD

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.