Building High-accuracy Multilingual ASR with Gated Language Experts and Curriculum Training. (arXiv:2303.00786v1 [cs.CL]) arxiv.org/abs/2303.00786

Improved Segmentation of Deep Sulci in Cortical Gray Matter Using a Deep Learning Framework Incorporating Laplace's Equation. (arXiv:2303.00795v1 [eess.IV]) arxiv.org/abs/2303.00795

Synthetic Cross-accent Data Augmentation for Automatic Speech Recognition. (arXiv:2303.00802v1 [cs.CL]) arxiv.org/abs/2303.00802

Ego-noise reduction of a mobile robot using noise spatial covariance matrix learning and minimum variance distortionless response. (arXiv:2303.00829v1 [eess.AS]) arxiv.org/abs/2303.00829

DISPLACE Challenge: DIarization of SPeaker and LAnguage in Conversational Environments. (arXiv:2303.00830v1 [eess.AS]) arxiv.org/abs/2303.00830

Distributed Adaptive Norm Estimation for Blind System Identification in Wireless Sensor Networks. (arXiv:2303.00832v1 [eess.SP]) arxiv.org/abs/2303.00832

Distortion Minimization with Age of Information and Cost Constraints. (arXiv:2303.00850v1 [cs.IT]) arxiv.org/abs/2303.00850

SLAS: Speed and Lane Advisory System for Highway Navigation. (arXiv:2303.00861v1 [cs.RO]) arxiv.org/abs/2303.00861

State estimation for control: an approach for output-feedback stochastic MPC. (arXiv:2303.00873v1 [math.OC]) arxiv.org/abs/2303.00873

ClArTTS: An Open-Source Classical Arabic Text-to-Speech Corpus. (arXiv:2303.00069v1 [cs.CL]) arxiv.org/abs/2303.00069

Improving Medical Speech-to-Text Accuracy with Vision-Language Pre-training Model. (arXiv:2303.00091v1 [eess.AS]) arxiv.org/abs/2303.00091

A study on the use of perceptual hashing to detect manipulation of embedded messages in images. (arXiv:2303.00092v1 [cs.CR]) arxiv.org/abs/2303.00092

PixCUE -- Joint Uncertainty Estimation and Image Reconstruction in MRI using Deep Pixel Classification. (arXiv:2303.00111v1 [eess.IV]) arxiv.org/abs/2303.00111

A Low-Complexity Solution to Sum Rate Maximization for IRS-assisted SWIPT-MIMO Broadcasting. (arXiv:2303.00131v1 [eess.SP]) arxiv.org/abs/2303.00131

Containing a spread through sequential learning: to exploit or to explore?. (arXiv:2303.00141v1 [cs.LG]) arxiv.org/abs/2303.00141

I Know Your Feelings Before You Do: Predicting Future Affective Reactions in Human-Computer Dialogue. (arXiv:2303.00146v1 [cs.HC]) arxiv.org/abs/2303.00146

Exponential Consensus of Multiple Agents over Dynamic Network Topology: Controllability, Connectivity, and Compactness. (arXiv:2303.00155v1 [eess.SY]) arxiv.org/abs/2303.00155

On Parametric Misspecified Bayesian Cramér-Rao bound: An application to linear Gaussian systems. (arXiv:2303.00160v1 [math.ST]) arxiv.org/abs/2303.00160

DTW-SiameseNet: Dynamic Time Warped Siamese Network for Mispronunciation Detection and Correction. (arXiv:2303.00171v1 [cs.LG]) arxiv.org/abs/2303.00171
