Show newer

Performance optimizations on deep noise suppression models. (arXiv:2110.04378v1 [eess.AS]) arxiv.org/abs/2110.04378

Individualized Hear-through For Acoustic Transparency Using PCA-Based Sound Pressure Estimation At The Eardrum. (arXiv:2110.04385v1 [eess.AS]) arxiv.org/abs/2110.04385

Aura: Privacy-preserving augmentation to improve test set diversity in noise suppression applications. (arXiv:2110.04391v1 [eess.AS]) arxiv.org/abs/2110.04391

Atomic Norm Based Localization and Orientation Estimation for Millimeter-Wave MIMO OFDM Systems. (arXiv:2110.04401v1 [eess.SP]) arxiv.org/abs/2110.04401

TitaNet: Neural Model for speaker representation with 1D Depth-wise separable convolutions and global context. (arXiv:2110.04410v1 [eess.AS]) arxiv.org/abs/2110.04410

Towards Lightweight Applications: Asymmetric Enroll-Verify Structure for Speaker Verification. (arXiv:2110.04438v1 [cs.SD]) arxiv.org/abs/2110.04438

Learning Higher-Order Dynamics in Video-Based Cardiac Measurement. (arXiv:2110.03690v1 [eess.IV]) arxiv.org/abs/2110.03690

Direct design of biquad filter cascades with deep learning by sampling random polynomials. (arXiv:2110.03691v1 [eess.SP]) arxiv.org/abs/2110.03691

Power efficient analog features for audio recognition. (arXiv:2110.03715v1 [eess.AS]) arxiv.org/abs/2110.03715

Robustness to Incorrect Priors and Controlled Filter Stability in Partially Observed Stochastic Control. (arXiv:2110.03720v1 [math.OC]) arxiv.org/abs/2110.03720

Sequence-To-Sequence Voice Conversion using F0 and Time Conditioning and Adversarial Learning. (arXiv:2110.03744v1 [cs.SD]) arxiv.org/abs/2110.03744

Discomfort Monitoring System for Residential Electrical Water Heater. (arXiv:2110.03751v1 [eess.SY]) arxiv.org/abs/2110.03751

Sonorant spectra and coarticulation distinguish speakers with different dialects. (arXiv:2110.03756v1 [cs.CL]) arxiv.org/abs/2110.03756

Label Propagation across Graphs: Node Classification using Graph Neural Tangent Kernels. (arXiv:2110.03763v1 [cs.LG]) arxiv.org/abs/2110.03763

Wake-Cough: cough spotting and cougher identification for personalised long-term cough monitoring. (arXiv:2110.03771v1 [cs.SD]) arxiv.org/abs/2110.03771

Federated Learning via Plurality Vote. (arXiv:2110.02998v1 [cs.LG]) arxiv.org/abs/2110.02998

Predictability and Fairness in Load Aggregation and Operations of Virtual Power Plants. (arXiv:2110.03001v1 [math.OC]) arxiv.org/abs/2110.03001

Multi-Scale Convolutional Neural Network for Automated AMD Classification using Retinal OCT Images. (arXiv:2110.03002v1 [eess.IV]) arxiv.org/abs/2110.03002

AECMOS: A speech quality assessment metric for echo impairment. (arXiv:2110.03010v1 [eess.AS]) arxiv.org/abs/2110.03010

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.