Show newer

Improving Perceptual Quality by Phone-Fortified Perceptual Loss for Speech Enhancement. (arXiv:2010.15174v1 [cs.SD]) arxiv.org/abs/2010.15174

Ground Roll Suppression using Convolutional Neural Networks. (arXiv:2010.15209v1 [eess.IV]) arxiv.org/abs/2010.15209

Safety-Aware Cascade Controller Tuning Using Constrained Bayesian Optimization. (arXiv:2010.15211v1 [eess.SY]) arxiv.org/abs/2010.15211

Inference of ventricular activation properties from non-invasive electrocardiography. (arXiv:2010.15214v1 [q-bio.TO]) arxiv.org/abs/2010.15214

Accurate Prostate Cancer Detection and Segmentation on Biparametric MRI using Non-local Mask R-CNN with Histopathological Ground Truth. (arXiv:2010.15233v1 [eess.IV]) arxiv.org/abs/2010.15233

Cloud-Based Dynamic Programming for an Electric City Bus Energy Management Considering Real-Time Passenger Load Prediction. (arXiv:2010.15239v1 [eess.SY]) arxiv.org/abs/2010.15239

Semantic video segmentation for autonomous driving. (arXiv:2010.15250v1 [cs.CV]) arxiv.org/abs/2010.15250

DNSMOS: A Non-Intrusive Perceptual Objective Speech Quality metric to evaluate Noise Suppressors. (arXiv:2010.15258v1 [cs.SD]) arxiv.org/abs/2010.15258

Remixing Music with Visual Conditioning. (arXiv:2010.14565v1 [cs.SD]) arxiv.org/abs/2010.14565

Learning Time Reduction Using Warm Start Methods for a Reinforcement Learning Based Supervisory Control in Hybrid Electric Vehicle Applications. (arXiv:2010.14575v1 [cs.RO]) arxiv.org/abs/2010.14575

Nonlinear State-Space Generalizations of Graph Convolutional Neural Networks. (arXiv:2010.14585v1 [eess.SP]) arxiv.org/abs/2010.14585

CopyPaste: An Augmentation Method for Speech Emotion Recognition. (arXiv:2010.14602v1 [cs.SD]) arxiv.org/abs/2010.14602

Cascaded encoders for unifying streaming and non-streaming ASR. (arXiv:2010.14606v1 [eess.AS]) arxiv.org/abs/2010.14606

System Identification via Meta-Learning in Linear Time-Varying Environments. (arXiv:2010.14664v1 [cs.LG]) arxiv.org/abs/2010.14664

Melody-Conditioned Lyrics Generation with SeqGANs. (arXiv:2010.14709v1 [cs.SD]) arxiv.org/abs/2010.14709

CASS-NAT: CTC Alignment-based Single Step Non-autoregressive Transformer for Speech Recognition. (arXiv:2010.14725v1 [eess.AS]) arxiv.org/abs/2010.14725

Continuous Chaotic Nonlinear System and Lyapunov controller Optimization using Deep Learning. (arXiv:2010.14746v1 [eess.SY]) arxiv.org/abs/2010.14746

Linear Predictive Coding as a Valid Approximation of a Mass Spring Damper Model for Acute Stress Prediction from Computer Mouse Movement. (arXiv:2010.13836v1 [eess.SP]) arxiv.org/abs/2010.13836

Improved Supervised Training of Physics-Guided Deep Learning Image Reconstruction with Multi-Masking. (arXiv:2010.13868v1 [eess.IV]) arxiv.org/abs/2010.13868

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.