Low-rank on Graphs plus Temporally Smooth Sparse Decomposition for Anomaly Detection in Spatiotemporal Data. (arXiv:2010.12633v1 [cs.LG]) arxiv.org/abs/2010.12633

SpeakerNet: 1D Depth-wise Separable Convolutional Network for Text-Independent Speaker Recognition and Verification. (arXiv:2010.12653v1 [eess.AS]) arxiv.org/abs/2010.12653

On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer. (arXiv:2010.12673v1 [cs.CL]) arxiv.org/abs/2010.12673

Super-Resolution Reconstruction of Interval Energy Data. (arXiv:2010.12678v1 [eess.SP]) arxiv.org/abs/2010.12678

Loss-analysis via Attention-scale for Physiologic Time Series. (arXiv:2010.12690v1 [cs.LG]) arxiv.org/abs/2010.12690

Exploring Multi-Channel Features for Speaker Verification with Joint VAD and Speech Enhancement. (arXiv:2010.12692v1 [eess.AS]) arxiv.org/abs/2010.12692

Deep Convolutional Neural Networks Model-based Brain Tumor Detection in Brain MRI Images. (arXiv:2010.11978v1 [eess.IV]) arxiv.org/abs/2010.11978

Atlas Fusion -- Modern Framework for Autonomous Agent Sensor Data Fusion. (arXiv:2010.11991v1 [cs.RO]) arxiv.org/abs/2010.11991

Unsupervised deep learning for grading of age-related macular degeneration using retinal fundus images. (arXiv:2010.11993v1 [cs.CV]) arxiv.org/abs/2010.11993

CellCycleGAN: Spatiotemporal Microscopy Image Synthesis of Cell Populations using Statistical Shape Models and Conditional GANs. (arXiv:2010.12011v1 [eess.IV]) arxiv.org/abs/2010.12011

Listening to Sounds of Silence for Speech Denoising. (arXiv:2010.12013v1 [cs.SD]) arxiv.org/abs/2010.12013

Sequence-to-sequence Singing Voice Synthesis with Perceptual Entropy Loss. (arXiv:2010.12024v1 [eess.AS]) arxiv.org/abs/2010.12024

Combination of Deep Speaker Embeddings for Diarisation. (arXiv:2010.12025v1 [cs.SD]) arxiv.org/abs/2010.12025

Automating Abnormality Detection in Musculoskeletal Radiographs through Deep Learning. (arXiv:2010.12030v1 [eess.IV]) arxiv.org/abs/2010.12030

Deep Image Prior for Sparse-sampling Photoacoustic Microscopy. (arXiv:2010.12041v1 [eess.IV]) arxiv.org/abs/2010.12041

Explaining Neural Network Predictions for Functional Data Using Principal Component Analysis and Feature Importance. (arXiv:2010.12063v1 [cs.LG]) arxiv.org/abs/2010.12063

AttendAffectNet: Self-Attention based Networks for Predicting Affective Responses from Movies. (arXiv:2010.11188v1 [cs.SD]) arxiv.org/abs/2010.11188

DC Microgrid State Estimation and Sensor Placement Based on Compressive Sensing. (arXiv:2010.11218v1 [eess.SY]) arxiv.org/abs/2010.11218

Learning Speaker Embedding from Text-to-Speech. (arXiv:2010.11221v1 [eess.AS]) arxiv.org/abs/2010.11221

Dynamic Layer Customization for Noise Robust Speech Emotion Recognition in Heterogeneous Condition Training. (arXiv:2010.11226v1 [cs.SD]) arxiv.org/abs/2010.11226
