Show newer

Dual Causal/Non-Causal Self-Attention for Streaming End-to-End Speech Recognition. (arXiv:2107.01269v1 [eess.AS]) arxiv.org/abs/2107.01269

Unbiasing Procedures for Scale-invariant Multi-reference Alignment. (arXiv:2107.01274v1 [eess.SP]) arxiv.org/abs/2107.01274

Relaxed Attention: A Simple Method to Boost Performance of End-to-End Automatic Speech Recognition. (arXiv:2107.01275v1 [eess.AS]) arxiv.org/abs/2107.01275

A study of CNN capacity applied to Left Venticle Segmentation in Cardiac MRI. (arXiv:2107.01318v1 [eess.IV]) arxiv.org/abs/2107.01318

Physical Layer Security for NOMA-Enabled Multi-Access Edge Computing Wireless Networks. (arXiv:2107.01322v1 [cs.IT]) arxiv.org/abs/2107.01322

VinDr-RibCXR: A Benchmark Dataset for Automatic Segmentation and Labeling of Individual Ribs on Chest X-rays. (arXiv:2107.01327v1 [eess.IV]) arxiv.org/abs/2107.01327

The HCCL Speaker Verification System for Far-Field Speaker Verification Challenge. (arXiv:2107.01329v1 [cs.SD]) arxiv.org/abs/2107.01329

SPI-GAN: Towards Single-Pixel Imaging through Generative Adversarial Network. (arXiv:2107.01330v1 [cs.CV]) arxiv.org/abs/2107.01330

Inter-Beat Interval Estimation with Tiramisu Model: A Novel Approach with Reduced Error. (arXiv:2107.00693v1 [eess.SP]) arxiv.org/abs/2107.00693

Precise Feature Selection and Case Study of Intrusion Detection in an Industrial Control System (ICS) Environment. (arXiv:2107.00705v1 [eess.SP]) arxiv.org/abs/2107.00705

Normalizing Flow based Hidden Markov Models for Classification of Speech Phones with Explainability. (arXiv:2107.00730v1 [cs.LG]) arxiv.org/abs/2107.00730

EMG-Based Feature Extraction and Classification for Prosthetic Hand Control. (arXiv:2107.00733v1 [eess.SP]) arxiv.org/abs/2107.00733

Design Optimization of Monoblade Autorotating Pods To Exhibit an Unconventional Descent Technique Using Glauert's Modelling. (arXiv:2107.00738v1 [cs.RO]) arxiv.org/abs/2107.00738

Geometric Machine Learning for Channel Covariance Estimation in Vehicular Networks. (arXiv:2107.00759v1 [eess.SP]) arxiv.org/abs/2107.00759

Combining Frame-Synchronous and Label-Synchronous Systems for Speech Recognition. (arXiv:2107.00764v1 [eess.AS]) arxiv.org/abs/2107.00764

Autonomous Navigation for Quadrupedal Robots with Optimized Jumping through Constrained Obstacles. (arXiv:2107.00773v1 [cs.RO]) arxiv.org/abs/2107.00773

Reinforcement Learning for Feedback-Enabled Cyber Resilience. (arXiv:2107.00783v1 [cs.CR]) arxiv.org/abs/2107.00783

Precoder Design for Physical-Layer Security and Authentication in Massive MIMO UAV Communications. (arXiv:2107.00799v1 [eess.SP]) arxiv.org/abs/2107.00799

Computationally efficient spatial rendering of late reverberation in virtual acoustic environments. (arXiv:2107.00004v1 [eess.AS]) arxiv.org/abs/2107.00004

Which Echo Chamber? Regions of Attraction in Learning with Decision-Dependent Distributions. (arXiv:2107.00055v1 [cs.LG]) arxiv.org/abs/2107.00055

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.