Show newer

Exposing AI-Synthesized Human Voices Using Neural Vocoder Artifacts. (arXiv:2302.09198v1 [cs.SD]) arxiv.org/abs/2302.09198

Brainomaly: Unsupervised Neurologic Disease Detection Utilizing Unannotated T1-weighted Brain MR Images. (arXiv:2302.09200v1 [eess.IV]) arxiv.org/abs/2302.09200

Cost-effective Models for Detecting Depression from Speech. (arXiv:2302.09214v1 [cs.SD]) arxiv.org/abs/2302.09214

Domain Agnostic Pipeline for Retina Vessel Segmentation. (arXiv:2302.09215v1 [eess.IV]) arxiv.org/abs/2302.09215

A Review of Codebooks for CSI Feedback in 5G New Radio and Beyond. (arXiv:2302.09222v1 [cs.IT]) arxiv.org/abs/2302.09222

Beamforming and Phase Shift Design for HR-IRS-aided Directional Modulation Network with a Malicious Attacker. (arXiv:2302.09240v1 [cs.IT]) arxiv.org/abs/2302.09240

Visual deep learning-based explanation for neuritic plaques segmentation in Alzheimer's Disease using weakly annotated whole slide histopathological images. (arXiv:2302.08511v1 [eess.IV]) arxiv.org/abs/2302.08511

Computation and Privacy Protection for Satellite-Ground Digital Twin Networks. (arXiv:2302.08525v1 [eess.SP]) arxiv.org/abs/2302.08525

Numerical analysis of a multistable capsule system under the delayed feedback control with a constant delay. (arXiv:2302.08543v1 [eess.SY]) arxiv.org/abs/2302.08543

Speaker Change Detection for Transformer Transducer ASR. (arXiv:2302.08549v1 [eess.AS]) arxiv.org/abs/2302.08549

A New 22 nm ULPLS Architecture to Detect 70 mV Minimum Input, Suitable for IOT Applications. (arXiv:2302.08553v1 [eess.SP]) arxiv.org/abs/2302.08553

Topological Signal Processing over Weighted Simplicial Complexes. (arXiv:2302.08561v1 [eess.SP]) arxiv.org/abs/2302.08561

Adaptable End-to-End ASR Models using Replaceable Internal LMs and Residual Softmax. (arXiv:2302.08579v1 [eess.AS]) arxiv.org/abs/2302.08579

JEIT: Joint End-to-End Model and Internal Language Model Training for Speech Recognition. (arXiv:2302.08583v1 [eess.AS]) arxiv.org/abs/2302.08583

Propagation Measurements and Analyses at 28 GHz via an Autonomous Beam-Steering Platform. (arXiv:2302.08584v1 [eess.SP]) arxiv.org/abs/2302.08584

Frequency-domain Learning for Volumetric-based 3D Data Perception. (arXiv:2302.08595v1 [cs.CV]) arxiv.org/abs/2302.08595

Ultrafast single-channel machine vision based on neuro-inspired photonic computing. (arXiv:2302.07875v1 [physics.optics]) arxiv.org/abs/2302.07875

Multi-Channel Target Speaker Extraction with Refinement: The WavLab Submission to the Second Clarity Enhancement Challenge. (arXiv:2302.07928v1 [eess.AS]) arxiv.org/abs/2302.07928

Self-supervised Registration and Segmentation of the Ossicles with A Single Ground Truth Label. (arXiv:2302.07967v1 [eess.IV]) arxiv.org/abs/2302.07967

Filtered Iterative Denoising for Linear Inverse Problems. (arXiv:2302.07972v1 [eess.IV]) arxiv.org/abs/2302.07972

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.