Show newer

Learning Audio-Visual Dynamics Using Scene Graphs for Audio Source Separation. (arXiv:2210.16472v1 [cs.SD]) arxiv.org/abs/2210.16472

Average Age of Information Penalty of Short-Packet Communications with Packet Management. (arXiv:2210.15672v1 [cs.IT]) arxiv.org/abs/2210.15672

FedAudio: A Federated Learning Benchmark for Audio Tasks. (arXiv:2210.15707v1 [cs.SD]) arxiv.org/abs/2210.15707

A mathematical framework for dynamical social interactions with dissimulation. (arXiv:2210.15712v1 [math.OC]) arxiv.org/abs/2210.15712

Channel State Information-Free Artificial Noise-Aided Location-Privacy Enhancement. (arXiv:2210.15713v1 [eess.SP]) arxiv.org/abs/2210.15713

Simulating realistic speech overlaps improves multi-talker ASR. (arXiv:2210.15715v1 [eess.AS]) arxiv.org/abs/2210.15715

The sample complexity of sparse multi-reference alignment and single-particle cryo-electron microscopy. (arXiv:2210.15727v1 [cs.IT]) arxiv.org/abs/2210.15727

Joint Uplink-Downlink Capacity and Coverage Optimization via Site-Specific Learning of Antenna Settings. (arXiv:2210.15732v1 [eess.SP]) arxiv.org/abs/2210.15732

Token-level Sequence Labeling for Spoken Language Understanding using Compositional End-to-End Models. (arXiv:2210.15734v1 [cs.CL]) arxiv.org/abs/2210.15734

One-Shot Acoustic Matching Of Audio Signals -- Learning to Hear Music In Any Room/ Concert Hall. (arXiv:2210.15750v1 [cs.SD]) arxiv.org/abs/2210.15750

Proceedings of the ACII Affective Vocal Bursts Workshop and Competition 2022 (A-VB): Understanding a critically understudied modality of emotional expression. (arXiv:2210.15754v1 [eess.AS]) arxiv.org/abs/2210.15754

Automated Diagnosis of Cardiovascular Diseases from Cardiac Magnetic Resonance Imaging Using Deep Learning Models: A Review. (arXiv:2210.14909v1 [eess.IV]) arxiv.org/abs/2210.14909

Quadratic approximation based heuristic for optimization-based coordination of automated vehicles in confined areas. (arXiv:2210.14911v1 [math.OC]) arxiv.org/abs/2210.14911

kube-volttron: Rearchitecting the VOLTTRON Building Energy Management System for Cloud Native Deployment. (arXiv:2210.14948v1 [cs.DC]) arxiv.org/abs/2210.14948

Optimising Different Feature Types for Inpainting-based Image Representations. (arXiv:2210.14949v1 [eess.IV]) arxiv.org/abs/2210.14949

SINCO: A Novel structural regularizer for image compression using implicit neural representations. (arXiv:2210.14974v1 [eess.IV]) arxiv.org/abs/2210.14974

Knowledge Transfer For On-Device Speech Emotion Recognition with Neural Structured Learning. (arXiv:2210.14977v1 [cs.SD]) arxiv.org/abs/2210.14977

Interstellar Object Accessibility and Mission Design. (arXiv:2210.14980v1 [astro-ph.EP]) arxiv.org/abs/2210.14980

On the exactness of a stability test for Lur'e systems with slope-restricted nonlinearities. (arXiv:2210.14992v1 [math.OC]) arxiv.org/abs/2210.14992

Privacy-preserving Automatic Speaker Diarization. (arXiv:2210.14995v1 [eess.AS]) arxiv.org/abs/2210.14995

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.