In-Sync: Adaptation of Speech Aware Large Language Models for ASR with Word Level Timestamp Predictions arxiv.org/abs/2604.22817 .AS .CL .LG .SD

When Altruism Meets Autonomy: Managing Bottleneck Congestion with Strategic Autonomous Vehicles arxiv.org/abs/2604.21941 .SY .GT .SY

Conditional Diffusion Posterior Alignment for Sparse-View CT Reconstruction arxiv.org/abs/2604.21960 .IV .CV .LG

Virtualizing the Senses: Enabling High-Precision ISAC on Commercial Cellular Infrastructure arxiv.org/abs/2604.22054 .SP

Empirical Assessment of Time-Series Foundation Models For Power System Forecasting Applications arxiv.org/abs/2604.22077 .SY .SY

A Hybrid Reinforcement and Self-Supervised Learning Aided Benders Decomposition Algorithm arxiv.org/abs/2604.22107 .SY .SY

Characterizing pitch and roll torque coupling in insect-sized flapping-wing robots using a microfabricated gimbal arxiv.org/abs/2604.22121 .SY .RO .SY

FastICA with Learned Scores from the Empirical Characteristic Function arxiv.org/abs/2604.22125 .SP

Beyond Acoustic Sparsity and Linguistic Bias: A Prompt-Free Paradigm for Mispronunciation Detection and Diagnosis arxiv.org/abs/2604.22133 .AS .SD

Sampling-Based Safety Filter with Probabilistic Restrictiveness Guarantee arxiv.org/abs/2604.22149 .SY .SY

A Unified Framework for Ambiguity Function Shaping and PAPR Control in AFDM Systems arxiv.org/abs/2604.22198 .SP

Explainable Speech Emotion Recognition: Weighted Attribute Fairness to Model Demographic Contributions to Social Bias arxiv.org/abs/2604.19763 .AS .AI .CL

Enhancing ASR Performance in the Medical Domain for Dravidian Languages arxiv.org/abs/2604.19797 .AS .AI .CL

Utterance-Level Methods for Identifying Reliable ASR-Output for Child Speech arxiv.org/abs/2604.19801 .AS .AI .CL

Output Feedback Backup Control Barrier Functions: Safety Guarantees Under Input Bounds and State Estimation Error arxiv.org/abs/2604.19893 .SY .SY

New Insights into Channel vs Subspace Codes for Large-Scale Beamspace MIMO Channel Sensing arxiv.org/abs/2604.19904 .SP .IT .IT

Cross-Atlantic Research Agenda for Scalable Grid Architectures and Distributed Flexibility arxiv.org/abs/2604.19933 .SY .SY

Indic-CodecFake meets SATYAM: Towards Detecting Neural Audio Codec Synthesized Speech Deepfakes in Indic Languages arxiv.org/abs/2604.19949 .AS

Algebraic Diversity: Principles of a Group-Theoretic Approach to Signal Processing arxiv.org/abs/2604.19983 .SP .IT .IT
