Single-channel speech enhancement using learnable loss mixup. (arXiv:2312.17255v1 [eess.AS]) arxiv.org/abs/2312.17255

$\mu$-Net: ConvNext-Based U-Nets for Cosmic Muon Tomography. (arXiv:2312.17265v1 [cs.CV]) arxiv.org/abs/2312.17265

Automatic laminectomy cutting plane planning based on artificial intelligence in robot assisted laminectomy surgery. (arXiv:2312.17266v1 [eess.IV]) arxiv.org/abs/2312.17266

Stateful FastConformer with Cache-based Inference for Streaming Automatic Speech Recognition. (arXiv:2312.17279v1 [cs.CL]) arxiv.org/abs/2312.17279

Revolutionizing Personalized Voice Synthesis: The Journey towards Emotional and Individual Authenticity with DIVSE (Dynamic Individual Voice Synthesis Engine). (arXiv:2312.17281v1 [cs.SD]) arxiv.org/abs/2312.17281

Nonlinear energy harvesting system with multiple stability. (arXiv:2312.17282v1 [eess.SY]) arxiv.org/abs/2312.17282

Dynamic Decision Making in Engineering System Design: A Deep Q-Learning Approach. (arXiv:2312.17284v1 [cs.LG]) arxiv.org/abs/2312.17284

Combining Convolution Neural Networks with Long-Short Time Memory Layers to Predict Parkinson's Disease Progression. (arXiv:2312.17290v1 [eess.IV]) arxiv.org/abs/2312.17290

$\mu$GUIDE: a framework for microstructure imaging via generalized uncertainty-driven inference using deep learning. (arXiv:2312.17293v1 [eess.IV]) arxiv.org/abs/2312.17293

AQUALLM: Audio Question Answering Data Generation Using Large Language Models. (arXiv:2312.17343v1 [cs.CL]) arxiv.org/abs/2312.17343

Deformable Audio Transformer for Audio Event Detection. (arXiv:2312.16228v1 [cs.SD]) arxiv.org/abs/2312.16228

Proximal Gradient Descent Unfolding Dense-spatial Spectral-attention Transformer for Compressive Spectral Imaging. (arXiv:2312.16237v1 [eess.SP]) arxiv.org/abs/2312.16237

Toward Accurate and Temporally Consistent Video Restoration from Raw Data. (arXiv:2312.16247v1 [cs.CV]) arxiv.org/abs/2312.16247

In-Lab Implementation of DSRC PHY Layer. (arXiv:2312.16255v1 [eess.SP]) arxiv.org/abs/2312.16255

Joint Planning of Active Distribution Network and EV Charging Stations Considering Vehicle-to-Grid Functionality and Reactive Power Support. (arXiv:2312.16258v1 [eess.SY]) arxiv.org/abs/2312.16258

Early and Accurate Detection of Tomato Leaf Diseases Using TomFormer. (arXiv:2312.16331v1 [eess.IV]) arxiv.org/abs/2312.16331

Frame Structure and Protocol Design for Sensing-Assisted NR-V2X Communications. (arXiv:2312.16381v1 [eess.SP]) arxiv.org/abs/2312.16381

Frame-level emotional state alignment method for speech emotion recognition. (arXiv:2312.16383v1 [cs.SD]) arxiv.org/abs/2312.16383

Maximum Likelihood CFO Estimation for High-Mobility OFDM Systems: A Chinese Remainder Theorem Based Method. (arXiv:2312.16386v1 [eess.SP]) arxiv.org/abs/2312.16386

Sharp inequality for $\ell_p$ quasi-norm and $\ell_q$-norm with $0<p\leq 1$ and $q>1$. (arXiv:2312.16394v1 [eess.SP]) arxiv.org/abs/2312.16394
