Show newer

Unsupervised Pre-Training for Vietnamese Automatic Speech Recognition in the HYKIST Project. (arXiv:2309.15869v1 [cs.CL]) arxiv.org/abs/2309.15869

X-ray dark-field via spectral propagation-based imaging. (arXiv:2309.15874v1 [physics.med-ph]) arxiv.org/abs/2309.15874

High Perceptual Quality Wireless Image Delivery with Denoising Diffusion Models. (arXiv:2309.15889v1 [eess.IV]) arxiv.org/abs/2309.15889

Quantum computer-enabled receivers for optical communication. (arXiv:2309.15914v1 [quant-ph]) arxiv.org/abs/2309.15914

Exploring Self-Supervised Contrastive Learning of Spatial Sound Event Representation. (arXiv:2309.15938v1 [eess.AS]) arxiv.org/abs/2309.15938

IEEE 802.11be Wi-Fi 7: Feature Summary and Performance Evaluation. (arXiv:2309.15951v1 [cs.NI]) arxiv.org/abs/2309.15951

Linear Progressive Coding for Semantic Communication using Deep Neural Networks. (arXiv:2309.15959v1 [eess.SP]) arxiv.org/abs/2309.15959

Neural Acoustic Context Field: Rendering Realistic Room Impulse Response With Neural Fields. (arXiv:2309.15977v1 [cs.SD]) arxiv.org/abs/2309.15977

A multi-modal approach for identifying schizophrenia using cross-modal attention. (arXiv:2309.15136v1 [eess.SP]) arxiv.org/abs/2309.15136

AsQM: Audio streaming Quality Metric based on Network Impairments and User Preferences. (arXiv:2309.15186v1 [eess.SP]) arxiv.org/abs/2309.15186

Reliable Majority Vote Computation with Complementary Sequences for UAV Waypoint Flight Control. (arXiv:2309.15193v1 [eess.SP]) arxiv.org/abs/2309.15193

Application of reciprocity for facilitation of wave field visualization and defect detection. (arXiv:2309.15198v1 [eess.SP]) arxiv.org/abs/2309.15198

Eve Said Yes: AirBone Authentication for Head-Wearable Smart Voice Assistant. (arXiv:2309.15203v1 [cs.CR]) arxiv.org/abs/2309.15203

Wave-shape Function Model Order Estimation by Trigonometric Regression. (arXiv:2309.15210v1 [eess.SP]) arxiv.org/abs/2309.15210

Fully Adaptive Time-Varying Wave-Shape Model: Applications in Biomedical Signal Processing. (arXiv:2309.15211v1 [eess.SP]) arxiv.org/abs/2309.15211

Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition. (arXiv:2309.15223v1 [cs.CL]) arxiv.org/abs/2309.15223

Collaborative Watermarking for Adversarial Speech Synthesis. (arXiv:2309.15224v1 [eess.AS]) arxiv.org/abs/2309.15224

Critical Infrastructure Security Goes to Space: Leveraging Lessons Learned on the Ground. (arXiv:2309.15232v1 [cs.CR]) arxiv.org/abs/2309.15232

Integration of Polyimide Flexible PCB Wings in Northeastern Aerobat. (arXiv:2309.14346v1 [cs.RO]) arxiv.org/abs/2309.14346

Continuous-time control synthesis under nested signal temporal logic specifications. (arXiv:2309.14347v1 [eess.SY]) arxiv.org/abs/2309.14347

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.