Show newer

SatDM: Synthesizing Realistic Satellite Image with Semantic Layout Conditioning using Diffusion Models. (arXiv:2309.16812v1 [cs.CV]) arxiv.org/abs/2309.16812

Reflection Invariance Learning for Few-shot Semantic Segmentation. (arXiv:2309.15850v1 [cs.CV]) arxiv.org/abs/2309.15850

Identifying factors associated with fast visual field progression in patients with ocular hypertension based on unsupervised machine learning. (arXiv:2309.15867v1 [cs.LG]) arxiv.org/abs/2309.15867

Unsupervised Pre-Training for Vietnamese Automatic Speech Recognition in the HYKIST Project. (arXiv:2309.15869v1 [cs.CL]) arxiv.org/abs/2309.15869

X-ray dark-field via spectral propagation-based imaging. (arXiv:2309.15874v1 [physics.med-ph]) arxiv.org/abs/2309.15874

High Perceptual Quality Wireless Image Delivery with Denoising Diffusion Models. (arXiv:2309.15889v1 [eess.IV]) arxiv.org/abs/2309.15889

Quantum computer-enabled receivers for optical communication. (arXiv:2309.15914v1 [quant-ph]) arxiv.org/abs/2309.15914

Exploring Self-Supervised Contrastive Learning of Spatial Sound Event Representation. (arXiv:2309.15938v1 [eess.AS]) arxiv.org/abs/2309.15938

IEEE 802.11be Wi-Fi 7: Feature Summary and Performance Evaluation. (arXiv:2309.15951v1 [cs.NI]) arxiv.org/abs/2309.15951

Linear Progressive Coding for Semantic Communication using Deep Neural Networks. (arXiv:2309.15959v1 [eess.SP]) arxiv.org/abs/2309.15959

Neural Acoustic Context Field: Rendering Realistic Room Impulse Response With Neural Fields. (arXiv:2309.15977v1 [cs.SD]) arxiv.org/abs/2309.15977

A multi-modal approach for identifying schizophrenia using cross-modal attention. (arXiv:2309.15136v1 [eess.SP]) arxiv.org/abs/2309.15136

AsQM: Audio streaming Quality Metric based on Network Impairments and User Preferences. (arXiv:2309.15186v1 [eess.SP]) arxiv.org/abs/2309.15186

Reliable Majority Vote Computation with Complementary Sequences for UAV Waypoint Flight Control. (arXiv:2309.15193v1 [eess.SP]) arxiv.org/abs/2309.15193

Application of reciprocity for facilitation of wave field visualization and defect detection. (arXiv:2309.15198v1 [eess.SP]) arxiv.org/abs/2309.15198

Eve Said Yes: AirBone Authentication for Head-Wearable Smart Voice Assistant. (arXiv:2309.15203v1 [cs.CR]) arxiv.org/abs/2309.15203

Wave-shape Function Model Order Estimation by Trigonometric Regression. (arXiv:2309.15210v1 [eess.SP]) arxiv.org/abs/2309.15210

Fully Adaptive Time-Varying Wave-Shape Model: Applications in Biomedical Signal Processing. (arXiv:2309.15211v1 [eess.SP]) arxiv.org/abs/2309.15211

Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition. (arXiv:2309.15223v1 [cs.CL]) arxiv.org/abs/2309.15223

Collaborative Watermarking for Adversarial Speech Synthesis. (arXiv:2309.15224v1 [eess.AS]) arxiv.org/abs/2309.15224

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.