Show newer

Proposal-based Few-shot Sound Event Detection for Speech and Environmental Sounds with Perceivers. (arXiv:2107.13616v1 [eess.AS]) arxiv.org/abs/2107.13616

Pitch-Informed Instrument Assignment Using a Deep Convolutional Network with Multiple Kernel Shapes. (arXiv:2107.13617v1 [cs.SD]) arxiv.org/abs/2107.13617

Efficient Episodic Learning of Nonstationary and Unknown Zero-Sum Games Using Expert Game Ensembles. (arXiv:2107.13632v1 [cs.GT]) arxiv.org/abs/2107.13632

Neural Remixer: Learning to Remix Music with Interactive Control. (arXiv:2107.13634v1 [eess.AS]) arxiv.org/abs/2107.13634

Lighter Stacked Hourglass Human Pose Estimation. (arXiv:2107.13643v1 [cs.CV]) arxiv.org/abs/2107.13643

A Similarity Measure of Histopathology Images by Deep Embeddings. (arXiv:2107.13703v1 [eess.IV]) arxiv.org/abs/2107.13703

Three-dimensional instantaneous orbit map for rotor-bearing system based on a novel multivariable complex variational mode decomposition algorithm. (arXiv:2107.13740v1 [eess.SP]) arxiv.org/abs/2107.13740

A Novel Passivity-Based Trajectory Tracking Control For Conservative Mechanical Systems. (arXiv:2107.13761v1 [eess.SY]) arxiv.org/abs/2107.13761

Whole Slide Images are 2D Point Clouds: Context-Aware Survival Prediction using Patch-based Graph Convolutional Networks. (arXiv:2107.13048v1 [eess.IV]) arxiv.org/abs/2107.13048

A Highly Linear and Flexible FPGA-Based Time-to-Digital Converter. (arXiv:2107.13053v1 [eess.SY]) arxiv.org/abs/2107.13053

A strawberry harvest-aiding system with crop-transport co-robots: Design, development, and field evaluation. (arXiv:2107.13063v1 [cs.RO]) arxiv.org/abs/2107.13063

A-star path planning simulation for UAS Traffic Management (UTM) application. (arXiv:2107.13103v1 [cs.RO]) arxiv.org/abs/2107.13103

Combining physics-based modeling and deep learning for ultrasound elastography. (arXiv:2107.13120v1 [eess.IV]) arxiv.org/abs/2107.13120

Learning Site-Specific Probing Beams for Fast mmWave Beam Alignment. (arXiv:2107.13121v1 [eess.SP]) arxiv.org/abs/2107.13121

Subjective evaluation of traditional and learning-based image coding methods. (arXiv:2107.13122v1 [cs.CV]) arxiv.org/abs/2107.13122

Insights from Generative Modeling for Neural Video Compression. (arXiv:2107.13136v1 [eess.IV]) arxiv.org/abs/2107.13136

CycleGAN-based Non-parallel Speech Enhancement with an Adaptive Attention-in-attention Mechanism. (arXiv:2107.13143v1 [cs.SD]) arxiv.org/abs/2107.13143

Retinal Microvasculature as Biomarker for Diabetes and Cardiovascular Diseases. (arXiv:2107.13157v1 [eess.IV]) arxiv.org/abs/2107.13157

Asynchronous Distributed Reinforcement Learning for LQR Control via Zeroth-Order Block Coordinate Descent. (arXiv:2107.12416v1 [eess.SY]) arxiv.org/abs/2107.12416

Improving Word Recognition in Speech Transcriptions by Decision-level Fusion of Stemming and Two-way Phoneme Pruning. (arXiv:2107.12428v1 [cs.CL]) arxiv.org/abs/2107.12428

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.