Show newer

Refining bridge-block decompositions through two-stage and recursive tree partitioning. (arXiv:2110.06998v1 [math.OC]) arxiv.org/abs/2110.06998

Study of positional encoding approaches for Audio Spectrogram Transformers. (arXiv:2110.06999v1 [cs.SD]) arxiv.org/abs/2110.06999

An MILP-based approach to tree partitioning with minimal power flow disruption and generator coherency constraints. (arXiv:2110.07000v1 [math.OC]) arxiv.org/abs/2110.07000

The Design and Simulation of Biomimetic Fish Robot for Aquatic Creature Study. (arXiv:2110.07019v1 [cs.RO]) arxiv.org/abs/2110.07019

Comparison of SVD and factorized TDNN approaches for speech to text. (arXiv:2110.07027v1 [cs.SD]) arxiv.org/abs/2110.07027

Robust MIMO Detection using Hypernetworks with Learned Regularizers. (arXiv:2110.07053v1 [eess.SP]) arxiv.org/abs/2110.07053

Continual learning using lattice-free MMI for speech recognition. (arXiv:2110.07055v1 [eess.AS]) arxiv.org/abs/2110.07055

High-throughput Phenotyping of Nematode Cysts. (arXiv:2110.07057v1 [eess.IV]) arxiv.org/abs/2110.07057

Speech Summarization using Restricted Self-Attention. (arXiv:2110.06263v1 [cs.CL]) arxiv.org/abs/2110.06263

Stacking Integrators Without Sacrificing the Overshoot in Reset Control Systems. (arXiv:2110.06268v1 [eess.SY]) arxiv.org/abs/2110.06268

Toward nonlinear dynamic control over encrypted data for infinite time horizon. (arXiv:2110.06270v1 [eess.SY]) arxiv.org/abs/2110.06270

S3PRL-VC: Open-source Voice Conversion Framework with Self-supervised Speech Representations. (arXiv:2110.06280v1 [cs.SD]) arxiv.org/abs/2110.06280

Tomographic phase and attenuation extraction for a sample composed of unknown materials using X-ray propagation-based phase-contrast imaging. (arXiv:2110.06284v1 [physics.med-ph]) arxiv.org/abs/2110.06284

Switch-based Hybrid Beamforming for Wideband Multi-carrier Communications. (arXiv:2110.06301v1 [eess.SP]) arxiv.org/abs/2110.06301

Generalized Time Domain Velocity Vector. (arXiv:2110.06304v1 [eess.AS]) arxiv.org/abs/2110.06304

Fine-grained style control in Transformer-based Text-to-speech Synthesis. (arXiv:2110.06306v1 [eess.AS]) arxiv.org/abs/2110.06306

Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition. (arXiv:2110.06309v1 [eess.AS]) arxiv.org/abs/2110.06309

DQN-based Beamforming for Uplink mmWave Cellular-Connected UAVs. (arXiv:2110.06318v1 [eess.SP]) arxiv.org/abs/2110.06318

Image Compression and Classification Using Qubits and Quantum Deep Learning. (arXiv:2110.05476v1 [quant-ph]) arxiv.org/abs/2110.05476

UnfairGAN: An Enhanced Generative Adversarial Network for Raindrop Removal from A Single Image. (arXiv:2110.05523v1 [cs.CV]) arxiv.org/abs/2110.05523

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.