Show newer

Voicing Personas: Rewriting Persona Descriptions into Style Prompts for Controllable Text-to-Speech arxiv.org/abs/2505.17093 .AS .CL

Assessing the generalization performance of SAM for ureteroscopy scene understanding arxiv.org/abs/2505.17210 .IV .AI .CV .LG

IAE Optimized PID Tuning via Second Order Step Response Target Matching arxiv.org/abs/2505.17268 .SY .SY

Navigating Polytopes with Safety: A Control Barrier Function Approach arxiv.org/abs/2505.17270 .SY .RO .SY

Control of Renewable Energy Communities using AI and Real-World Data arxiv.org/abs/2505.17321 .SY .AI .SY

Low-Rank Adaptation of Pre-trained Vision Backbones for Energy-Efficient Image Coding for Machine arxiv.org/abs/2505.17366 .IV

State of health prediction of lithium-ion batteries for driving conditions based on full parameter domain sparrow search algorithm and dual-module bidirectional gated recurrent unit arxiv.org/abs/2505.17405 .SY .SY

A Comprehensive Review of Techniques, Algorithms, Advancements, Challenges, and Clinical Applications of Multi-modal Medical Image Fusion for Improved Diagnosis arxiv.org/abs/2505.14715 .IV .CV

A Hybrid Quantum Classical Pipeline for X Ray Based Fracture Diagnosis arxiv.org/abs/2505.14716 .IV .CV .ET .LG

Aneumo: A Large-Scale Multimodal Aneurysm Dataset with Computational Fluid Dynamics Simulations and Deep Learning Benchmarks arxiv.org/abs/2505.14717 .IV .AI .CV .LG

QUADS: QUAntized Distillation Framework for Efficient Speech Language Understanding arxiv.org/abs/2505.14723 .AS .AI .CL .LG .SD

LOD1 3D City Model from LiDAR: The Impact of Segmentation Accuracy on Quality of Urban 3D Modeling and Morphology Extraction arxiv.org/abs/2505.14747 .IV .CV .LG

TransMedSeg: A Transferable Semantic Framework for Semi-Supervised Medical Image Segmentation arxiv.org/abs/2505.14753 .IV .AI .CV

Model-Independent Machine Learning Approach for Nanometric Axial Localization and Tracking arxiv.org/abs/2505.14754 .ins-det -ph.IM .IV .CV .LG

A Compact Narrowband Antenna Design for RF Fingerprinting Applications arxiv.org/abs/2505.14764 .SP

Virtual Fluoroscopy for Interventional Guidance using Magnetic Tracking arxiv.org/abs/2505.14854 .med-ph .IV .SY .SY

Exploring Emotional Synchrony in Dyadic Interactions: The Role of Speech Conditions in Facial and Vocal Affective Alignment arxiv.org/abs/2505.13455 .AS .AI

SPIRIT: Patching Speech Language Models against Jailbreak Attacks arxiv.org/abs/2505.13541 .AS .LG

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.