Show newer

LadderSym: A Multimodal Interleaved Transformer for Music Practice Error Detection arxiv.org/abs/2510.08580 .AS .SD .AI

Evaluating Hallucinations in Multimodal LLMs with Spoken Queries under Diverse Acoustic Conditions arxiv.org/abs/2510.08581 .AS .SD .AI

A Neural Surrogate-Enhanced Multi-Method Framework for Robust Wing Design Optimization arxiv.org/abs/2510.08582 .OC .NE

EGSTalker: Real-Time Audio-Driven Talking Head Generation with Efficient Gaussian Deformation arxiv.org/abs/2510.08587 .AS .SD .AI

Enhancing Biomedical Named Entity Recognition using GLiNER-BioMed with Targeted Dictionary-Based Post-processing for BioASQ 2025 task 6 arxiv.org/abs/2510.08588 .CL

Beyond CNNs: Efficient Fine-Tuning of Multi-Modal LLMs for Object Detection on Low-Data Regimes arxiv.org/abs/2510.08589 .CV .AI

Deep Learning Based Approach to Enhanced Recognition of Emotions and Behavioral Patterns of Autistic Children arxiv.org/abs/2510.07320 .LG .AI .CV .HC

How human is the machine? Evidence from 66,000 Conversations with Large Language Models arxiv.org/abs/2510.07321 -fin.EC .GN .HC

A LoRa IoT Framework with Machine Learning for Remote Livestock Monitoring in Smart Agriculture arxiv.org/abs/2510.07322 .HC

A Modality-Aware Cooperative Co-Evolutionary Framework for Multimodal Graph Neural Architecture Search arxiv.org/abs/2510.07325 .LG .NE

Audio-Visual Separation with Hierarchical Fusion and Representation Alignment arxiv.org/abs/2510.07326 .MM .SD

MultiFair: Multimodal Balanced Fairness-Aware Medical Classification with Dual-Level Gradient Modulation arxiv.org/abs/2510.07328 .LG .AI .CV .CY

A Digital Pheromone-Based Approach for In/Out-of-Control Classification arxiv.org/abs/2510.07329 .SY .NE .SY

Truth-Aware Decoding: A Program-Logic Approach to Factual Language Generation arxiv.org/abs/2510.07331 .AI .LO

Auctioning Future Services in Edge Networks with Moving Vehicles: N-Step Look-Ahead Contracts for Sustainable Resource Provision arxiv.org/abs/2510.07333 .SY .GT .SY

Nonlinear System Identification for Model-Based Control of Waked Wind Turbines arxiv.org/abs/2510.07336 .flu-dyn .SY .SY

Inducing State Anxiety in LLM Agents Reproduces Human-Like Biases in Consumer Decision-Making arxiv.org/abs/2510.06222 -fin.EC .GN .HC

A Multimodal GUI Architecture for Interfacing with LLM-Based Conversational Assistants arxiv.org/abs/2510.06223 .HC .AI

Exploring Human-AI Collaboration Using Mental Models of Early Adopters of Multi-Agent Generative AI Tools arxiv.org/abs/2510.06224 .HC .AI .CY

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.