Show newer

Integrating multimedia documents in 3D city models for a better understanding of territories arxiv.org/abs/2506.10003 .MM .HC

Immersive Multimedia Communication: State-of-the-Art on eXtended Reality Streaming arxiv.org/abs/2506.10004 .MM .AI .ET .NI

Multimodal Cinematic Video Synthesis Using Text-to-Image and Audio Generation Models arxiv.org/abs/2506.10005 .CV .AI .CL .GR .MM

HER2 Expression Prediction with Flexible Multi-Modal Inputs via Dynamic Bidirectional Reconstruction arxiv.org/abs/2506.10006 .MM .AI .CV .LG

Controllable Expressive 3D Facial Animation via Diffusion in a Unified Multimodal Space arxiv.org/abs/2506.10007 .MM .AI .CV

Structured Graph Representations for Visual Narrative Reasoning: A Hierarchical Framework for Comics arxiv.org/abs/2506.10008 .MM .AI .CV

Llama-Affinity: A Predictive Antibody Antigen Binding Model Integrating Antibody Sequences with Llama3 Backbone Architecture arxiv.org/abs/2506.09052 -bio.QM .LG .AI

MetaInfoSci: An Integrated Web Tool for Scholarly Data Analysis arxiv.org/abs/2506.09056 .data-an .DL

EdgeProfiler: A Fast Profiling Framework for Lightweight LLMs on Edge Using Analytical Model arxiv.org/abs/2506.09061 .DC .AI .PF

ReStNet: A Reusable & Stitchable Network for Dynamic Adaptation on IoT Devices arxiv.org/abs/2506.09066 .CV .AI

Enhancing the Safety of Medical Vision-Language Models by Synthetic Demonstrations arxiv.org/abs/2506.09067 .CV .AI

STREAMINGGS: Voxel-Based Streaming 3D Gaussian Splatting with Memory Optimization and Architectural Support arxiv.org/abs/2506.09070 .GR .AI

Segment Any Architectural Facades (SAAF):An automatic segmentation model for building facades, walls and windows based on multimodal semantics guidance arxiv.org/abs/2506.09071 .CV .AI

SILK: Smooth InterpoLation frameworK for motion in-betweening A Simplified Computational Approach arxiv.org/abs/2506.09075 .GR .CV .LG

Understanding Financial Reasoning in AI: A Multimodal Benchmark and Error Learning Approach arxiv.org/abs/2506.06282 .AI

Facial Foundational Model Advances Early Warning of Coronary Artery Disease from Live Videos with DigitalShadow arxiv.org/abs/2506.06283 .CV .AI

NFISiS: New Perspectives on Fuzzy Inference Systems for Renewable Energy Forecasting arxiv.org/abs/2506.06285 .AI

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.