Show newer

Towards Affordable, Adaptive and Automatic GNN Training on CPU-GPU Heterogeneous Platforms arxiv.org/abs/2511.07421 .DC

From Attention to Disaggregation: Tracing the Evolution of LLM Inference arxiv.org/abs/2511.07422 .DC

Synera: Synergistic LLM Serving across Device and Cloud at Scale arxiv.org/abs/2511.07423 .DC .AI .LG

Enhancing reliability in AI inference services: An empirical study on real production incidents arxiv.org/abs/2511.07424 .DC .CY

An Evaluation of LLMs Inference on Popular Single-board Computers arxiv.org/abs/2511.07425 .DC .AI

Network and Systems Performance Characterization of MCP-Enabled LLM Agents arxiv.org/abs/2511.07426 .DC .AI .CL .NI .SE

DynaKV: Enabling Accurate and Efficient Long-Sequence LLM Decoding on Smartphones arxiv.org/abs/2511.07427 .DC .AI

Resource Allocation in Hybrid Radio-Optical IoT Networks using GNN with Multi-task Learning arxiv.org/abs/2511.07428 .NI .LG

Knowledge-Guided Textual Reasoning for Explainable Video Anomaly Detection via LLMs arxiv.org/abs/2511.07429 .CV .AI

GreyShot: Zeroshot and Privacy-preserving Recommender System by GM(1,1) Model arxiv.org/abs/2511.05493 .IR

Customized Retrieval-Augmented Generation with LLM for Debiasing Recommendation Unlearning arxiv.org/abs/2511.05494 .IR .AI

IMDMR: An Intelligent Multi-Dimensional Memory Retrieval System for Enhanced Conversational AI arxiv.org/abs/2511.05495 .IR .AI

DOCUEVAL: An LLM-based AI Engineering Tool for Building Customisable Document Evaluation Workflows arxiv.org/abs/2511.05496 .IR .AI

Socially Aware Music Recommendation: A Multi-Modal Graph Neural Networks for Collaborative Music Consumption and Community-Based Engagement arxiv.org/abs/2511.05497 .IR .LG .MM

Biomedical Hypothesis Explainability with Graph-Based Context Retrieval arxiv.org/abs/2511.05498 .IR .AI

Weightless Neural Networks for Continuously Trainable Personalized Recommendation Systems arxiv.org/abs/2511.05499 .IR .AI .LG

Predicting Oscar-Nominated Screenplays with Sentence Embeddings arxiv.org/abs/2511.05500 .IR .AI .CL

Towards Ecologically Valid LLM Benchmarks: Understanding and Designing Domain-Centered Evaluations for Journalism Practitioners arxiv.org/abs/2511.05501 .HC .AI

Production-Grade Local LLM Inference on Apple Silicon: A Comparative Study of MLX, MLC-LLM, Ollama, llama.cpp, and PyTorch MPS arxiv.org/abs/2511.05502 .AR .AI

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.