Show newer

Dynamic Evaluation Framework for Personalized and Trustworthy Agents: A Multi-Session Approach to Preference Adaptability arxiv.org/abs/2504.06277 .IR .AI

Toward Total Recall: Enhancing FAIRness through AI-Driven Metadata Standardization arxiv.org/abs/2504.05307 .IR .AI

IterQR: An Iterative Framework for LLM-based Query Rewrite in e-Commercial Search System arxiv.org/abs/2504.05309 .IR .AI

GRIT: Graph-based Recall Improvement for Task-oriented E-commerce Queries arxiv.org/abs/2504.05310 .IR

Towards Adaptive Memory-Based Optimization for Enhanced Retrieval-Augmented Generation arxiv.org/abs/2504.05312 .IR .AI

Multimodal Quantitative Language for Generative Recommendation arxiv.org/abs/2504.05314 .IR .AI .CL

Coherency Improved Explainable Recommendation via Large Language Model arxiv.org/abs/2504.05315 .IR

Scale Up Composed Image Retrieval Learning via Modification Text Generation arxiv.org/abs/2504.05316 .IR .AI .CV

AIBrix: Towards Scalable, Cost-Effective Large Language Model Inference Infrastructure arxiv.org/abs/2504.03648 .DC .AI

Diagnostic Method for Hydropower Plant Condition-based Maintenance combining Autoencoder with Clustering Algorithms arxiv.org/abs/2504.03649 .AI .LG .NE

BoxRL-NNV: Boxed Refinement of Latin Hypercube Samples for Neural Network Verification arxiv.org/abs/2504.03650 .LG .AI

Echo: Efficient Co-Scheduling of Hybrid Online-Offline Tasks for Large Language Model Serving arxiv.org/abs/2504.03651 .DC .AI .LG

A Survey on Heterogeneous Computing Using SmartNICs and Emerging Data Processing Units (Expanded Preprint) arxiv.org/abs/2504.03653 .DC .NI

PointSplit: Towards On-device 3D Object Detection with Heterogeneous Low-power Accelerators arxiv.org/abs/2504.03654 .DC .AI .CV

Memory and Bandwidth are All You Need for Fully Sharded Data Parallel arxiv.org/abs/2504.03655 .DC .LG

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.