Show newer

Teaching Language Models to Critique via Reinforcement Learning arxiv.org/abs/2502.03492 .LG .AI .CL

Optimistic {\epsilon}-Greedy Exploration for Cooperative Multi-Agent Reinforcement Learning arxiv.org/abs/2502.03506 .MA .LG

Reconstructing 3D Flow from 2D Data with Diffusion Transformer arxiv.org/abs/2502.02593 .flu-dyn .CE .AI

Offshore Wind Turbine Tower Design and Optimization: A Review and AI-Driven Future Directions arxiv.org/abs/2502.02594 .SY .CE .SY

A Quasi-Optimal Shape Design Method for Lattice Structure Construction arxiv.org/abs/2502.02602 .OC .CE

Physically Interpretable Representation and Controlled Generation for Turbulence Data arxiv.org/abs/2502.02605 .comp-ph .flu-dyn .CE .LG

Carbon Per Transistor (CPT): The Golden Formula for Green Computing Metrics arxiv.org/abs/2502.02606 -mat.mtrl-sci .OH

MIND: Microstructure INverse Design with Generative Hybrid Neural Representation arxiv.org/abs/2502.02607 .CV .GR .LG

FruitPAL: An IoT-Enabled Framework for Automatic Monitoring of Fruit Consumption in Smart Healthcare arxiv.org/abs/2502.01643 .CV

UA-1 PH2 DECISIVE Testing Handbook: Test Methods and Benchmarking Performance Results for sUAS in Dense Urban Environments arxiv.org/abs/2502.01648 .SY .RO .SY

Fine-tuning LLaMA 2 interference: a comparative study of language implementations for optimal efficiency arxiv.org/abs/2502.01651 .LG .AI

Hybrid Group Relative Policy Optimization: A Multi-Sample Approach to Enhancing Policy Optimization arxiv.org/abs/2502.01652 .LG .AI

Predicting concentration levels of air pollutants by transfer learning and recurrent neural network arxiv.org/abs/2502.01654 .ao-ph .LG .NE

A binary PSO based ensemble under-sampling model for rebalancing imbalanced training data arxiv.org/abs/2502.01655 .LG .AI .NE

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.