"All tested LLMs performed poorly on medical code querying, often generating codes conveying imprecise or fabricated information. LLMs are not appropriate for use on medical coding tasks without additional research."
Soroush, A. et al. (2024) 'Large language models are poor medical coders — benchmarking of medical code querying,' NEJM AI [Preprint]. https://doi.org/10.1056/aidbp2300040. #Research #DOI #Science #Medicine #Data #Ethics #Statistics #LLM #AI #ArtificialIntelligence #Academia #Academic #Academics @science