"All tested LLMs performed poorly on medical code querying, often generating codes conveying imprecise or fabricated information. LLMs are not appropriate for use on medical coding tasks without additional research."

Soroush, A. et al. (2024) 'Large language models are poor medical coders — benchmarking of medical code querying,' NEJM AI [Preprint]. doi.org/10.1056/aidbp2300040. @science
