Follow

@rao2z.bsky.social presents an extremely interesting evaluation of LLMs' ability to reason. His team had been doing this research for a while now, but now with the emergence of Large Reasoning Models, finally there is some notable progress

His post on bsky: bsky.app/profile/rao2z.bsky.so
The preprint: arxiv.org/abs/2504.09762

Sign in to participate in the conversation
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.