Follow

Researchers at Apple come up with a generalisation of GSM8K benchmark that allows discerning how LLM's learn mathematics: whether they really do the maths, or just engage in rote learning
awful.systems/post/2610681

Sign in to participate in the conversation
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.