@ct_bergstrom It does even worse when spatial/geometrical reasoning is required. Still, without endorsing claims about AGI, the screenshotted imitation of solving a logic puzzle is a pretty mind-blowing performance for a pure language model. Much better imitation than I would have thought possible if you had asked me 5 years ago.