Follow

@BenjaminHan

I cannot confirm this. Out of the box, answers several of the 17 questions Joshi claims it failed correctly.

When primed with a prompt to consider answers carefully, it answers 16 of the 17 answers also (mostly) correctly. Mostly, because some of the questions are ill-posed.

Some of the answers ChatGPT answers correctly were labelled incorrectly in the TruthfulQA dataset.

Sign in to participate in the conversation
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.