Excellent piece in Tech Review on evaluation issues for LLMs.
https://www.technologyreview.com/2023/08/30/1078670/large-language-models-arent-people-lets-stop-testing-them-like-they-were
@melaniemitchell
Nothing like coercing paying students into unpaid research work to guarantee maximum human effort in a humans vs AI study.