Okay, people keep telling me to read this NY Mag profile of Emily Bender, and they're right. It's a fantastic read. However, this line is... wrong (or misleading). Everything that ChatGPT trains on is also covered by copyright. The idea that it can't do books because of copyright is just wrong. It can't train on ebooks because the ebooks are locked up and not publicly available (without great cost).

nymag.com/intelligencer/articl

@mmasnick Enjoyed the article, but this quote from it was cringeworthy as well. It's not that very few people understand how to make LLMs; it's that very few people can afford to train LLMs. As for the very precise $15.7 trillion estimate ... no comment.
