My lab's website is part of Google's LLM training set
@Riedl My website for my ancient iOS app is also part of the training set, which definitely disproves Common Crawl's claim that "it tries to prioritize the most important and reputable sites."
QOTO: Question Others to Teach Ourselves An inclusive, Academic Freedom, instance All cultures welcome. Hate speech and harassment strictly forbidden.