**David Thiel** @det@hachyderm.io · Jul 24, 2023, 14:12

**David Thiel** @det@hachyderm.io · Jul 24, 2023, 14:12

David Thiel @det@hachyderm.io

Jul 24, 2023, 14:12

In what is hopefully my last child safety report for a while: a report on how our previous reports on CSAM issues intersect with the Fediverse.

https://cyber.fsi.stanford.edu/io/news/addressing-child-exploitation-federated-social-media

**David Thiel** @det@hachyderm.io · Jul 24, 2023, 14:13

**David Thiel** @det@hachyderm.io · Jul 24, 2023, 14:13

Jul 24, 2023, 14:13

David Thiel @det@hachyderm.io

Similar to how we analyzed Twitter in our self-generated CSAM report, we did a brief analysis of public timelines of prominent servers, processing media with PhotoDNA and SafeSearch. The results were legitimately jaw-dropping: our first pDNA alerts started rolling in within minutes. The true scale of the problem is much larger, as inferred by cross-referencing CSAM-related hashtags with SafeSearch level 5 nudity matches.

**David Thiel** @det@hachyderm.io · Jul 24, 2023, 14:13

**David Thiel** @det@hachyderm.io · Jul 24, 2023, 14:13

Jul 24, 2023, 14:13

David Thiel @det@hachyderm.io

Hits were primarily on a not-to-be-named Japanese instance, but a secondary test to see how far they propagated did show them getting federated to other servers. A number of matches were also detected in posts originating from the big mainstream servers. Some of the posts that triggered matches were removed eventually, but the origin servers did not seem to consistently send "delete" events when that happened, which I hope doesn't mean the other servers just continued to store it.

**David Thiel** @det@hachyderm.io · Jul 24, 2023, 14:13

**David Thiel** @det@hachyderm.io · Jul 24, 2023, 14:13

Jul 24, 2023, 14:13

David Thiel @det@hachyderm.io

The Japanese server problem is often thought to mean "lolicon" or CG-CSAM, but it appears that servers that allow computer-generated imagery of kids also attracts users posting and trading "IRL" materials (their words, clear from post and match metadata), as well as grooming and swapping of CSAM chat group identifiers. This is not altogether surprising, but it is another knock against the excuses of lolicon apologists.

**David Thiel** @det@hachyderm.io · Jul 24, 2023, 14:13

**David Thiel** @det@hachyderm.io · Jul 24, 2023, 14:13

Jul 24, 2023, 14:13

David Thiel @det@hachyderm.io

Traditionally the solution here has been to defederate from freezepeach servers and...well, all of Japan. This is commonly framed as a feature and not a bug, but it's a blunt instrument and it allows the damage to continue. With the right tooling, it might be possible to get the large Japanese servers to at least crack down on material that's illegal there (which non-generated/illustrated CSAM is).

**David Thiel** @det@hachyderm.io · Jul 24, 2023, 14:14

**David Thiel** @det@hachyderm.io · Jul 24, 2023, 14:14

Jul 24, 2023, 14:14

David Thiel @det@hachyderm.io

I have argued for a while that the Fediverse is way behind in this area; part of this lack of tooling and reliance on user reports, but part is architectural. CSAM-scanning systems work one of two ways: hosted like PhotoDNA, or privately distributed hash databases. The former is a problem because all servers hitting PhotoDNA at once for the same images doesn't scale. The latter is a problem because widely distributed hash databases allow for crafting evasions or collisions.

**David Thiel** @det@hachyderm.io · Jul 24, 2023, 14:14

**David Thiel** @det@hachyderm.io · Jul 24, 2023, 14:14

Jul 24, 2023, 14:14

David Thiel @det@hachyderm.io

I think for this particular issue to be resolved, a couple things need to happen: one, an ActivityPub implementation of content scanning attestation should be developed, allowing the origin servers to perform scanning via a remote service and other servers to verify it happened. Second, for the hash databases that are privately distributed (e.g. Take It Down, NCMEC's NCII database), someone should probably take on making these into a hosted service.

**David Thiel** @det@hachyderm.io · Jul 24, 2023, 14:14

**David Thiel** @det@hachyderm.io · Jul 24, 2023, 14:14

Jul 24, 2023, 14:14

David Thiel @det@hachyderm.io

Integrated reporting to NCMEC's CyberTipline would make life easier for admins and increase the likelihood that those reports get filed at all. Even without attestation, the big instances should all be using PhotoDNA; it's unclear if anyone on the Fediverse is even doing this, given that they'd have to manually hack it in. UI needs to be added to mainline Mastodon to allow for that—it's a very simple pair of REST calls that just need a couple auth tokens.

**Ubergeek** @ubergeek@tilde.zone · Jul 24, 2023, 21:32

**Ubergeek** @ubergeek@tilde.zone · Jul 24, 2023, 21:32

Jul 24, 2023, 21:32

Ubergeek @ubergeek@tilde.zone

@det is PhotoDNA AGPL? If not, there is zero reason to trust it does what it says it does, and isn't also searching for politically dangerous content, and alerting authorities in those countries, many of which are openly hostile to individual liberties.

**Olives** @olives@qoto.org · 2023-07-26T09:47:01Z

Olives @olives@qoto.org

@ubergeek @det It involves uploading images to Microsoft's servers for them figure out whether it contains illegal bits.

Or something close enough.

No, you never see the actual algorithm (or database).

Jul 26, 2023, 09:47 · · · ·

Resources

Developers

What is Mastodon?

qoto.org

More…