**Rysiekúr (old account)** @rysiek@mastodon.social · Jul 08, 2021, 00:42

@tindall eh, no. There is a data-mining exception that allows this kind of thing:
https://juliareda.eu/2021/07/github-copilot-is-not-infringing-your-copyright/

And it's important and useful, for scientists and investigative journalists.

It also happens to be useful for Microsoft's GitHub Copilot here. And I share your frustration about this. The problem is: it's really difficult to make it not useful for the Microsofts of this world without also blocking a lot of scientific research and investigative journalism.

**Rysiekúr (old account)** @rysiek@mastodon.social · Jul 08, 2021, 00:46

@tindall that is obviously still a conversation worth having, though!

Still, Microsoft Copilot does seem to infringe every now and then, when it quotes verbatim full passages from certain pieces of code:
https://www.reddit.com/r/programming/comments/oc9qj1/copilot_regurgitating_quake_code_including_sweary/

*That's* where Microsoft needs to get smacked hard for copyright infringement and licensing violations!

**Shamar** @Shamar@qoto.org · 2021-07-08T21:55:47Z

@rysiek

The argument about the derivative work is plain wrong, and I'm really surprised that Julia Reda wrote something like this.¹

> On the other hand, the argument that the outputs of GitHub Copilot are derivative works of the training data is based on the assumption that a machine can produce works. This assumption is wrong and counterproductive. Copyright law has only ever applied to intellectual creations – where there is no creator, there is no work. This means that machine-generated code like that of GitHub Copilot is not a work under copyright law at all, so it is not a derivative work either. The output of a machine simply does not qualify for copyright protection – it is in the public domain. That is good news for the open movement and not something that needs fixing.

A compiler does NOT add anything creative: it only applies an algorithmic transform to the sources.

Thus the output of a compiler is under the copyright of the authors of the sources.

Similarly, a zip containing the sources is under the #copyright of the authors of the sources.

The training of #GitHubCopilot's model just did the same: it turned sources under their authors' copyright into a big opaque archive (aka blackbox #OpenAI) that can be queried through an API.

Thus the model is protected under the copyright of all the authors of the original sources.
And since such code was distributed under #AGPLv3, the whole model must be distributed within 30 days to prevent a termination of the license.

Sure, I'd be very happy to learn that zipping a book or ripping a dvd would end the rights of the copyright holders.

But if I cannot algorithmically transform #windows11 binaries, say by decompiling them, and thereby end #Microsoft's rights over the output, then Microsoft cannot transform my #AGPLv3 code without complying with the license.
____
¹ Or at least, I would have been surprised months ago, before she signed the "open letter" against #RMS to divide the #FreeSoftware movement

@tindall@cybre.space
