These are public posts tagged with #Copylot. You can interact with them if you have an account anywhere in the fediverse.
It would be a huge misattribution, as most mentioned authors would have no impact on the specific code produced.
In the notorious case of #GitHub #Copylot distributing the code of Quake 3 Arena, it should have simply attributed it exactly to the author.
Same should happen for any transformation of such code (reordered lines, symbols renames and so on).
If the output mix the code from multiple authors, each and all of those authors (and only those authors) should be mentioned.
As for license, such hypotethical software should only mix code from compatible licenses and distribute / output it with the proper copyright statements declared in each of the original sources.
In other words, a programmer using a software to create a derivative work of one or more project, should obey the exact same rules followed by any other programmer directly doing the same.
Yet your proposal is reasonable for the #LLM itself: it's a derived work of all the sources used to statistically program it, so it should be attributed to all the original authors and should strictly respect each of the source licenses as any other derivative work.
This is not anti-LLM, just good sense: expensive automatic proxies should never put who control them above the law.
Soon after its release, GitHub #Copylot was caught distributing #GPLv3 code from Quake 3 Arena, with a wrong attribution and permissive MIT-like license. That's why I call it #CopyALot.
For a famous piece of copylefted code tht was recognized, the work of thousands of less known free software developer is going to be included in proprietary products without even the offending developers being aware of the theft.
#LLM "trained" on #OpemSource software can only be used to ethic-wash the practice, so that most of opensource developers won't realize how they are fooled and their work expropriated.
@zacchiro @joeyh@hachyderm.io