Conversation

Notices

Embed this notice
Bradley M. Kühn (bkuhn@fedi.copyleft.org)'s status on Thursday, 16-Apr-2026 08:59:34 JST Bradley M. Kühn

👀 … https://sfconservancy.org/blog/2026/apr/15/eternal-november-generative-ai-llm/ …my colleague Denver Gingerich writes: newcomers' extensive reliance on LLM-backed generative AI is comparable to the Eternal September onslaught to USENET in 1993. I was on USENET extensively then; I confirm the disruption was indeed similar. I urge you to read his essay, think about it, & join Denver, me, & others at the following datetimes…
$ date -d '2026-04-21 15:00 UTC'
$ date -d '2026-04-28 23:00 UTC'
…in https://bbb-new.sfconservancy.org/rooms/welcome-llm-gen-ai-users-to-foss/join
#AI #LLM #OpenSource
In conversation about 2 months ago from fedi.copyleft.org permalink
Attachments
1. Domain not in remote thumbnail source whitelist: sfconservancy.org
  
  Eternal November - the new influx of users, and why it's way better than the last one
  
  Many people may recall Eternal September (in 1993) — when Usenet membership increased overwhelmingly — marking the annual September rush of student joins. The ensuing moderation challenges changed the culture of Usenet (the largest Internet discussion fora back then). Many early Usenet adopters left; they reconnected with their communities elsewhere. While this onslaught of “newbies” knew little of Usenet's traditional cultural norms, they nonetheless benefitted greatly from these novel connections to discuss and learn together with people worldwide. The times were turbulent then, but eventually new cultural norms emerged.
2. Domain not in remote thumbnail source whitelist: bbb-new.sfconservancy.org
  
  BigBlueButton
  
  Learn using BigBlueButton, the trusted open-source web conferencing solution that enables seamless virtual collaboration and online learning experiences.
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Thursday, 16-Apr-2026 08:59:32 JST Christine Lemmer-Webber
  in reply to
  
  @bkuhn @ossguy I have to admit that I am pretty surprised by this post. Not in terms of being welcoming to newcomers, which is something I have advocated for and made the center of all of my FOSS work.
  However, the post says the following:
  > I encourage all of us in the FOSS community to welcome the new software developers who've adopted these tools, investigate their motivations, and seriously consider cautiously and carefully incorporating their workflows with ours.
  While the sentence which follows acknowledges that "seasoned software developers understand the benefits and limitations of LLM-assisted coding tools", there are two big things I expected at least acknowledged:
  - Many maintainers are facing *burnout* over the situation. However, I agree that addressing this in terms of norms is something we can consider
  - The biggest thing I am surprised to not see addressed at all is the licensing and copyright implications
  (cotd)
  
  In conversation about 2 months ago permalink
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Thursday, 16-Apr-2026 09:00:50 JST Christine Lemmer-Webber
  in reply to
  
  @bkuhn @ossguy The surprising thing about saying "seriously consider cautiously and carefully incorporating their workflows with ours" is that it doesn't address at all my *biggest* fear: the copyright status of LLM generated contributions seems currently unsettled.
  I know there's been assertions to the contrary floating around: the Supreme Court deferred to a lower court in the US. However that is not the same thing as the Supreme Court making a specific decision. And internationally, the copyright situation of output is even murkier... it will take a long time for this to settle.
  Does Conservancy not think this is the case? I would be surprised if so, but perhaps you all have an interpretation that I am not currently aware of.
  If there *is* concern, then we hit a serious risk: we may be seeing many contributions with legal status which has *yet to be determined* entering seasoned codebases. And this worries me a lot.
  
  In conversation about 2 months ago permalink
  
  Evan Prodromou repeated this.
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Thursday, 16-Apr-2026 09:14:22 JST Christine Lemmer-Webber
  in reply to
  
  @bkuhn @ossguy There are other things I am less worried about. genAI tools used to probe for software vulnerabilities does not lead to contributions of unknown status. Same for using LLMs to explore a codebase. However, there isn't any distinction made in the post, only a "seriously consider cautiously and carefully incorporating their workflows with ours".
  Does this mean Conservancy currently believes that the matter of genAI output by contemporary LLM tools is a settled matter, in terms of either a) being fully in the public domain or b) being the copyright status of the "prompter"?
  
  In conversation about 2 months ago permalink
- Embed this notice
  Rich Felker (dalias@hachyderm.io)'s status on Thursday, 16-Apr-2026 10:51:21 JST Rich Felker
  in reply to
  - Christine Lemmer-Webber
  - AI Cases Bot
  @bkuhn @cwebber @ai_cases I'm confused what you mean by "dire". All LLM-emitted code being infringing would not be a "dire" outcome but the ideal one. Even if it does blow up in the faces of irresponsible maintainers who've let that infect their codebases and who now need to revert to the last non-compromised versions.
  
  In conversation about 2 months ago permalink
- Embed this notice
  Bradley M. Kühn (bkuhn@fedi.copyleft.org)'s status on Thursday, 16-Apr-2026 10:51:24 JST Bradley M. Kühn
  in reply to
  - Christine Lemmer-Webber
  - AI Cases Bot
  @cwebber I think maybe you missed https://sfconservancy.org/blog/2026/mar/04/scotus-deny-cert-dc-circuit-thaler-appeal-llm-ai/ where #SFC analyzed that situation?
  Also, follow @ai_cases & see the *firehose* of litigation on this & remember the “Work Based on the Program” issue under GPLv2 has still never been litigated directly but lots of cases about 100% proprietary software have bolstered GPL's strength.
  Big Content has legal battles with Big Tech on 100s of fronts rn. Yes, we're adrift on their sea, but the situation is not as dire as you imagine.
  #AI #LLW
  In conversation about 2 months ago permalink
  Attachments
  1. Domain not in remote thumbnail source whitelist: sfconservancy.org
    
    SCOTUS Declines to Hear LLM-Backed AI Case Regarding Copyright
    
    No Serious Implications for FOSS from SCOTUS' DenialEarlier this week1, the U.S. Supreme Court (SCOTUS) denied certiorari (cert) in Thaler v. Perlmutter. Thaler contended that an image — generated by a Large Language Model (LLM)-backed Artificial Intelligence (AI) — deserved copyright registration. Since the U.S. Copyright Office refused to grant the registration, Thaler appealed to the U.S. District Court for the District of Columbia (DC Circuit). That Court affirmed the Copyright Office's decision. SCOTUS' denial of “cert” means they will not hear the case. Strictly speaking, this denial does not affirm the DC Circuit Court's ruling, but it does mean the DC Circuit decision stands.
  Christine Lemmer-Webber repeated this.
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Friday, 17-Apr-2026 05:50:38 JST Christine Lemmer-Webber
  in reply to
  
  @richardfontana @bkuhn @ossguy In which of the 5 million ways I could parse that sentence do you mean it?
  
  In conversation about 2 months ago permalink
- Embed this notice
  Richard Fontana (richardfontana@mastodon.social)'s status on Friday, 17-Apr-2026 05:50:39 JST Richard Fontana
  in reply to
  - Christine Lemmer-Webber
  @cwebber I truly don't think this is a new situation @bkuhn @ossguy
  
  In conversation about 2 months ago permalink
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Saturday, 18-Apr-2026 22:49:54 JST Christine Lemmer-Webber
  in reply to
  - Richard Fontana
  @bkuhn @ossguy @richardfontana Continuing here, because it's the relevant subthread.
  I am sympathetic to choosing to narrow a topic. However, the post, in implying that we should start accepting partially AIgen contributions, inherently pulls in the topic of whether or not that is legally safe.
  Yes, I have read the previous Conservancy post about the existing cases. This partly contributes to my surprise and confusion about the post.
  Acknowledging that the plan is to have continued conversations and meetings about this, I still feel it is important to lay down my current concerns, even before such a meeting. I am leaving the "quality of contributions" and many other details out of here, and instead focusing on whether of not it is *safe to accept* contributions on copyright grounds at the moment, and what the implications of thinking on that are.
  (cotd)
  
  In conversation about 2 months ago permalink
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Saturday, 18-Apr-2026 22:52:42 JST Christine Lemmer-Webber
  in reply to
  - Richard Fontana
  @bkuhn @ossguy @richardfontana So the question is: is it safe, from a legal perspective, given the current state of uncertainty of copyright of such contributions, to encourage accepting such contributions into repositories?
  Now clearly, many projects are: the Linux kernel most famously is, and their recent policy document says effectively, "You can contribute AI generated code, but the onus is on you whether or not you legally could have".
  Which is not very helpful of a handwave, I would say, since few contributors are equipped to assess such a thing. I've left myself and three others addressed in this portion of the thread, and all of us *have* done licensing work, and my suspicion is, *especially* based on what's been written, that none of us could confidently project where things are going to go.
  
  In conversation about 2 months ago permalink
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Saturday, 18-Apr-2026 22:58:43 JST Christine Lemmer-Webber
  in reply to
  - Richard Fontana
  @bkuhn @ossguy @richardfontana Part of the problem here is that the AI companies have set the stage themselves. Their presumption is that it's fine to absorb effectively all open and "indie" content, and that this is entirely fair to pull into a model without any legal implications, whereas potentially yes, you may need to "license" something that looks like a Disney character. In the land of code, I also sense that Microsoft is perfectly fine with the idea that you can "copyright launder" a codebase from the GPL to perhaps the public domain, but if someone did that to their own leaked source code, they would be very upset.
  Meanwhile, a friend of mine who works in films has said that he keeps hearing rumors that OpenAI would like a cut of stuff made with their stuff. We should presume tthat true.
  Regardless, I'm sure everyone on this thread wants an *equitable* situation for proprietary and FOSS licensing. I'll expand on that more in a moment though.
  
  In conversation about 2 months ago permalink
  
  Rich Felker repeated this.
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Saturday, 18-Apr-2026 23:01:36 JST Christine Lemmer-Webber
  in reply to
  - Richard Fontana
  However, it's not actually the laundering angle I am concerned with here entirely, it's whether we're turning FOSS codebases into potential legal toxic waste dumps that we will have a hell of a time cleaning up later.
  The previous Conservancy post, which @bkuhn linked upthread, indicates that Conservancy does indeed consider the matter unsettled.
  Current LLMs wouldn't "default to copyleft", since they also include all-rights-reserved mixed in there. If the result of output of these systems is a slurry of inputs which carry their licensing somehow, their default licensing output situation is one of a hazard.
  I note that @bkuhn and @ossguy seem to be hinting at hoping a "copyleft based LLM" with all-copyleft output it a winning scenario. I'm going to state plainly: I believe that's an impossible outcome.
  @richardfontana
  
  In conversation about 2 months ago permalink
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Saturday, 18-Apr-2026 23:03:35 JST Christine Lemmer-Webber
  in reply to
  - Richard Fontana
  @bkuhn @ossguy @richardfontana Rather than focus on the GPL, let's choose a different copyleft license. In fact, let's choose a gradient of licenses.
  - CC0's public domain declaration w/ minimal fallback license
  - CC BY
  - CC BY-SA
  Imagine for a moment an LLM trained entirely on the above three licenses, and then one that's CC BY and CC0, and then one that's just CC0.
  Let's look at both extremes and then we'll find out the real dangers come from observing the middle.
  
  In conversation about 2 months ago permalink
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Saturday, 18-Apr-2026 23:08:12 JST Christine Lemmer-Webber
  in reply to
  - Richard Fontana
  @bkuhn @ossguy @richardfontana First let's imagine the only-CC0 based LLM.
  I would fully agree that no matter the law and legal case law passed and established, the CC0 based input LLM is clearly effectively in the public domain, or like CC0 itself, equivalent to it. This one is relatively simple.
  Let's make things more complicated.
  
  In conversation about 2 months ago permalink
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Saturday, 18-Apr-2026 23:10:06 JST Christine Lemmer-Webber
  in reply to
  - Richard Fontana
  @bkuhn @ossguy @richardfontana Regarding the one containing CC0, CC BY, and CC BY-SA, the situation is more uncertain and seems highly affected by legal outcomes in upcoming law and cases to be set. There is the possibility that indeed, the LLM is considered a slurry of inputs and this is legally acceptable, and effectively any output which is not verbatim of its inputs in some way is effectively under the public domain.
  Now, of course, the problem is that we don't have to just worry about the US, we have to worry *internationally*. When considered from this angle, that FOSS is an international endeavour, this hope that things are in the public domain feels a lot dicier.
  The assumption is that then this effectively leads to the output being under the terms of CC BY-SA. This is fine, great even, right?! Because effectively everything is share-alike (Bradley I don't wanna get into whether BY-SA is copyleft or something weaker). We slap CC BY-SA on the output, it's fine. Right??????
  
  In conversation about 2 months ago permalink
  
  Rich Felker repeated this.
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Saturday, 18-Apr-2026 23:13:10 JST Christine Lemmer-Webber
  in reply to
  - Richard Fontana
  @bkuhn @ossguy @richardfontana Except, I actually believe this scenario isn't legally viable. And it's easier to understand if we scale back to the middle case.
  Let's now look at the LLM trained on CC0 and CC BY. Because it's the BY aspect that makes everything complicated.
  There is *NO WAY* in current LLM technology, nor I believe from studying how neural networks work, any viable computationally performant LLM, that they can track provenance. The BY clause cannot be upheld.
  This isn't a theoretical concern for me; someone built another vibecoded Scheme-to-WASM-GC compiler that looks an awful lot like Spritely's own Hoot compiler in places. They didn't attribute us. They probably didn't know. But like many FOSS licenses, Apache v2 does require certain levels of attribution to be upheld. Most FOSS projects do.
  You can't uphold the CC BY requirement, as far as I can tell.
  
  In conversation about 2 months ago permalink
  
  Rich Felker repeated this.
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Saturday, 18-Apr-2026 23:14:19 JST Christine Lemmer-Webber
  in reply to
  - Richard Fontana
  @bkuhn @ossguy @richardfontana Now here is a counter-argument: how do people attribute Wikipedia? They generally just attribute Wikipedia! And people seem to be mostly fine with this.
  It feels fine, when you were a contributor to the Wikipedia project.
  It feels a lot less fine when you are a contributor to a specific project, to have everything just sucked up into "the generic LLM". Claude did it! Claude did it all by itself.
  In conversation about 2 months ago permalink
  Attachments
  1. Domain not in remote thumbnail source whitelist: project.it
    
    Project Informatica
    
    Fornisce hardware e software, soluzioni sistemistiche e di rete personalizzate. I prodotti informatici sono corredati da servizi di assistenza e manutenzione personalizzati.
  2. Domain not in remote thumbnail source whitelist: www.this.it
    
    Progetti architettura e servizi tecnici per immobili
    
    Consulenza tecnica di architettura ed ingegneria per progettazione, ristrutturazione di immobili, pratiche edilizie, perizie. Investimenti, valorizzazione e trasformazione di immobili
  Rich Felker repeated this.
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Saturday, 18-Apr-2026 23:16:37 JST Christine Lemmer-Webber
  in reply to
  - Richard Fontana
  @bkuhn @ossguy @richardfontana If we are pushing for an *equitable* scenario for copyright output, there is only one "good outcome" in terms of copyright, and that is that everything is effectively in the public domain. The dream of having a "copyleft LLM" doesn't work.
  And even if it did, there are several problems:
  - Nobody is using that *now*, and contributors are facing contributions *now*, and there is legal uncertainty about accepting those contributions *right now*.
  - It is unlikely that the "copyleft LLM" would be very useful. The way people use these tools is conversational in a way that requires them to effectively have to be trained on the entire internet to be functional. Not just copyleft codebases.
  The copyleft LLM dream is a joke.
  
  In conversation about 2 months ago permalink
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Saturday, 18-Apr-2026 23:19:09 JST Christine Lemmer-Webber
  in reply to
  - Richard Fontana
  @bkuhn @ossguy @richardfontana I say "good outcome", and I'm not saying it's an outcome I want, because "what I want" is pretty complicated here. I'm saying, it's the only one where there is the possibility of legal output from these tools that can safely be incorporated into FOSS projects *at all* that is *equitable* for both FOSS and proprietary situations.
  And yup, unfortunately, that would mean copyright-laundering of FOSS codebases through LLMs would be possible to strip copyleft.
  It would also mean the same for proprietary codebases.
  Frankly I think it would kind of rule if we stabbed copyright in the gut that badly, but there's so much vested interest from various copyright holding corporations, I don't think we're likely to get that. Do you?
  In conversation about 2 months ago permalink
  Attachments
  1. No result found on File_thumbnail lookup.
    
    COPYLEFT.IT
  Rich Felker repeated this.
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Saturday, 18-Apr-2026 23:22:01 JST Christine Lemmer-Webber
  in reply to
  - Richard Fontana
  @bkuhn @ossguy @richardfontana So let me summarize:
  - Without knowing the legal status of accepting LLM contributions, we're potentially polluting our codebases with stuff that we are going to have a HELL of a time cleaning up later
  - The idea of a copyleft-only LLM is a joke and we should not rely on it
  - We really only have two realistic scenarios: either FOSS projects cannot accept LLM based contributions legally from an international perspective, or everything is effectively in the public domain as outputted from these machines, but at least in the latter scenario we get to weaken copyright for everyone.
  That's leaving out a lot of other considerations about LLMs and the ethics of using them, which I think most of the other replies were focused on, I largely focused on the copyright implications aspects in this subthread. Because yes, I agree, it can be important to focus a conversation.
  But we can't ignore this right now.
  We're putting FOSS codebases at risk.
  
  In conversation about 2 months ago permalink
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Saturday, 18-Apr-2026 23:33:25 JST Christine Lemmer-Webber
  in reply to
  - Lord Caramac the Clueless, KSC
  - Richard Fontana
  @LordCaramac @bkuhn @ossguy @richardfontana If you are talking about my personal wishes, I would agree. Personally, I perceive of FOSS as a *reaction to* allowing copyright and other intellectual restrictions laws to apply to software.
  This puts me at odds with some other copyleft advocates. I see copyleft as useful because it "turns the teeth of the machine against itself". If you have copyright, then great, we will use it to have a way to force the commons to stay open.
  But it would be better to have no copyright at all, and if we could give it up, I would give it up.
  But it's a far-fetched dream that it could happen. Maybe it will. I am not so sure. If it truly is possible to "copyright launder" any work through an LLM, we'd be as close to it as we ever could be.
  But again, whatever scenario, in my view, has to be equitable. If it's possible to do that to GPL'ed software, it's only just to be possible to do it to any proprietary software, including reverse engineering binaries.
  
  In conversation about 2 months ago permalink
- Embed this notice
  Lord Caramac the Clueless, KSC (lordcaramac@discordian.social)'s status on Saturday, 18-Apr-2026 23:33:26 JST Lord Caramac the Clueless, KSC
  in reply to
  - Richard Fontana
  - Christine Lemmer-Webber
  @cwebber @bkuhn @ossguy @richardfontana I think we should just destroy copyright entirely and expand the public domain to contain everything that has ever been published. Intellectual property was a very bad idea in the first place IMHO.
  
  In conversation about 2 months ago permalink
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Saturday, 18-Apr-2026 23:49:41 JST Christine Lemmer-Webber
  in reply to
  @noisytoot @LordCaramac @ossguy @bkuhn @richardfontana I agree with you, and also have no idea why my post was set to DE.
  
  In conversation about 2 months ago permalink
- Embed this notice
  Noisytoot (noisytoot@berkeley.edu.pl)'s status on Saturday, 18-Apr-2026 23:49:43 JST Noisytoot
  in reply to
  @cwebber @LordCaramac @bkuhn @ossguy @richardfontana In a world without copyright (assuming no other changes), nothing would prevent people from withholding source code and attempting to restrict people’s freedom by technical means (DRM). On the other hand, it would also be entirely legal to reverse engineer everything and bypass the DRM.
  Copyright should be removed, but DRM and providing binaries without source code should also be made illegal.
  Also why is your post language set to de?
  
  In conversation about 2 months ago permalink
- Embed this notice
  Evan Prodromou (evan@cosocial.ca)'s status on Sunday, 19-Apr-2026 01:31:43 JST Evan Prodromou
  in reply to
  - Christine Lemmer-Webber
  - AI Cases Bot
  @bkuhn @cwebber @ai_cases both great resources, tysm!
  
  In conversation about 2 months ago permalink
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Sunday, 19-Apr-2026 01:33:39 JST Christine Lemmer-Webber
  in reply to
  - infinite love ⴳ
  - Richard Fontana
  @trwnh @bkuhn @ossguy @richardfontana Plenty of Microsoft code has been released under "shared source" licenses and also leaks
  
  In conversation about 2 months ago permalink
- Embed this notice
  infinite love ⴳ (trwnh@mastodon.social)'s status on Sunday, 19-Apr-2026 01:33:40 JST infinite love ⴳ
  in reply to
  - Richard Fontana
  - Christine Lemmer-Webber
  @cwebber @bkuhn @ossguy @richardfontana how do you launder proprietary codebases if the source isn't available? i just see this as 2 negatives since it would incentivize trade secrets
  
  In conversation about 2 months ago permalink
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Sunday, 19-Apr-2026 01:34:02 JST Christine Lemmer-Webber
  in reply to
  - Evan Prodromou
  - AI Cases Bot
  @evan @bkuhn @ai_cases I will admit that getting into a big ol licensing debate does feel very original-fediverse
  
  In conversation about 2 months ago permalink
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Sunday, 19-Apr-2026 01:35:25 JST Christine Lemmer-Webber
  in reply to
  - Jens Finkhäuser
  - Richard Fontana
  @jens @bkuhn @ossguy @richardfontana This is indeed a serious risk, though tangential to this subthread. But it's a concern I also have.
  
  In conversation about 2 months ago permalink
- Embed this notice
  Jens Finkhäuser (jens@social.finkhaeuser.de)'s status on Sunday, 19-Apr-2026 01:35:27 JST Jens Finkhäuser
  in reply to
  - Richard Fontana
  - Christine Lemmer-Webber
  @cwebber @bkuhn @ossguy @richardfontana Worse IMHO is that we're putting FOSS as a movement at risk if we deskill everyone to the point where you either pay money to have code generated for you, or there is no code.
  
  In conversation about 2 months ago permalink
- Embed this notice
  Richard Fontana (richardfontana@mastodon.social)'s status on Sunday, 19-Apr-2026 01:46:07 JST Richard Fontana
  in reply to
  - Christine Lemmer-Webber
  @cwebber copyleft-only LLM is nonsensical , agreed @bkuhn @ossguy
  
  In conversation about 2 months ago permalink
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Sunday, 19-Apr-2026 01:46:45 JST Christine Lemmer-Webber
  in reply to
  - Richard Fontana
  @richardfontana @bkuhn @ossguy Glad to hear we agree there!
  
  In conversation about 2 months ago permalink
- Embed this notice
  Evan Prodromou (evan@cosocial.ca)'s status on Sunday, 19-Apr-2026 01:50:22 JST Evan Prodromou
  in reply to
  - Richard Fontana
  - Christine Lemmer-Webber
  @cwebber
  Are you concerned that the LLMs generate nontrivial verbatim excerpts of copyrighted works?
  Or that there is a hidden "intellectual property" in the deep patterns that they use?
  Say, when an LLM was trained on a file I made with an interesting loop structure, and it emits code with a similar loop structure, even if the variable names, problem domain, details, or programming language differ.
  What if a court says I can demand royalties for my "IP"?
  @bkuhn @ossguy @richardfontana
  
  In conversation about 2 months ago permalink
- Embed this notice
  Evan Prodromou (evan@cosocial.ca)'s status on Sunday, 19-Apr-2026 01:53:30 JST Evan Prodromou
  in reply to
  - Richard Fontana
  - Christine Lemmer-Webber
  @cwebber @bkuhn @ossguy @richardfontana
  Like, not copyrightable, not patents, but some secret third thing, kind of what people mean when we say that someone "stole our idea".
  
  In conversation about 2 months ago permalink
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Sunday, 19-Apr-2026 01:53:55 JST Christine Lemmer-Webber
  in reply to
  - Evan Prodromou
  - Richard Fontana
  @evan @bkuhn @ossguy @richardfontana I am talking about copyright
  
  In conversation about 2 months ago permalink
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Sunday, 19-Apr-2026 01:54:03 JST Christine Lemmer-Webber
  in reply to
  - Evan Prodromou
  - Richard Fontana
  @evan @richardfontana I am saying we don't know the answer to that question, and it seems that @bkuhn and @ossguy agree that we don't know the answer to it, based on previous posts, and the lack of knowledge about what the copyright implications of LLM based contributions means that we are creating a schrodingers-licensing-timebomb for our FOSS codebases
  
  In conversation about 2 months ago permalink
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Sunday, 19-Apr-2026 01:55:35 JST Christine Lemmer-Webber
  in reply to
  - Evan Prodromou
  - Richard Fontana
  @evan @bkuhn @ossguy @richardfontana Say for a moment that we *did* make a model which intentionally pulled in leaked source code from various proprietary codebases.
  What would your opinion be on the legal-hazard state of accepting that code output? Would you consider it relatively safe from a copyright perspective?
  
  In conversation about 2 months ago permalink
- Embed this notice
  Evan Prodromou (evan@cosocial.ca)'s status on Sunday, 19-Apr-2026 01:57:14 JST Evan Prodromou
  in reply to
  - Richard Fontana
  - Christine Lemmer-Webber
  @cwebber excellent, thanks!
  @bkuhn @ossguy @richardfontana
  
  In conversation about 2 months ago permalink
- Embed this notice
  Richard Fontana (richardfontana@mastodon.social)'s status on Sunday, 19-Apr-2026 01:59:03 JST Richard Fontana
  in reply to
  - Christine Lemmer-Webber
  @cwebber I think adequate compliance might be possible with good enough detection/matching tools but I don't necessarily expect such tools to be developed (let alone available to foss projects) (my assumption is that the few such tools in use today are pretty bad) @bkuhn @ossguy
  
  In conversation about 2 months ago permalink
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Sunday, 19-Apr-2026 01:59:03 JST Christine Lemmer-Webber
  in reply to
  - Richard Fontana
  @richardfontana @bkuhn @ossguy That's a problem so hard it throws the "NP complete" debate out the window in favor of something brand new. Given that these codebases have no trouble "translating" from one language's source code into another, how on *earth* could you possibly hope to build a compliance tool around that?
  Laughable, to anyone who tries.
  
  In conversation about 2 months ago permalink
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Sunday, 19-Apr-2026 02:09:59 JST Christine Lemmer-Webber
  in reply to
  - Stefano Zacchiroli
  - Richard Fontana
  @zacchiro @bkuhn @ossguy @richardfontana While true, there is a big difference in that the previous scenario was someone out of compliance with what the community actually accepted as hygienic and acceptable contributions, and those contributions were relatively rare.
  Saying that we don't need to worry about the risks from these tools right now from a licensing situation is different: it's advising on a path being acceptable where we *don't know* whether or not it's generally safe practice to recommend! And which most in this thread seem to agree we don't know. Even your post seems to say "it seems like it'll probably be okay and end up in our favor".
  I guess I feel increasingly like I am maybe the only "oldschool FOSS licensing wonk" who cares about this, and maybe that means I should just give up.
  But *damn* I can't believe it feels like when people are both saying "we don't know what the implications will be" we're also saying "so go ahead and say those patches are a-ok!"
  
  In conversation about 2 months ago permalink
- Embed this notice
  Stefano Zacchiroli (zacchiro@mastodon.xyz)'s status on Sunday, 19-Apr-2026 02:10:01 JST Stefano Zacchiroli
  in reply to
  - Richard Fontana
  - Christine Lemmer-Webber
  @cwebber @bkuhn @ossguy @richardfontana
  My current answer to your "is it safe" question is to answer a slightly different question. Namely: "is it any less safe than accepting code from a random employee that claims to be submitting under a inbound=outbound regime, whereas in fact they cannot?". The latter we have been doing for decades, with limited damages to the commons.
  (I *also* think the legal odds are more in our favor with AI-assisted contributions than in the previous case.)
  
  In conversation about 2 months ago permalink
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Sunday, 19-Apr-2026 02:13:18 JST Christine Lemmer-Webber
  in reply to
  - Richard Fontana
  @richardfontana As said here, given the "translation between languages" aspect, I can't really see that as likely to be true https://social.coop/@cwebber/116426770262334234
  Which maybe that means that all this stuff really is public domain, a position I am *fully willing to accept*! But I don't think it's known, and I don't think @bkuhn or @ossguy are eager to adopt that perspective
  In conversation about 2 months ago permalink
  Attachments
  1. No result found on File_thumbnail lookup.
    
    Christine Lemmer-Webber (@cwebber@social.coop)
    
    from Christine Lemmer-Webber
    
    @richardfontana@mastodon.social @bkuhn@copyleft.org @ossguy@copyleft.org That's a problem so hard it throws the "NP complete" debate out the window in favor of something brand new. Given that these codebases have no trouble "translating" from one language's source code into another, how on *earth* could you possibly hope to build a compliance tool around that? Laughable, to anyone who tries.
- Embed this notice
  Richard Fontana (richardfontana@mastodon.social)'s status on Sunday, 19-Apr-2026 02:13:19 JST Richard Fontana
  in reply to
  - Christine Lemmer-Webber
  @cwebber to be clear compliance cannot somehow be built in to the LLM for reasons you stated, but ancillary tools for LLM users to reconstruct provenance exist and conceivably could be made more useful @bkuhn @ossguy
  
  In conversation about 2 months ago permalink
- Embed this notice
  Evan Prodromou (evan@cosocial.ca)'s status on Sunday, 19-Apr-2026 02:16:57 JST Evan Prodromou
  in reply to
  - Richard Fontana
  - Christine Lemmer-Webber
  @cwebber
  This is probably a healthy concern.
  I think there might be some good ways to hedge one's bets, though.
  Use LLMs for rubber ducking, code scanning and review, rather than code generation.
  Keep LLM code contributions minimal and unremarkable, too.
  Don't make them load-bearing. If the code is central to the program, it's too unique.
  @richardfontana @bkuhn @ossguy
  
  In conversation about 2 months ago permalink
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Sunday, 19-Apr-2026 02:17:52 JST Christine Lemmer-Webber
  in reply to
  - Evan Prodromou
  - Richard Fontana
  @evan @richardfontana @bkuhn @ossguy Yeah! I actually already said elsewhere in the thread I don't think we need to worry about using these tools for such scenarios from a *licensing* perspective, only when the genAI is explicitly checked into the codebase
  
  In conversation about 2 months ago permalink
- Embed this notice
  Evan Prodromou (evan@cosocial.ca)'s status on Sunday, 19-Apr-2026 02:28:19 JST Evan Prodromou
  in reply to
  - Richard Fontana
  - Christine Lemmer-Webber
  I think the worst case scenario is that the inserted code matches exactly one snippet in the training data.
  So you could try to go for zero matches, by using such idiosyncratic and unrecommended coding conventions that nobody else has code like yours.
  Or you could try to go for lots of matches, by using bog standard coding conventions and software patterns.
  @cwebber @richardfontana @bkuhn @ossguy
  In conversation about 2 months ago permalink
  Attachments
  1. Domain not in remote thumbnail source whitelist: cdn1.dan.com
    
    data.so - Domain Name For Sale | Dan.com
    
    from @undeveloped
    
    I found a great domain name for sale on Dan.com. Check it out!
- Embed this notice
  Evan Prodromou (evan@cosocial.ca)'s status on Sunday, 19-Apr-2026 02:28:36 JST Evan Prodromou
  in reply to
  - Richard Fontana
  - Christine Lemmer-Webber
  @cwebber the weights themselves?
  @richardfontana @bkuhn @ossguy
  
  In conversation about 2 months ago permalink
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Sunday, 19-Apr-2026 02:29:43 JST Christine Lemmer-Webber
  in reply to
  - Evan Prodromou
  - Richard Fontana
  @evan @richardfontana @bkuhn @ossguy Sorry, I missed a word when I edited the sentence, I meant "genAI output"
  
  In conversation about 2 months ago permalink
- Embed this notice
  Evan Prodromou (evan@cosocial.ca)'s status on Sunday, 19-Apr-2026 02:32:58 JST Evan Prodromou
  in reply to
  - Richard Fontana
  - Christine Lemmer-Webber
  But maybe that's wrong; I don't know. Maybe if I wrote a Person.setName() method that was in the training set, and the LLM generated an identical Person.setName() code snippet for someone else, I could claim that the code is a copyright violation, even if there were thousands of other identical and independent Person.setName() methods in the training set.
  @cwebber @richardfontana @bkuhn @ossguy
  
  In conversation about 2 months ago permalink
- Embed this notice
  Evan Prodromou (evan@cosocial.ca)'s status on Sunday, 19-Apr-2026 02:34:52 JST Evan Prodromou
  in reply to
  - Richard Fontana
  - Christine Lemmer-Webber
  @cwebber it's sometimes a distinction that people blur!
  @richardfontana @bkuhn @ossguy
  
  In conversation about 2 months ago permalink
- Embed this notice
  Stephen Foskett (sfoskett@techfieldday.net)'s status on Sunday, 19-Apr-2026 02:54:50 JST Stephen Foskett
  in reply to
  @evan @cwebber @bkuhn @ossguy @richardfontana Another major concern is that works generated by AI are not copyrightable per the US Supreme Court. So code generated by an LLM can not be licensed at all, open or closed. https://www.reuters.com/legal/government/us-supreme-court-declines-hear-dispute-over-copyrights-ai-generated-material-2026-03-02/
  In conversation about 2 months ago permalink
  Attachments
  1. Untitled attachment
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Sunday, 19-Apr-2026 02:54:50 JST Christine Lemmer-Webber
  in reply to
  @sfoskett @evan @bkuhn @ossguy @richardfontana That outcome I am not worried about; code that's not copyrightable is considered in the public domain within the US, which means there aren't any real risks to incorporating into FOSS projects. But the Supreme Court punted on it, they didn't rule that way.
  
  In conversation about 2 months ago permalink
- Embed this notice
  Evan Prodromou (evan@cosocial.ca)'s status on Sunday, 19-Apr-2026 02:55:36 JST Evan Prodromou
  in reply to
  @sfoskett you can incorporate public domain code into a licensed work.
  @cwebber @bkuhn @ossguy @richardfontana
  
  In conversation about 2 months ago permalink
- Embed this notice
  Evan Prodromou (evan@cosocial.ca)'s status on Sunday, 19-Apr-2026 02:56:00 JST Evan Prodromou
  in reply to
  - Richard Fontana
  - Christine Lemmer-Webber
  @cwebber
  This is a really interesting question! TIL about CA vs. Altai and the abstraction-filtration-comparison test.
  https://en.wikipedia.org/wiki/Computer_Associates_International%2C_Inc._v._Altai%2C_Inc.
  I'm not sure how automatable it is. Interesting to try though!
  @richardfontana @bkuhn @ossguy
  
  In conversation about 2 months ago permalink
- Embed this notice
  Richard Fontana (richardfontana@mastodon.social)'s status on Sunday, 19-Apr-2026 03:21:39 JST Richard Fontana
  in reply to
  - Evan Prodromou
  - Christine Lemmer-Webber
  @evan I feel pretty confident in saying the abstraction-filtration-comparison test cannot possibly be automated @cwebber @bkuhn @ossguy
  
  In conversation about 2 months ago permalink
- Embed this notice
  Evan Prodromou (evan@cosocial.ca)'s status on Sunday, 19-Apr-2026 03:21:39 JST Evan Prodromou
  in reply to
  - Richard Fontana
  - Christine Lemmer-Webber
  @richardfontana @cwebber @bkuhn @ossguy Yeah, I thought my job couldn't be automated, either, and yet here we are.
  
  In conversation about 2 months ago permalink
- Embed this notice
  Evan Prodromou (evan@cosocial.ca)'s status on Sunday, 19-Apr-2026 03:25:18 JST Evan Prodromou
  in reply to
  - Richard Fontana
  - Christine Lemmer-Webber
  @richardfontana @cwebber @bkuhn @ossguy Seriously, though, a lot of the work seems like it is tractable to LLM automation?
  Like, the abstraction part seems like it's just summarizing components at the function, module, and program level. This is the command-line argument parser, this is the database abstraction layer, this is the logging module. LLMs are pretty good at this!
  
  In conversation about 2 months ago permalink
- Embed this notice
  Evan Prodromou (evan@cosocial.ca)'s status on Sunday, 19-Apr-2026 03:31:49 JST Evan Prodromou
  in reply to
  - Richard Fontana
  - Christine Lemmer-Webber
  For filtration, it seems like merger or scènes à faire would also be kind of automatable, maybe with human oversight. Is there a way to make a mailing daemon without a logging module? Maybe, but it's so common that everyone does it that way. Could you have a Person class without a getter and setter for the name? Probably not?
  @richardfontana @cwebber @bkuhn @ossguy
  
  In conversation about 2 months ago permalink
- Embed this notice
  Evan Prodromou (evan@cosocial.ca)'s status on Sunday, 19-Apr-2026 03:36:10 JST Evan Prodromou
  in reply to
  - Richard Fontana
  - Christine Lemmer-Webber
  The comparison seems tough, but I'd put an LLM to the task. "How similar are the database abstraction layers in activitypub-bot and Fedify?" Again, I'd probably want some human review, but for that code stuff LLMs are pretty good.
  @richardfontana @cwebber @bkuhn @ossguy
  
  In conversation about 2 months ago permalink
- Embed this notice
  Evan Prodromou (evan@cosocial.ca)'s status on Sunday, 19-Apr-2026 03:42:08 JST Evan Prodromou
  in reply to
  - Richard Fontana
  - Christine Lemmer-Webber
  I consider myself an expert on this process since I learned about it 45 minutes ago, but it seems like AFC follows the hierarchical layers of modern programming-in-the-large -- statements, functions, modules, packages, program. That is the stuff that LLMs handle pretty well.
  @richardfontana @cwebber @bkuhn @ossguy
  
  In conversation about 2 months ago permalink
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Sunday, 19-Apr-2026 03:47:53 JST Christine Lemmer-Webber
  in reply to
  - Evan Prodromou
  - Richard Fontana
  @bkuhn @evan @richardfontana @ossguy One thing I worry about is that the chardet rewrite might not generalize. The chardet maintainer used *more* care in the rewrite than most projects which have followed suit for laundering would. https://dan-blanchard.github.io/blog/chardet-rewrite-controversy/
  Even then, it raises questions, because even the maintainer admits, chardet was part of the training set.
  It's very similar to how a friend recently sent me, "Claude managed to reverse engineer Bubble Bobble without using any reverse engineering tools, just inspecting the binary!" https://kotrotsos.medium.com/we-pointed-an-ai-at-raw-binary-files-from-1986-662ba30120f3
  Which like, Claude is enough of a black box already but Bubble Bobble is also one of the most studied ROMs in history, so that's hard to evaluate whether it's true. You'd have to choose a less studied ROM as a test case, not Bubble Bobble, which the internet has discussed to death.
  In conversation about 2 months ago permalink
  Attachments
  1. No result found on File_thumbnail lookup.
    
    set.it
    
    This domain may be for sale!
  2. Untitled attachment
  3. Untitled attachment
- Embed this notice
  Bradley M. Kühn (bkuhn@fedi.copyleft.org)'s status on Sunday, 19-Apr-2026 03:47:54 JST Bradley M. Kühn
  in reply to
  @evan
  I actually think that these copyright concepts aren't particularly automatable, and even if we try, its pure arms race.
  And the merger doctrine isn't the big problem here, it is the more complex analysis where merger doctrine clearly doesn't apply that needs analysis and I suspect the analysis is difficult to (even partially) automate.
  But I'm looking into it.
  Cf: chardet situation https://github.com/chardet/chardet/issues/355#issuecomment-4145369025
  @richardfontana @cwebber @ossguy
  In conversation about 2 months ago permalink
  Attachments
  1. Untitled attachment
  2. Domain not in remote thumbnail source whitelist: opengraph.githubassets.com
    
    Clarification regarding prior comment in #334 (scope and LGPL question) · Issue #355 · chardet/chardet
    
    This is not an attempt to reopen #334 (which was resolved by the license change in #349). Rather (since the discussion is locked there) I want to briefly clarify one point from my earlier comment, ...
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Sunday, 19-Apr-2026 03:50:36 JST Christine Lemmer-Webber
  in reply to
  - Evan Prodromou
  - Richard Fontana
  @bkuhn @evan @richardfontana @ossguy Probably a ton of people here think I am anti-AI-output, and that I would be upset to find out that the chardet rewrite were legal.
  Actually, I'm not! I'd be fine with the ability to copyright launder software to some degree, as long as we could do the same for proprietary software (including in binary form).
  I'm concerned about whether or not we have an *equitable* situation, though. And I'm *more concerned* that we need to advise people, who are incorporating code *today*.
  
  In conversation about 2 months ago permalink
- Embed this notice
  Evan Prodromou (evan@cosocial.ca)'s status on Sunday, 19-Apr-2026 04:11:31 JST Evan Prodromou
  in reply to
  - Richard Fontana
  - Christine Lemmer-Webber
  @bkuhn I just did an abstraction and filtration pass on a medium-sized application framework (~30K LOC), and I think it did a pretty good job:
  https://claude.ai/share/071ccb69-5d22-4673-905a-362d9663e7d0
  It missed a few things (e.g. ActivityPub relays). Then again, I have no idea how this kind of review is supposed to work. I didn't go down to the function or statement level -- that'd probably be much noisier.
  Maybe chardet 6 and 7 would be a better test of the technique?
  @richardfontana @cwebber @ossguy
  
  In conversation about 2 months ago permalink
- Embed this notice
  Evan Prodromou (evan@cosocial.ca)'s status on Sunday, 19-Apr-2026 04:35:25 JST Evan Prodromou
  in reply to
  - Richard Fontana
  - Christine Lemmer-Webber
  If I were going to productize this, I'd do AF passes on a huge training dataset like The Stack and generate some kind of fingerprint for each program. (Estimated cost: billions!)
  https://huggingface.co/datasets/bigcode/the-stack
  Then, I'd have a tool to let you fingerprint your own code and C it against the big database -- maybe give you a list of high-similarity codebases.
  And you could re-run the comparison each time you push to Git -- maybe only Cing what changed.
  @bkuhn @richardfontana @cwebber @ossguy
  In conversation about 2 months ago permalink
  Attachments
  1. Untitled attachment
- Embed this notice
  Evan Prodromou (evan@cosocial.ca)'s status on Sunday, 19-Apr-2026 05:17:01 JST Evan Prodromou
  in reply to
  - Richard Fontana
  - Christine Lemmer-Webber
  I gave it a try. It's quite wordy! Claude thought that a lot of Pilgrim's work would be filtered since it was a direct port from the Mozilla C++ codebase. I pushed back that they shared the same license, and it loosened up that constraint.
  https://claude.ai/share/e4aae73c-14d1-462e-9773-4381adde54f7
  Warning: if you read this document, it will get AI in you, and it will make you AI and you will become an AI-booster like me and Sam Altman. It will also burn down the rainforest.
  @bkuhn @richardfontana @cwebber @ossguy
  
  In conversation about 2 months ago permalink
- Embed this notice
  Evan Prodromou (evan@cosocial.ca)'s status on Sunday, 19-Apr-2026 05:19:08 JST Evan Prodromou
  in reply to
  - Richard Fontana
  - Christine Lemmer-Webber
  I think you could make the case that Claude is not an uninterested party in this discussion, since Blanchard used Claude to generate the code, so maybe it's lying to cover up its tracks.
  @bkuhn @richardfontana @cwebber @ossguy
  
  In conversation about 2 months ago permalink
- Embed this notice
  Evan Prodromou (evan@cosocial.ca)'s status on Sunday, 19-Apr-2026 05:26:13 JST Evan Prodromou
  in reply to
  - Richard Fontana
  - Christine Lemmer-Webber
  I might ask ChatGPT to give it a try, and give it some extra incentive to dig deeper because if it digs up some dirt on Claude it'd be good for business.
  @bkuhn @richardfontana @cwebber @ossguy
  
  In conversation about 2 months ago permalink
- Embed this notice
  Evan Prodromou (evan@cosocial.ca)'s status on Sunday, 19-Apr-2026 08:00:47 JST Evan Prodromou
  in reply to
  @bkuhn @richardfontana @cwebber @ossguy @karen thanks! I hope I wasn't too flip.
  
  In conversation about 2 months ago permalink
- Embed this notice
  Bradley M. Kühn (bkuhn@fedi.copyleft.org)'s status on Sunday, 19-Apr-2026 08:00:48 JST Bradley M. Kühn
  in reply to
  @evan wrote:
  > “I consider myself an expert on this process since I learned about it 45 minutes ago ”
  This is the second time you've made me 🤣 in this thread. Thanks for being comic relief (and I know that's not *all* you're doing, but that part is particularly helpful). Thank you!
  Cc:
  @richardfontana @cwebber @ossguy
  @karen
  
  In conversation about 2 months ago permalink
- Embed this notice
  Evan Prodromou (evan@cosocial.ca)'s status on Sunday, 19-Apr-2026 08:04:22 JST Evan Prodromou
  in reply to
  @bkuhn @richardfontana @cwebber @ossguy @karen sadly no!
  I really don't like having anyone, including AI systems, write for me under my own name. Not least because I don't like the style and tone of ChatGPT and friends. They just write very blandly.
  
  In conversation about 2 months ago permalink
- Embed this notice
  Bradley M. Kühn (bkuhn@fedi.copyleft.org)'s status on Sunday, 19-Apr-2026 08:04:23 JST Bradley M. Kühn
  in reply to
  LLM-backed genAI never makes as good jokes as you do, @evan
  But are you finally coming clean with us here today that, in fact, #EvanPoll's are all created by a genAI system?
  Cc: @richardfontana @cwebber @ossguy @karen
  
  In conversation about 2 months ago permalink
- Embed this notice
  Evan Prodromou (evan@cosocial.ca)'s status on Sunday, 19-Apr-2026 08:06:19 JST Evan Prodromou
  in reply to
  @bkuhn @richardfontana @cwebber @ossguy @karen hahahaha sorry!
  It wasn't till I had gone through the exercise that I realized I was doing work in a similar vein that you'd already committed to do. I hope it wasn't too monstrous.
  
  In conversation about 2 months ago permalink
- Embed this notice
  Richard Fontana (richardfontana@mastodon.social)'s status on Sunday, 19-Apr-2026 08:06:21 JST Richard Fontana
  in reply to
  - Evan Prodromou
  - Christine Lemmer-Webber
  @evan oh I mean of course you could use LLMs to help with the analysis @cwebber @bkuhn @ossguy
  
  In conversation about 2 months ago permalink
- Embed this notice
  Bradley M. Kühn (bkuhn@fedi.copyleft.org)'s status on Sunday, 19-Apr-2026 08:06:21 JST Bradley M. Kühn
  in reply to
  @richardfontana wrote:
  > “oh I mean of course you could use LLMs to help with the analysis ”
  I'm catching up backwards on this thread, but do you see now the monster you created by telling @evan that?
  🤣
  cc: @cwebber @ossguy @karen
  
  In conversation about 2 months ago permalink
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Sunday, 19-Apr-2026 10:38:03 JST Christine Lemmer-Webber
  in reply to
  - Jed Brown
  - Richard Fontana
  @bkuhn
  > Re: “copyleft-only #LLM”: I didn't propose that. I proposed copylefting the human-modified output of LLMs.
  You didn't propose it, but @ossguy brought it up here: https://fedi.copyleft.org/@ossguy/116411885602822736
  You two have been speaking fairly collaboratively in this thread, so I'm assuming relatively synced at the moment. Since I presume the goal of "only train on copylefted [and implied, compatible] software" would not be, from your end, to erase copyleft, my assumption here is that the hope was that the output was also copylefted.
  And I presume reading this that you do hope that would be the presumption: https://fedi.copyleft.org/@bkuhn/116428083639528264
  @richardfontana @jedbrown
  In conversation about 2 months ago permalink
  Attachments
  1. No result found on File_thumbnail lookup.
    
    Denver Gingerich (@ossguy@copyleft.org)
    
    from Denver Gingerich
    
    @js@nil.im The intent of the post was not to enumerate the issues with LLMs, which I think most of us here know well. Rather, we want to think about how to engage with people about their newfound ability to make software, and how to use that to benefit others. If that means we need to make models trained only on copylefted software, so be it. But let's have that as a separate discussion.
  2. No result found on File_thumbnail lookup.
    
    Bradley M. Kühn (@bkuhn@copyleft.org)
    
    from Bradley M. Kühn
    
    @RichardJActon@fosstodon.org The copyleft-ish hack I propose is *we* (FOSS community) assume that any output of an LLM-backed genAI system *is* copylefted (since we are pretty sure all such systems — at least those designed for software development assist — have been trained on copylefted codebases). Then, we copyleft any work that comes out of the system. The only threat is proprietary software in the training set, & the industry can't abide enforcing *that*! @cwebber@social.coop @ossguy @richardfontana@mastodon.social @evan@cosocial.ca @kees@hachyderm.io
- Embed this notice
  Bradley M. Kühn (bkuhn@fedi.copyleft.org)'s status on Sunday, 19-Apr-2026 10:38:05 JST Bradley M. Kühn
  in reply to
  @cwebber
  Re: “polluting”, my reply is: https://fedi.copyleft.org/@bkuhn/116426437134023846 (elsewhere in thread).
  Re: “copyleft-only #LLM”: I didn't propose that. I proposed copylefting the human-modified output of LLMs.
  Re: “two scenarios”: IMO you propose a false dichotomy.
  I hope you come to one of #SFC's public sessions on this, as I'd be glad to talk more about it, & this discussion doesn't lend itself to online debate because it's so complex.
  cc: @ossguy @richardfontana
  @jedbrown
  #AI #OpenSource #FOSS
  In conversation about 2 months ago permalink
  Attachments
  1. No result found on File_thumbnail lookup.
    
    Bradley M. Kühn (@bkuhn@copyleft.org)
    
    from Bradley M. Kühn
    
    @cwebber@social.coop I agree with @ossguy in particular because if *we* are copylefting our code (even if assisted by #LLM-backed gen-#AI), we won't face a copyleft claim later. Furthermore, it is highly unlikely these LLMs are (a) trained on proprietary software, and (b) any proprietary software company that so-trained would later claim infringement. #Microsoft has all but admitted they refuse to train Copilot on their own code anyway. Cc: @LordCaramac@discordian.social @richardfontana@mastodon.social
  2. Untitled attachment
  3. No result found on File_thumbnail lookup.
    
    http://complex.cc/
- Embed this notice
  Rich Felker (dalias@hachyderm.io)'s status on Sunday, 19-Apr-2026 11:14:28 JST Rich Felker
  in reply to
  - Richard Fontana
  - Christine Lemmer-Webber
  @cwebber @bkuhn @ossguy @richardfontana On top of "potential legal toxic waste dumps" we'd be making them known technical toxic waste dumps. 🙃
  
  In conversation about 2 months ago permalink
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Sunday, 19-Apr-2026 21:22:16 JST Christine Lemmer-Webber
  in reply to
  @bkuhn @richardfontana @RichardJActon @ossguy @evan @kees I presume a new copyleft-next clause would result in something that's GPL-incompatible if it's a new restriction though, right?
  If that's true, wouldn't it result in a mix of incompatible copyleft licenses combined into the output? Or do you have a plan on how to deal with this?
  
  In conversation about 2 months ago permalink
- Embed this notice
  Bradley M. Kühn (bkuhn@fedi.copyleft.org)'s status on Sunday, 19-Apr-2026 21:22:17 JST Bradley M. Kühn
  in reply to
  @richardfontana
  It's solved with a new copyleft-next clause I have not pitched you yet.
  Remember how I keep telling we need to talk every week? 🤣
  @RichardJActon @cwebber @ossguy @evan @kees
  
  In conversation about 2 months ago permalink
- Embed this notice
  Richard Fontana (richardfontana@mastodon.social)'s status on Sunday, 19-Apr-2026 21:22:18 JST Richard Fontana
  in reply to
  @bkuhn not sure I understand the hack. Is the hack a persuasive argument to convince FOSS community to copyleft LLM output? @RichardJActon @cwebber @ossguy @evan @kees
  
  In conversation about 2 months ago permalink
- Embed this notice
  Richard J. Acton (richardjacton@fosstodon.org)'s status on Sunday, 19-Apr-2026 21:22:19 JST Richard J. Acton
  in reply to
  - Richard Fontana
  - Christine Lemmer-Webber
  @cwebber @bkuhn @ossguy @richardfontana Under this view it doesn't matter how the training data was licensed as it's a fair use defense. The outputs being uncopyrightable / effectively public domain allows people to claim they wrote it when it's convenient and they want to be able to copyright it as it's hard to prove if it was AI generated or human authored. And simultaneously to claim that it was the output of and LLM when they want to strip inconvenient licensing terms.
  
  In conversation about 2 months ago permalink
- Embed this notice
  Bradley M. Kühn (bkuhn@fedi.copyleft.org)'s status on Sunday, 19-Apr-2026 21:22:19 JST Bradley M. Kühn
  in reply to
  @RichardJActon
  The copyleft-ish hack I propose is *we* (FOSS community) assume that any output of an LLM-backed genAI system *is* copylefted (since we are pretty sure all such systems — at least those designed for software development assist — have been trained on copylefted codebases).
  Then, we copyleft any work that comes out of the system.
  The only threat is proprietary software in the training set, & the industry can't abide enforcing *that*!
  @cwebber @ossguy @richardfontana
  @evan
  @kees
  
  In conversation about 2 months ago permalink
- Embed this notice
  Richard J. Acton (richardjacton@fosstodon.org)'s status on Sunday, 19-Apr-2026 21:22:20 JST Richard J. Acton
  in reply to
  - Richard Fontana
  - Christine Lemmer-Webber
  @cwebber @bkuhn @ossguy @richardfontana I'd don't see a great way out of the copyright stripping conclusions for them without changes to the law. As I understand their defense for training on copyrighted materials - it's predicated on the models being a "transformative" and not competing directly with the original works in the market. The models themselves don't compete with the training material only their outputs do - and the LLM companies want any liability for that to be on users not them.
  
  In conversation about 2 months ago permalink
- Embed this notice
  Rich Felker (dalias@hachyderm.io)'s status on Sunday, 19-Apr-2026 21:36:57 JST Rich Felker
  in reply to
  @cwebber @bkuhn @richardfontana @RichardJActon @ossguy @evan @kees I'm cynically assuming the new license would be more permissive. Basically GPLv3 plus explicit permission to incorporate into a "copyleft only LLM". 🤮 To normalize the idea of such a thing existing and get leverage against parties incorporating into other LLMs by saying "no that's only allowed under these terms" rather than "no you can't do that".
  
  In conversation about 2 months ago permalink
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Sunday, 19-Apr-2026 21:44:11 JST Christine Lemmer-Webber
  in reply to
  @dalias @bkuhn @richardfontana @RichardJActon @ossguy @evan @kees Hm, that could be clever if perceived as a permission rather than restriction. I guess it would have to be under the argument that it isn't a restriction, because it would depend on whether or not the default legislative environment provided the restriction
  
  In conversation about 2 months ago permalink
- Embed this notice
  Rue Mohr (ruenahcmohr@infosec.exchange)'s status on Monday, 20-Apr-2026 04:33:22 JST Rue Mohr
  in reply to
  @silverwizard @richardfontana @cwebber @bkuhn @ossguy
  OMG, I didn't think it could do that...
  (its right)
  In conversation about 2 months ago permalink
  Attachments
  1. Given a hex dump of a binary, that contained no debug information, claude managed to write source code for the program. This is a hello world program I made from assembler source.
    https://media.infosec.exchange/infosec.exchange/media_attachments/files/116/433/009/506/486/997/original/07cfb22ed1c57946.png
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Monday, 20-Apr-2026 04:33:22 JST Christine Lemmer-Webber
  in reply to
  @RueNahcMohr @silverwizard @richardfontana @bkuhn @ossguy Yes, my understand is they use a lot of tool calling of existing FOSS binary analysis / disassembly tools in the process
  
  In conversation about 2 months ago permalink
- Embed this notice
  Rue Mohr (ruenahcmohr@infosec.exchange)'s status on Monday, 20-Apr-2026 04:33:24 JST Rue Mohr
  in reply to
  - Richard Fontana
  - Christine Lemmer-Webber
  @cwebber @bkuhn @ossguy @richardfontana
  Taking technical debt to new highs.
  I'm waiting for LLMs that can tear down copyright programs from binary images and rewrite them as opensource. oh see the sparks fly then!
  
  In conversation about 2 months ago permalink
- Embed this notice
  silverwizard (silverwizard@convenient.email)'s status on Monday, 20-Apr-2026 04:33:24 JST silverwizard
  in reply to
  malus.sh assuming they're serious @RueNahcMohr @richardfontana @cwebber @ossguy @bkuhn
  In conversation about 2 months ago permalink
  Attachments
  1. Domain not in remote thumbnail source whitelist: malus.sh
    
    MALUS - Clean Room as a Service | Liberation from Open Source Attribution
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Monday, 20-Apr-2026 21:59:05 JST Christine Lemmer-Webber
  in reply to
  - Richard Fontana
  - Christian Hergert
  @chergert @bkuhn @ossguy @richardfontana Potentially. The question is whether or not "trade secret" also covers their "shared source" releases
  
  In conversation about 2 months ago permalink
- Embed this notice
  Christian Hergert (chergert@my.devsuite.app)'s status on Monday, 20-Apr-2026 21:59:11 JST Christian Hergert
  in reply to
  - Richard Fontana
  - Christine Lemmer-Webber
  @cwebber @bkuhn @ossguy @richardfontana
  > I also sense that Microsoft is perfectly fine with the idea that you can "copyright launder" a codebase from the GPL to perhaps the public domain, but if someone did that to their own leaked source code, they would be very upset.
  If it's leaked, that implies trade-secret law rather than copyright, no?
  
  In conversation about 2 months ago permalink
- Embed this notice
  Christine Lemmer-Webber (cwebber@social.coop)'s status on Tuesday, 21-Apr-2026 00:22:00 JST Christine Lemmer-Webber
  in reply to
  - Richard Fontana
  - Christian Hergert
  @chergert @bkuhn @ossguy @richardfontana Well under the more modern understanding that FOSS software licenses are both copyright and contract, I suppose the concern is also copyright and contract laundering generally ;P
  
  In conversation about 2 months ago permalink
- Embed this notice
  Christian Hergert (chergert@my.devsuite.app)'s status on Tuesday, 21-Apr-2026 00:22:02 JST Christian Hergert
  in reply to
  - Richard Fontana
  - Christine Lemmer-Webber
  @cwebber @bkuhn @ossguy @richardfontana
  That code availability is by contract, so presumably it still falls under contract law before copyright?
  Not that I'm not all for someone finding out :)
  
  In conversation about 2 months ago permalink

Public

Conversation

Notices

Feeds