Embed Notice

HTML Code

<blockquote style="position: relative; padding-left: 55px;"><section><a href="https://cosocial.ca/users/evan/statuses/114360599218276647">Evan Prodromou (evan@cosocial.ca)'s status on Saturday, 19-Apr-2025 04:24:37 JST</a><a href="https://cosocial.ca/@evan" title="evan@cosocial.ca"><img src="https://gnusocial.jp/avatar/77066-48-20240924220431.webp" width="48" height="48" alt="Evan Prodromou" style="position: absolute; left: 0; top: 0;">Evan Prodromou</a><div><a href="https://tldr.nettime.org/@tomjennings/114359938693213100" rel="in-reply-to">in reply to</a><ul><li></ul></div></section><article><p><a href="https://tldr.nettime.org/@tomjennings">@tomjennings</a> I'm not a fan of the "stolen work" idea. I don't think it's accurate. The output of that indexing process isn't a verbatim copy of the original corpus of text, but something much more like a search engine index. The generated text often includes facts and ideas from the original works, but that's not covered by copyright or other IP protection. I agree that LLM training bots should respect robots.txt and other signals that the authors don't want their work used for training.</p></article><footer><a rel="bookmark" href="https://gnusocial.jp/conversation/4915012#notice-9631222">In conversation</a><time datetime="2025-04-19T04:24:37+09:00" title="Saturday, 19-Apr-2025 04:24:37 JST">about a month ago</time> <span>from <span><a href="https://cosocial.ca/@evan/114360599218276647" rel="external" title="Sent from cosocial.ca via ActivityPub">cosocial.ca</a></span></span><a href="https://cosocial.ca/@evan/114360599218276647">permalink</a></footer></blockquote>

Corresponding Notice

Embed this notice
Evan Prodromou (evan@cosocial.ca)'s status on Saturday, 19-Apr-2025 04:24:37 JSTEvan Prodromou
in reply to
- tom jennings
@tomjennings I'm not a fan of the "stolen work" idea. I don't think it's accurate. The output of that indexing process isn't a verbatim copy of the original corpus of text, but something much more like a search engine index. The generated text often includes facts and ideas from the original works, but that's not covered by copyright or other IP protection. I agree that LLM training bots should respect robots.txt and other signals that the authors don't want their work used for training.
In conversationabout a month ago from cosocial.capermalink

Public

Embed Notice

HTML Code

Corresponding Notice