tired: opt-out of AI training datasets
wired: enthusiastically opt-in all the garbage that's sitting on your disk
Conversation
Notices
-
Embed this notice
Gabriele Svelto (gabrielesvelto@fosstodon.org)'s status on Thursday, 21-Mar-2024 08:33:49 JST Gabriele Svelto
- Matthew Lyon repeated this.
-
Embed this notice
Gabriele Svelto (gabrielesvelto@fosstodon.org)'s status on Thursday, 21-Mar-2024 08:33:49 JST Gabriele Svelto
I wonder if I could cook up a script that turns Star Trek erotic fan fiction into Rust code, then upload *that* to GitHub
-
Embed this notice
Gabriele Svelto (gabrielesvelto@fosstodon.org)'s status on Thursday, 21-Mar-2024 08:33:50 JST Gabriele Svelto
So I just learned what "The Stack" is today: an aggregation of GitHub repos for machine learning from which I can opt out.
But I won't.
I won't because they scraped some hot garbage I wrote in bash and Python that would make you faint. Bottom-of-the-barrel throw-away scripts full of coding crimes. Stuff like
find | grep | awk | xargs | ugh
...invoked via subprocess.run() then fed into more garbage.
I want "artificial intelligence" to learn this. It's going to be fantastic.
Paul Cantrell repeated this.