Conversation

Notices

Embed this notice
Paul Cantrell (inthehands@hachyderm.io)'s status on Saturday, 23-May-2026 00:39:10 JST Paul Cantrell

RE: https://unstable.systems/@jneen/116618931097778342
Worth looking at both the quoted text here and •especially• the linked page, which is quite good.
I’ll add another item of my own. The first screenshot mentions giving an LLM the task of “implementing an HTTP server in JavaScript from scratch” in 90 minutes. Sounds impressive, right? Until you remember that every open-source Javascript HTTP server in existence ••was in the training data••.
1/
In conversation about 19 days ago from hachyderm.io permalink
Attachments
1. Domain not in remote thumbnail source whitelist: cdn.unstable.systems
  
  jneen collective (@jneen@unstable.systems)
  
  from jneen collective
  
  Attached: 2 images Extremely good and concise summary of the statistical/methodological shenanigans involved in measuring the so-called "AI boost" Here's a couple of my favourites: https://third-bit.com/2026/05/20/twelve-ways-to-be-wrong/
- Embed this notice
  Paul Cantrell (inthehands@hachyderm.io)'s status on Saturday, 23-May-2026 00:41:44 JST Paul Cantrell
  in reply to
  
  IT WAS IN THE TRAINING DATA. Your gave the machine a pile of correct answers and free license to plagiarize.
  I remember people being wowed that Claude Code could implement a complete C compiler. But somehow it doesn’t sound quite as impressive when you phrase it as “given every existing C compiler as input, the LLM can produce a C compiler as output.”
  2/
  
  In conversation about 19 days ago permalink
- Embed this notice
  Paul Cantrell (inthehands@hachyderm.io)'s status on Saturday, 23-May-2026 00:49:57 JST Paul Cantrell
  in reply to
  
  It’s just •astonishing• how many eye-popping stories of LLMs doing amazing omg-verge-of-magical-superintelligence things turn out to be just unvarnished plagiarism.
  In the first months after ChatGPT’s release, I remember a French dept colleague being amazed that GPT could translate and summarize a passage of Le Petit Prince.
  It was a lot less impressive when I dug up the 2 or 3 online passages which it had copied almost verbatim and stitched together (sprinkling in a couple of extra words that made it less accurate).
  3/
  In conversation about 19 days ago permalink
  Attachments
  1. Untitled attachment
  2. Domain not in remote thumbnail source whitelist: www.prince.it
    
    Prince.it - Oltre Kiss, Purple Rain e Sign 'O' The Times
    
    from admin
    
    Un viaggio nell'universo dell'artista si Minneapolis, cercando di andare oltre le classiche Kiss, Purple Rain e Sign 'O' The Times.
  Rich Felker repeated this.
- Embed this notice
  Paul Cantrell (inthehands@hachyderm.io)'s status on Saturday, 23-May-2026 00:53:01 JST Paul Cantrell
  in reply to
  
  To be fair, competent plagiarism is a nontrivial task. It takes skill for a human to do: finding relevant fragments from an input dataset and stitching those fragments together in a well-formed way is not nothing! It’s impressive and interesting that people have figured out how to make machines do it.
  But •that• is not the promise on which investors are valuing the AI megajuggernaut at trillions of dollars.
  The reasons plagiarism is cheating in school and malpractice in professional contexts are many of the same reasons that LLMs are not going to replace all knowledge-based human labor.
  4/
  
  In conversation about 19 days ago permalink
- Embed this notice
  Paul Cantrell (inthehands@hachyderm.io)'s status on Saturday, 23-May-2026 00:56:38 JST Paul Cantrell
  in reply to
  
  The reasons we consider plagiarism to be cheating in school and malpractice in professional contexts are many of the same reasons that LLMs are not going to replace all knowledge-based human labor. (Details left as an exercise for the reader.)
  And yes, a whole lot of what LLMs do •would• count as plagiarism if one of my students did it manually, and •should• count as plagiarism just the same if they use a machine to do it — not just in a “that’s cheating!!” sense, but perhaps more importantly in a “that’s not really doing the work” sense.
  5/
  
  In conversation about 19 days ago permalink
- Embed this notice
  Paul Cantrell (inthehands@hachyderm.io)'s status on Saturday, 23-May-2026 01:15:31 JST Paul Cantrell
  in reply to
  
  One last example:
  The first LLM code example that really made my eyes pop was early after the release of GPT, when somebody got it to combine Breakout with Conway’s Game of Life (a truly delightful idea). It worked!
  Funny thing: the Breakout code and the Life code had a •completely• different style and flavor. Red flag. In about 15 minutes of web searching, I was able to find one of the projects (can’t remember if it was the Breakout or the Life half) which it had copied wholesale, with just a few variable renames. And the other half? It was in Python, but it used dictionaries where it really should have used objects — tons of `thing["prop"]` where it should have said `thing.prop`, and lots of other un-Pythonic stuff besides. It was a machine translate of code from another language, very likely Javascript.
  The entire thing was a plagiarized Breakout and a plagiarized Game of Life, one transpiled, and all stuck together in a single run loop. To be fair, figuring out how to (1) run both halves of the logic from a single loop and (2) count the Life cells as Breakout bricks is work I'd cheer on from a second-semester intro CS student! It's not, however, quite what's being sold by these companies.
  6/
  
  In conversation about 19 days ago permalink
- Embed this notice
  Paul Cantrell (inthehands@hachyderm.io)'s status on Saturday, 23-May-2026 01:19:28 JST Paul Cantrell
  - datarama
  @datarama
  I know just enough French to have read Le Petit Prince in the original language (with some struggle), and…it really is beautiful in French in a way that translations don't capture. “On ne voit bien qu'avec le coeur” can translate into English quite directly as “One does not see well but with the heart,” but it just doesn't have the same poetry and magic at all.
  
  In conversation about 19 days ago permalink
- Embed this notice
  Paul Cantrell (inthehands@hachyderm.io)'s status on Saturday, 23-May-2026 01:20:28 JST Paul Cantrell
  in reply to
  
  As per my posts, I have the luxury of not having LLM vendors shoved down my throat, and I generally avoid them for ethical reasons:
  https://hachyderm.io/@inthehands/116581463138461199
  But because this all these questions about the usage and limits of these tools keep crashing through my doors, all of our doors, whatever we think of the ethical showstoppers, well…
  …fight off amazing percentages of LLM overhype with this one weird question.
  /end
  In conversation about 19 days ago permalink
  Attachments
  1. “Goose chasing person” meme. PERSON is labeled “LLM hype.” GOOSE: “Was the answer in the training data? Was the answer in the training data, motherfucker?”
    https://media.hachyderm.io/media_attachments/files/116/619/120/787/182/813/original/aabb4e145abb9319.jpeg
- Embed this notice
  Paul Cantrell (inthehands@hachyderm.io)'s status on Saturday, 23-May-2026 01:22:08 JST Paul Cantrell
  in reply to
  - Michael Busch
  @michael_w_busch
  Yup. Same with passing the bar.
  
  In conversation about 19 days ago permalink
- Embed this notice
  Michael Busch (michael_w_busch@mastodon.online)'s status on Saturday, 23-May-2026 01:22:09 JST Michael Busch
  in reply to
  
  @inthehands
  Back in 2023, OpenAI was hyping ChatGPT by claiming "it can pass the GRE".
  When all it was doing was autocomplete of the answer keys that had been used as input.
  
  In conversation about 19 days ago permalink
- Embed this notice
  Paul Cantrell (inthehands@hachyderm.io)'s status on Saturday, 23-May-2026 01:22:30 JST Paul Cantrell
  in reply to
  - Galbinus Caeli
  @GalbinusCaeli
  To be fair, this is an algorithmically difficult problem that was still largely an open question 10-15 years ago! Scale down your expectations by 2 or 3 orders of magnitude, and modern machine learning is truly impressive.
  Not a $10 trillion industry. But it's impressive in a “cool research” sense, and also in a “oo, that may pose serious societal danger” sense.
  
  In conversation about 19 days ago permalink
- Embed this notice
  Galbinus Caeli (galbinuscaeli@spacey.space)'s status on Saturday, 23-May-2026 01:22:31 JST Galbinus Caeli
  in reply to
  
  @inthehands Given pictures of a giraffe, a rhinoceros, an elephant, and a squirrel, find the squirrel. Feel free to reference dictionaries, encyclopedias, nature documentaries and previous responses to this same question.
  
  In conversation about 19 days ago permalink
- Embed this notice
  Paul Cantrell (inthehands@hachyderm.io)'s status on Saturday, 23-May-2026 01:32:56 JST Paul Cantrell
  in reply to
  - disregard Joe Groff
  @joe
  Yeah, people were having that same argument about humans and creativity on the more academic side of my circles back in 2023. It would be an interesting one if it didn't have all this investment money weighing it down! (Human learning, both technical and artistic, almost always starts with imitation and repetition; clearly it's a building block of this messy constellation of things that we call “intelligence.”)
  I do think the models are getting better at atomizing, as you put it, and I'm disappointed that there's not more research on this family of reverse-mapping problems. One question I've wondered about: can we quantify how much the output depended on a given input? e.g. how would the probability of given output have changed if the model were trained without <pattern> in its training data?
  
  In conversation about 19 days ago permalink
- Embed this notice
  disregard Joe Groff (joe@f.duriansoftware.com)'s status on Saturday, 23-May-2026 01:32:57 JST disregard Joe Groff
  in reply to
  
  @inthehands i've noticed a trend in anecdotes recently where people are finding it harder to trace their novel-seeming LLM outputs back to inputs. i wonder if this is a result of them atomizing their inputs more finely, or being "better" at swapping the tokens around to make output look original. (an AI bro might argue that at some point human creativity is doing the same thing…)
  
  In conversation about 19 days ago permalink
- Embed this notice
  Shafik Yaghmour (shafik@hachyderm.io)'s status on Saturday, 23-May-2026 01:35:36 JST Shafik Yaghmour
  in reply to
  
  @inthehands
  "HTML parsers in Portland" is another great example
  https://hachyderm.io/@shafik/116044646072511071
  
  In conversation about 19 days ago permalink
- Embed this notice
  Shafik Yaghmour (shafik@hachyderm.io)'s status on Saturday, 23-May-2026 01:49:23 JST Shafik Yaghmour
  in reply to
  
  @inthehands
  The real world measurements of CCC were very bad: https://harshanu.space/en/tech/ccc-vs-gcc/
  One of the SQLite benchmarks had a
  158,129x slowdown 😱
  If you read the blog post in detail:
  https://www.anthropic.com/engineering/building-c-compiler
  you will see that they could not longer add new features w/o breaking old features. So effectively it was a dead end long before it could become useful in any real world contexts.
  In conversation about 19 days ago permalink
  Attachments
  1. Untitled attachment
- Embed this notice
  Benoît Jones (huxley@mstdn.social)'s status on Saturday, 23-May-2026 06:17:24 JST Benoît Jones
  in reply to
  - datarama
  - Janeishly
  @janeishly @inthehands @datarama
  Love your argument, and an example came to me as I was reading it.
  Obelix's dog is called Idefix in the original french, which is funny because idée fixe means stubborn (literally fixed idea). But in English translation the dog is called Dogmatix, which has dogma in it but also of course, the word 'dog'!
  
  In conversation about 19 days ago permalink
- Embed this notice
  Janeishly (janeishly@beige.party)'s status on Saturday, 23-May-2026 06:17:26 JST Janeishly
  in reply to
  - datarama
  @inthehands @datarama No, but if it's a good translation there *should* be other parts in English that are better than the original French. As a translator you can't always capture a particular phrase beautifully, but you can average the beauty over the whole text.
  The same applies to humour. Some jokes don't translate but you'll spot an opportunity to get one in elsewhere that wouldn't work in the source language.
  Needless to say, machines don't bother doing this.
  
  In conversation about 19 days ago permalink
  
  Paul Cantrell repeated this.
- Embed this notice
  Paul Cantrell (inthehands@hachyderm.io)'s status on Saturday, 23-May-2026 06:19:22 JST Paul Cantrell
  in reply to
  @huxley @janeishly @datarama
  Lovely example! Asterix is a childhood friend, one part of how I learned what little French I know. My dad kept a sticker of Asterix on the monitor of our Apple ][+, to remind him whenever he was getting frustrated at the computer to be like Asterix: optimistic and indefatigable!
  
  In conversation about 19 days ago permalink
- Embed this notice
  Paul Cantrell (inthehands@hachyderm.io)'s status on Saturday, 23-May-2026 06:24:23 JST Paul Cantrell
  in reply to
  - Petrichor ᚄᚔᚅᚐᚁᚆᚃᚒᚔᚂ
  @sinabhfuil
  The post you are replying to answers the question you're asking.
  
  In conversation about 19 days ago permalink
- Embed this notice
  Petrichor ᚄᚔᚅᚐᚁᚆᚃᚒᚔᚂ (sinabhfuil@mastodon.ie)'s status on Saturday, 23-May-2026 06:24:24 JST Petrichor ᚄᚔᚅᚐᚁᚆᚃᚒᚔᚂ
  in reply to
  @huxley @janeishly @inthehands @datarama how is idefix stubborn? Idée fixée?
  
  In conversation about 19 days ago permalink
- Embed this notice
  Paul Cantrell (inthehands@hachyderm.io)'s status on Saturday, 23-May-2026 06:29:42 JST Paul Cantrell
  in reply to
  @huxley @janeishly @datarama
  Same with Tintin, on both counts.
  Even as a kid, I could tell that the stereotype-based humor had an awful stink to it — though that was also an important early education for me in what stereotyping looked like, how colonialism left a cultural footprint on the colonizers, how culture's relationship with alcoholism had changed, etc.
  I do find that some (L'Oreille Cassée, L'Affaire Calcule, Tintin au Tibet) still hold up for me, notwithstanding the cultural sour notes.
  
  In conversation about 19 days ago permalink
- Embed this notice
  Benoît Jones (huxley@mstdn.social)'s status on Saturday, 23-May-2026 06:29:44 JST Benoît Jones
  in reply to
  - datarama
  - Janeishly
  @inthehands @janeishly @datarama Me too, I read them as a child, in both french and English because my mum is french. Also Tintin, though that hasn't aged so well!
  
  In conversation about 19 days ago permalink
- Embed this notice
  Aatch (aatch@mastodon.nz)'s status on Saturday, 23-May-2026 06:56:59 JST Aatch
  in reply to
  
  @inthehands that reminds of when I saw somebody claim to translate the constitution into Japanese on-the-fly (it was obviously a joke/troll). I pasted it into a machine translation tool and it indeed produced the US Constitution... Including the old-style capitalisation of nouns that 1700's English had.
  The exact translation was in the training data so it produced the exact translation.
  
  In conversation about 19 days ago permalink

Public

Conversation

Notices

Feeds