Conversation

Notices

Embed this notice
✧✦Catherine✦✧ (whitequark@mastodon.social)'s status on Sunday, 24-Aug-2025 20:39:13 JST ✧✦Catherine✦✧

while evaluating "AI" for something i found a new atrocious failure mode. it goes like this:
me: [image] are there differential pairs on this image?
chatgpt: yes. [confident-sounding fluff about diffpairs] should i highlight them?
me: yes
chatgpt: [a similar and quite high resolution, but _completely brand new_ image, with some random-ass highlight, still not featuring any differential pairs]
In conversation about 5 months ago from mastodon.social permalink
Attachments
1. Untitled attachment
  https://files.mastodon.social/media_attachments/files/115/083/543/695/808/549/original/960ab8f535bba9bf.png
2. Untitled attachment
  https://files.mastodon.social/media_attachments/files/115/083/543/997/435/133/original/dd72bf79849bce03.png
- Embed this notice
  ✧✦Catherine✦✧ (whitequark@mastodon.social)'s status on Sunday, 24-Aug-2025 20:44:58 JST ✧✦Catherine✦✧
  in reply to
  
  this is actively evil
  
  In conversation about 5 months ago permalink
- Embed this notice
  poleguy looking for lost tools (poleguy@mastodon.social)'s status on Sunday, 24-Aug-2025 21:46:39 JST poleguy looking for lost tools
  in reply to
  
  @whitequark how is this new? I haven't used ai for combined chat and images much, but my attempts at generative images typically produce fairly random outputs loosely correlated to a prompt. So these images don't seem surprising. And the chat text sounds like typical llm output.
  Isn't this expected? So far my ai expectation is that I should just reprompt until I am happy with the results, or give up if I notice I have strayed too far away from the training set. Either way I am the judge.
  
  In conversation about 5 months ago permalink
- Embed this notice
  ✧✦Catherine✦✧ (whitequark@mastodon.social)'s status on Sunday, 24-Aug-2025 21:58:25 JST ✧✦Catherine✦✧
  in reply to
  - poleguy looking for lost tools
  @poleguy i did not expect "highlight something on the image" produce a completely new fucking image
  
  In conversation about 5 months ago permalink
- Embed this notice
  poleguy looking for lost tools (poleguy@mastodon.social)'s status on Monday, 25-Aug-2025 00:07:48 JST poleguy looking for lost tools
  in reply to
  
  @whitequark one trouble with the interface is that it is not clear if "highlight" is a magical first class feature that's added as actual coding on top of the llm and gan stuff or if it is just part of the llm and gan prompt engineering. It seems like it just modified the prompt so you should expect a whole new image?
  
  In conversation about 5 months ago permalink
- Embed this notice
  ✧✦Catherine✦✧ (whitequark@mastodon.social)'s status on Monday, 25-Aug-2025 00:18:14 JST ✧✦Catherine✦✧
  in reply to
  - poleguy looking for lost tools
  @poleguy the first image is something that i uploaded. it's not generated
  
  In conversation about 5 months ago permalink
- Embed this notice
  ✧✦Catherine✦✧ (whitequark@mastodon.social)'s status on Monday, 25-Aug-2025 00:22:34 JST ✧✦Catherine✦✧
  in reply to
  - F4GRX SÃ©bastien
  @f4grx the generated highlight to be overlaid on top of the image i uploaded
  
  In conversation about 5 months ago permalink
- Embed this notice
  F4GRX SÃ©bastien (f4grx@chaos.social)'s status on Monday, 25-Aug-2025 00:22:36 JST F4GRX SÃ©bastien
  in reply to
  
  @whitequark I dont know what you were expecting, tbh :)
  
  In conversation about 5 months ago permalink
- Embed this notice
  crzwdjk ✅ (crzwdjk@mastodon.social)'s status on Monday, 25-Aug-2025 00:53:53 JST crzwdjk ✅
  in reply to
  
  @whitequark Seems like the thing that these models are most confidently and dangerously wrong about is their own capabilities and limitations. Which makes sense, data about that is almost entirely absent from the training set. Would you have thought to ask it to highlight if it didn't suggest it?
  
  In conversation about 5 months ago permalink
- Embed this notice
  poleguy looking for lost tools (poleguy@mastodon.social)'s status on Monday, 25-Aug-2025 00:55:31 JST poleguy looking for lost tools
  in reply to
  
  @whitequark that is interesting, and potentially could lead you to not notice an important change because we are biased to see what we expect. But unless they leaned on a non ai algorithm to highlight, this all sounds "about right" for ai.
  
  In conversation about 5 months ago permalink
- Embed this notice
  poleguy looking for lost tools (poleguy@mastodon.social)'s status on Monday, 25-Aug-2025 00:58:50 JST poleguy looking for lost tools
  in reply to
  
  @whitequark have you seen the "remove the electric wires from the photo of a house" example?
  
  In conversation about 5 months ago permalink
- Embed this notice
  ✧✦Catherine✦✧ (whitequark@mastodon.social)'s status on Monday, 25-Aug-2025 02:27:19 JST ✧✦Catherine✦✧
  in reply to
  - poleguy looking for lost tools
  @poleguy nope
  
  In conversation about 5 months ago permalink
- Embed this notice
  ✧✦Catherine✦✧ (whitequark@mastodon.social)'s status on Monday, 25-Aug-2025 02:27:20 JST ✧✦Catherine✦✧
  in reply to
  - crzwdjk ✅
  @crzwdjk yes
  
  In conversation about 5 months ago permalink
- Embed this notice
  ✧✦Catherine✦✧ (whitequark@mastodon.social)'s status on Monday, 25-Aug-2025 02:30:34 JST ✧✦Catherine✦✧
  in reply to
  - F4GRX SÃ©bastien
  @f4grx there are absolutely models that use "system commands" of sorts (this is how they're able to run `rm / -rf` on your system!), and i figured that if the LLM knows how to run the diffusion model, it might also know how to do an overlay instead of regenerating the entire image--indeed i didn't expect it to regenerate the entire image because the image is _huge_ and this can't possibly be cheap for openai
  
  In conversation about 5 months ago permalink
- Embed this notice
  F4GRX SÃ©bastien (f4grx@chaos.social)'s status on Monday, 25-Aug-2025 02:30:36 JST F4GRX SÃ©bastien
  in reply to
  
  @whitequark from the mental image I have of LLMs, this sounds impossible to do, this request is much too deterministic and does not fit in any kind of task that can be obtained through random processes.
  
  In conversation about 5 months ago permalink
- Embed this notice
  Glyph (glyph@mastodon.social)'s status on Monday, 25-Aug-2025 05:34:51 JST Glyph
  in reply to
  
  @whitequark for someone who knows zero about electronics (i.e. I don't know what a "differential pair" is) would you mind explaining why this is particularly bad
  
  In conversation about 5 months ago permalink
- Embed this notice
  ✧✦Catherine✦✧ (whitequark@mastodon.social)'s status on Monday, 25-Aug-2025 05:56:33 JST ✧✦Catherine✦✧
  in reply to
  - Glyph
  @glyph
  - the "actively evil" remark is because it presents an image transformed into latent space and back (with completely wrong, horribly nonsensical but _just_ plausible enough if you don't squint details) as "result of highlighting"
  - diffpairs are these wigglies (left); they are a dead tell for PCI Express (right); i asked chatgpt to identify the standard (Legacy PCI or PCI Express) from the image because someone used the image in a news article about nvidia...
  In conversation about 5 months ago permalink
  Attachments
  1. Untitled attachment
    https://files.mastodon.social/media_attachments/files/115/085/730/356/520/041/original/31d98748c3cc141a.png
  2. Untitled attachment
    https://files.mastodon.social/media_attachments/files/115/085/732/633/786/046/original/30456844bb5592dc.png
- Embed this notice
  ✧✦Catherine✦✧ (whitequark@mastodon.social)'s status on Monday, 25-Aug-2025 05:57:07 JST ✧✦Catherine✦✧
  in reply to
  - Glyph
  @glyph and I thought it looks way too outdated and wrong in proportions, pin count, etc for it to be anything made in the last 15 years, and wondered if the models will be able to identify this incredibly, blatantly, unmistakably to a moderately trained eye difference or not
  
  In conversation about 5 months ago permalink
- Embed this notice
  ✧✦Catherine✦✧ (whitequark@mastodon.social)'s status on Monday, 25-Aug-2025 05:59:49 JST ✧✦Catherine✦✧
  in reply to
  - Glyph
  @glyph there were other tells that this is a PCI card (the notch is on the wrong side, the pin count is wrong, there's no 'bulge' on the bracket side, the power/ground connections are in the wrong places, there's another parallel bus on it, the overall PCB layout and fabrication looked anachronistic) but the presence/absence of diffpairs is a 100% sensitivity, 100% specificity test for distinguishing PCI vs PCI Express
  
  In conversation about 5 months ago permalink
- Embed this notice
  ✧✦Catherine✦✧ (whitequark@mastodon.social)'s status on Monday, 25-Aug-2025 06:01:27 JST ✧✦Catherine✦✧
  in reply to
  - Glyph
  @glyph the other differences were either probabilistic/speculative (eg PCB layout style or guesses about their power distribution) or required me to go and look it up (pin counts, notch locations), which... a model "should" still be able to do better than i do, so i focused on the one where there is no ambiguity whatsoever and we're on even ground
  
  In conversation about 5 months ago permalink
- Embed this notice
  Brad (bk1e@mastodon.social)'s status on Monday, 25-Aug-2025 06:06:11 JST Brad
  in reply to
  
  @whitequark It even replaced the blurred Realtek chip in the background, and some of the other components.
  
  In conversation about 5 months ago permalink
- Embed this notice
  ✧✦Catherine✦✧ (whitequark@mastodon.social)'s status on Monday, 25-Aug-2025 06:07:37 JST ✧✦Catherine✦✧
  in reply to
  - Glyph
  @glyph oh, i should explain what makes something a differential pair vs how it looks. a differential pair is a coupled pair of microstrip lines (a type of waveguide, similar in concept to coaxial cables you're familiar with) that are routed at a fixed, precisely controlled distance from each other, and within a low, precisely controlled length difference (these wavy patterns on them keep this difference small)
  
  In conversation about 5 months ago permalink
- Embed this notice
  ✧✦Catherine✦✧ (whitequark@mastodon.social)'s status on Monday, 25-Aug-2025 06:08:00 JST ✧✦Catherine✦✧
  in reply to
  - Glyph
  @glyph diffpairs have given us the last ~25 years of gains in peripheral interface throughput, have dedicated, highly specific tools for making them in PCB layout software, are one of the most distinctive and emphasized visual structures on a PCB, and are described in an unimaginable amount of training material
  you would expect any of this to matter
  
  In conversation about 5 months ago permalink
- Embed this notice
  ✧✦Catherine✦✧ (whitequark@mastodon.social)'s status on Monday, 25-Aug-2025 06:09:52 JST ✧✦Catherine✦✧
  in reply to
  - Brad
  @bk1e and did a sharpen pass over the nvidia logo and lettering while keeping the background blurred
  
  In conversation about 5 months ago permalink
- Embed this notice
  Glyph (glyph@mastodon.social)'s status on Monday, 25-Aug-2025 10:11:23 JST Glyph
  in reply to
  
  @whitequark thanks so much for the thorough explanation. Feeling very smart / pleased with myself for correctly anticipating as I was reading why a trace might have little asymmetrical bumps like that :)
  
  In conversation about 5 months ago permalink
- Embed this notice
  poleguy looking for lost tools (poleguy@mastodon.social)'s status on Tuesday, 26-Aug-2025 10:25:19 JST poleguy looking for lost tools
  in reply to
  
  @whitequark I can't find it in my history. Summary: the ai feature removed electric wires but also changed the house from a duplex to a single family and made many other changes. The before and after results looked like a "find the differences between the two pictures" game from a kids magazine.
  If you have to play that game after each prompt you are better removing the wires by hand in Krita. Even if you don't know how you would be better off skilling up in Krita than prompt engineering.
  
  In conversation about 5 months ago permalink

Public

Conversation

Notices

Feeds