while evaluating "AI" for something, i found a new, atrocious failure mode. it goes like this:
me: [image] are there differential pairs on this image?
chatgpt: yes. [confident-sounding fluff about diffpairs] should i highlight them?
me: yes
chatgpt: [a similar, quite high-resolution, but _completely brand new_ image with some random-ass highlight, still not featuring any differential pairs]