I can’t remember if I posted this here before, but in case I didn’t:
LLMs are the new memory-safety bugs.
The reason that memory-safety bugs are so bad is not that they’re common (they are, but they’d still be bad if we fixed 90% of them), it’s that they step outside of the language’s abstract machine. When a memory-safety bug occurs, the program will do something completely unpredictable. You can’t reason at the source level about what will happen: some piece of unrelated state will be modified, or used as input to some calculation.
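A minimal sketch of what I mean, with a made-up struct (nothing here is from a real codebase): an out-of-bounds write into one field silently corrupts an unrelated field that happens to sit next to it in memory. Nothing in the source says these two variables interact, so no amount of source-level reasoning predicts the result.

    #include <stdio.h>
    #include <string.h>

    struct session {
        char name[8];
        int  is_admin;   /* unrelated state, adjacent in memory */
    };

    int main(void) {
        struct session s = { .name = "", .is_admin = 0 };
        /* 11 characters plus the NUL overflow the 8-byte buffer; the
         * extra bytes land in is_admin. The abstract machine calls this
         * undefined behaviour, so *any* outcome is permitted. */
        strcpy(s.name, "AAAAAAAAAAA");
        printf("is_admin = %d\n", s.is_admin);  /* typically nonzero now */
        return 0;
    }

On a typical implementation the flag flips even though no line of code ever assigns to it, which is exactly the “unrelated state gets modified” failure mode.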
This is how LLMs work by design. This is not a bug. They arrange data in an n-dimensional latent space and will give outputs that are nearby in that space, but you have no way of articulating the shape of that space in anything that resembles source code. If you ask a question about a topic, the latent space may contain nearby replies that include knowledge of that topic, or the gaps may have been painted in with something totally unrelated.
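To make the analogy concrete, here is a toy sketch (hypothetical 3-dimensional “embeddings” with hand-picked values, not how any real model is built): the lookup always returns whichever stored reply is nearest to the query, whether or not the query’s topic is actually covered, and nothing in the source tells you where the gaps are.

    #include <math.h>
    #include <stdio.h>

    #define DIM 3
    #define N_ENTRIES 3

    /* Stand-ins for points in the latent space. */
    static const double embeddings[N_ENTRIES][DIM] = {
        {0.9, 0.1, 0.0},   /* reply about C compilers    */
        {0.1, 0.9, 0.0},   /* reply about French cooking */
        {0.0, 0.1, 0.9},   /* reply about tide tables    */
    };
    static const char *replies[N_ENTRIES] = {
        "Use -fsanitize=address to catch it.",
        "Whisk the butter in slowly.",
        "High tide is around 14:30.",
    };

    static double cosine(const double *a, const double *b) {
        double dot = 0, na = 0, nb = 0;
        for (int i = 0; i < DIM; i++) {
            dot += a[i] * b[i];
            na  += a[i] * a[i];
            nb  += b[i] * b[i];
        }
        return dot / (sqrt(na) * sqrt(nb));
    }

    int main(void) {
        /* A query about a topic none of the entries cover: the lookup
         * still picks the nearest neighbour and answers confidently. */
        double query[DIM] = {0.5, 0.4, 0.3};
        int best = 0;
        double best_sim = -1.0;
        for (int i = 0; i < N_ENTRIES; i++) {
            double s = cosine(query, embeddings[i]);
            if (s > best_sim) { best_sim = s; best = i; }
        }
        printf("nearest reply: %s (similarity %.2f)\n",
               replies[best], best_sim);
        return 0;
    }

The point isn’t the arithmetic, it’s that “nearest in the space” is the only guarantee you get, and you can’t inspect the space to know whether nearest means relevant.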