Embed Notice

HTML Code

<blockquote style="position: relative; padding-left: 55px;"><section><a href="https://qoto.org/users/lupyuen/statuses/112204855053142607">Lup Yuen Lee 李立源 (lupyuen@qoto.org)'s status on Wednesday, 03-Apr-2024 11:23:25 JST</a><a href="https://qoto.org/@lupyuen" title="lupyuen@qoto.org"><img src="https://gnusocial.jp/avatar/4878-48-20220812234138.webp" width="48" height="48" alt="Lup Yuen Lee 李立源" style="position: absolute; left: 0; top: 0;">Lup Yuen Lee 李立源</a></section><article><p>"a Large Language Model (<a href="https://qoto.org/tags/LLM" rel="tag">#LLM</a>) can be convinced to tell you how to build a bomb if you prime it with a few dozen less-harmful questions first"</p><p><a href="https://techcrunch.com/2024/04/02/anthropic-researchers-wear-down-ai-ethics-with-repeated-questions/" rel="nofollow noreferrer">https://techcrunch.com/2024/04/02/anthropic-researchers-wear-down-ai-ethics-with-repeated-questions/</a></p></article><footer><a rel="bookmark" href="https://gnusocial.jp/conversation/2893914#notice-5751978">In conversation</a><time datetime="2024-04-03T11:23:25+09:00" title="Wednesday, 03-Apr-2024 11:23:25 JST">about a year ago</time> <span>from <span><a href="https://qoto.org/@lupyuen/112204855053142607" rel="external" title="Sent from qoto.org via ActivityPub">qoto.org</a></span></span><a href="https://qoto.org/@lupyuen/112204855053142607">permalink</a><h4>Attachments</h4><ol><li><article><header><div>Domain not in remote thumbnail source whitelist: techcrunch.com</div><h5><a href="https://techcrunch.com/2024/04/02/anthropic-researchers-wear-down-ai-ethics-with-repeated-questions/">Anthropic researchers wear down AI ethics with repeated questions | TechCrunch</a></h5><div> from <span>Devin Coldewey</span></div></header><div>How do you get an AI to answer a question it's not supposed to? There are many such "jailbreak" techniques, and Anthropic researchers just found a new</div><footer></footer></article></li></ol></footer></blockquote>

Corresponding Notice

Embed this notice
Lup Yuen Lee 李立源 (lupyuen@qoto.org)'s status on Wednesday, 03-Apr-2024 11:23:25 JST Lup Yuen Lee 李立源
"a Large Language Model (#LLM) can be convinced to tell you how to build a bomb if you prime it with a few dozen less-harmful questions first"
https://techcrunch.com/2024/04/02/anthropic-researchers-wear-down-ai-ethics-with-repeated-questions/
In conversationabout a year ago from qoto.orgpermalink
Attachments
1. Domain not in remote thumbnail source whitelist: techcrunch.com
  Anthropic researchers wear down AI ethics with repeated questions | TechCrunch
  from Devin Coldewey
  How do you get an AI to answer a question it's not supposed to? There are many such "jailbreak" techniques, and Anthropic researchers just found a new

Public

Embed Notice

HTML Code

Corresponding Notice