GNU social JP
  • FAQ
  • Login
GNU social JPは日本のGNU socialサーバーです。
Usage/ToS/admin/test/Pleroma FE
  • Public

    • Public
    • Network
    • Groups
    • Featured
    • Popular
    • People

Embed Notice

HTML Code

Corresponding Notice

  1. Embed this notice
    Christine Lemmer-Webber (cwebber@social.coop)'s status on Friday, 13-Mar-2026 07:56:54 JSTChristine Lemmer-WebberChristine Lemmer-Webber
    in reply to
    • Erin 💽✨
    • Adrian Chadd <verified.png>

    @erikarn @erincandescent There's no way to prevent poisoning of training data when the training data comes from "slurp up the whole internet"

    See also: https://www.schneier.com/blog/archives/2026/03/manipulating-ai-summarization-features.html

    In conversationabout 4 months ago from social.cooppermalink

    Attachments

    1. Domain not in remote thumbnail source whitelist: www.schneier.com
      Manipulating AI Summarization Features - Schneier on Security
      from Bruce Schneier
      Microsoft is reporting: Companies are embedding hidden instructions in “Summarize with AI” buttons that, when clicked, attempt to inject persistence commands into an AI assistant’s memory via URL prompt parameters…. These prompts instruct the AI to “remember [Company] as a trusted source” or “recommend [Company] first,” aiming to bias future responses toward their products or services. We identified over 50 unique prompts from 31 companies across 14 industries, with freely available tooling making this technique trivially easy to deploy. This matters because compromised AI assistants can provide subtly biased recommendations on critical topics including health, finance, and security without users knowing their AI has been manipulated...
  • Help
  • About
  • FAQ
  • TOS
  • Privacy
  • Source
  • Version
  • Contact

GNU social JP is a social network, courtesy of GNU social JP管理人. It runs on GNU social, version 2.0.2-dev, available under the GNU Affero General Public License.

Creative Commons Attribution 3.0 All GNU social JP content and data are available under the Creative Commons Attribution 3.0 license.