GNU social JP
GNU social JP is a Japanese GNU social server.

Notices by haihappen (haihappen@social.anoxinon.de)

  1. haihappen (haihappen@social.anoxinon.de), Thursday, 19-Jun-2025 19:25:45 JST, in reply to Charlie Stross
    @cstross Jamie Zawinski runs an LLM scraper honeypot: https://www.jwz.org/blog/2025/01/exterminate-all-rational-ai-scrapers/
    He notes that, as of 7 days ago, it now serves 25% of URLs. I did not check whether he explains how he implemented the honeypot or how much manual fiddling is involved. However, he links to the random text generator he wrote and uses.

    In conversation about 8 days ago from social.anoxinon.de

    Attachments

    1. Exterminate all rational AI scrapers (www.jwz.org)
      Today I added an infinite-nonsense honeypot to my web site just to fuck with LLM scrapers, based on a "spicy autocomplete" program I wrote about 30 years ago. Well-behaved web crawlers will ignore it, but those "AI" people.... well, you know how they are. I'm intentionally not linking to it here, but I'll bet you can find it pretty easily. It's kinda funny.

GNU social JP is a social network, courtesy of the GNU social JP administrator (GNU social JP管理人). It runs on GNU social, version 2.0.2-dev, available under the GNU Affero General Public License.

All GNU social JP content and data are available under the Creative Commons Attribution 3.0 license.