GNU social JP
  • FAQ
  • Login
GNU social JPは日本のGNU socialサーバーです。
Usage/ToS/admin/test/Pleroma FE
  • Public

    • Public
    • Network
    • Groups
    • Featured
    • Popular
    • People

Conversation

Notices

  1. Embed this notice
    jwz (jwz@mastodon.social)'s status on Thursday, 16-Jan-2025 13:56:23 JST jwz jwz

    Exterminate all rational AI scrapers.

    Today I added an infinite-nonsense honeypot to my web site just to fuck with LLM scrapers, based on a "spicy autocomplete" program I wrote about 30 years ago. Well-behaved web crawlers will ignore it, but those "AI" people.... well, you know how they are.

    I'm intentionally not linking to it here, but I'll bet you can find it pretty easily. It's kinda funny.
    https://jwz.org/b/ykgX

    In conversation about 4 months ago from mastodon.social permalink

    Attachments

    1. Domain not in remote thumbnail source whitelist: www.jwz.org
      Exterminate all rational AI scrapers
      Today I added an infinite-nonsense honeypot to my web site just to fuck with LLM scrapers, based on a "spicy autocomplete" program I wrote about 30 years ago. Well-behaved web crawlers will ignore it, but those "AI" people.... well, you know how they are. I'm intentionally not linking to it here, but I'll bet you can find it pretty easily. It's kinda funny.
    • Anil Dash repeated this.
    • Embed this notice
      jwz (jwz@mastodon.social)'s status on Friday, 17-Jan-2025 06:14:36 JST jwz jwz
      in reply to

      What's the most zipbomb-like JPEG or PNG that can be constructed? Small file size, plausible dimensions, but massive memory footprint?

      In conversation about 4 months ago permalink
    • Embed this notice
      mhoye (mhoye@mastodon.social)'s status on Friday, 17-Jan-2025 06:19:06 JST mhoye mhoye
      in reply to

      @jwz This guy's been working on it for a long time: https://www.bamsoftware.com/hacks/deflate.html

      The current best looks to be "420 bytes of PNG expand to represent 50 gigapixels and 141 GB in memory".

      In conversation about 4 months ago permalink

      Attachments


      Haelwenn /элвэн/ :triskell: likes this.
    • Embed this notice
      marcusb (marcusb@mastodon.sdf.org)'s status on Friday, 17-Jan-2025 07:51:34 JST marcusb marcusb
      • Infoseepage

      @Infoseepage @jwz there are several other tools with similar goals. I wrote one with static sites in mind (no server-side dependencies, all non-sense pre-generated. https://marcusb.org/hacks/quixotic.html). Others I know about are: https://github.com/Fingel/django-llm-poison, https://codeberg.org/MikeCoats/poison-the-wellms, https://codeberg.org/timmc/marko/, https://zadzmo.org/code/nepenthes/, and https://github.com/earthboundkid/heffalump.

      In conversation about 4 months ago permalink

      Attachments

      1. No result found on File_thumbnail lookup.
        ZADZMO code
        from https://zadzmo.org/humans.txt
      2. Domain not in remote thumbnail source whitelist: marcusb.org
        Quixotic
        Quixotic is a nonsense generator designed to help static website operators confuse and confound bots and content-stealing LLM scrapers.
      3. Domain not in remote thumbnail source whitelist: opengraph.githubassets.com
        GitHub - Fingel/django-llm-poison: A django app that poisons content when served to AI bots. ☠️
        A django app that poisons content when served to AI bots. ☠️ - Fingel/django-llm-poison
      4. Domain not in remote thumbnail source whitelist: codeberg.org
        poison-the-wellms
        from MikeCoats
        A reverse-proxy that serves diassociated-press style reimaginings of your upstream pages, poisoning any LLMs that scrape your content.
      5. Domain not in remote thumbnail source whitelist: codeberg.org
        marko
        from timmc
        marko
      6. Domain not in remote thumbnail source whitelist: opengraph.githubassets.com
        GitHub - earthboundkid/heffalump: Heffalump is an endless honeypot
        Heffalump is an endless honeypot. Contribute to earthboundkid/heffalump development by creating an account on GitHub.
    • Embed this notice
      tom jennings (tomjennings@tldr.nettime.org)'s status on Saturday, 18-Jan-2025 05:26:10 JST tom jennings tom jennings
      in reply to
      • mhoye

      @mhoye @jwz

      For eg corp social media site avatar image upload, there's usually a maximum file size limit (in addition to pixel). For that case you'd want to bound file size, so the question would be, what's the largest image compressible into eg. 16KB?

      In conversation about 4 months ago permalink
      Haelwenn /элвэн/ :triskell: likes this.
    • Embed this notice
      ortiz0852 (ortiz0852@mastodon.social)'s status on Saturday, 18-Jan-2025 08:03:10 JST ortiz0852 ortiz0852
      in reply to

      @jwz Hello dear happy new year 2025 will be your best💯🥳 year connect with Mrs William Sarah 👇👇👇👇👇👇👇👇👇👇

      https://www.facebook.com/Williamsarahfx12

      You'll definitely* thank me later .

      In conversation about 4 months ago permalink
    • Embed this notice
      jwz (jwz@mastodon.social)'s status on Saturday, 17-May-2025 02:10:59 JST jwz jwz
      in reply to

      Happy to be serving AI bots such URLs as:

      the/virtuoso/righteous/who/to/a/the/he/
      topanga/what/non/been/emotions/thereby/to/most/implementado/
      black/layer/in/this/why/read/thrush/earths/foran/
      also/its/chase/cremation/a/connection/shamelessly/page/
      space/instead/putt/they/
      seaming/are/demonstrates/mixers/dresses/fillmore/cambron/
      seaming/rational/is/ministry/
      perron/packard/apr/piece/were/will/i/mother/
      fixes/in/should/each/period/
      peached/supprised/all/so/a/onecomplete/connection/affleurant/

      In conversation about 2 days ago permalink

Feeds

  • Activity Streams
  • RSS 2.0
  • Atom
  • Help
  • About
  • FAQ
  • TOS
  • Privacy
  • Source
  • Version
  • Contact

GNU social JP is a social network, courtesy of GNU social JP管理人. It runs on GNU social, version 2.0.2-dev, available under the GNU Affero General Public License.

Creative Commons Attribution 3.0 All GNU social JP content and data are available under the Creative Commons Attribution 3.0 license.