GNU social JP
  • FAQ
  • Login
GNU social JPは日本のGNU socialサーバーです。
Usage/ToS/admin/test/Pleroma FE
  • Public

    • Public
    • Network
    • Groups
    • Featured
    • Popular
    • People

Conversation

Notices

  1. Embed this notice
    Ars Technica (arstechnica@mastodon.social)'s status on Tuesday, 20-May-2025 10:09:20 JST Ars Technica Ars Technica

    AI haters build tarpits to trap and trick AI scrapers that ignore robots.txt
    Attackers explain how an anti-spam defense became an AI weapon.
    https://arstechnica.com/tech-policy/2025/01/ai-haters-build-tarpits-to-trap-and-trick-ai-scrapers-that-ignore-robots-txt/?utm_brand=arstechnica&utm_social-type=owned&utm_source=mastodon&utm_medium=social

    In conversation about 4 days ago from mastodon.social permalink

    Attachments


    1. https://files.mastodon.social/media_attachments/files/113/908/061/719/390/733/original/94502b2d0abbab47.jpg
    2. Domain not in remote thumbnail source whitelist: cdn.arstechnica.net
      AI haters build tarpits to trap and trick AI scrapers that ignore robots.txt
      Attackers explain how an anti-spam defense became an AI weapon.
    • clacke@libranet.de is my main likes this.
    • Embed this notice
      Leszek (lpryszcz@genomic.social)'s status on Tuesday, 20-May-2025 10:09:58 JST Leszek Leszek
      in reply to

      @arstechnica I enabled nephentes 2 weeks ago and added robots.txt after a week.
      #Amazon bot continues to crawl, ignoring robots.txt, currently over 200k bullshit pages. Other bots as well, but to smaller extent.
      #OpenAI bot appeared, ignored robots.txt, but stopped after a few thousands requests, so I guess they indeed actively recognise tarpits. Or at least the one I use.

      In conversation about 4 days ago permalink
      clacke@libranet.de is my main likes this.
    • Embed this notice
      Kevin Freitas (kevinfreitas@mastodon.social)'s status on Tuesday, 20-May-2025 10:10:08 JST Kevin Freitas Kevin Freitas
      in reply to

      @arstechnica For y'all's consideration: my simple wordpress AI poison plugin.

      https://kevinfreitas.net/tools-experiments/

      #AI #LLM #GPT

      In conversation about 4 days ago permalink
      clacke@libranet.de is my main likes this.
    • Embed this notice
      Aleksandra Fedorova :fedora: (bookwar@fosstodon.org)'s status on Tuesday, 20-May-2025 22:00:52 JST Aleksandra Fedorova :fedora: Aleksandra Fedorova :fedora:
      in reply to

      @arstechnica

      > AI haters build tarpits to trap and trick AI scrapers...
      > AI haters

      The words you looked for are "AI-survivors".

      In conversation about 3 days ago permalink
      clacke likes this.
    • Embed this notice
      clacke@libranet.de is my main (notclacke@fedia.social)'s status on Wednesday, 21-May-2025 10:17:11 JST clacke@libranet.de is my main clacke@libranet.de is my main
      in reply to

      A comment on the internet said: "I'm not much if an IT person. Could someone explain this in nautical terms?"

      I had to respond, of course, having once been a sailor myself.

      In conversation about 3 days ago permalink
    • Embed this notice
      clacke@libranet.de is my main (notclacke@fedia.social)'s status on Wednesday, 21-May-2025 10:17:39 JST clacke@libranet.de is my main clacke@libranet.de is my main
      in reply to

      Aye, a tarrrr pit is one from which you don't get very farrrr, because once you realize where you arrrr, you're already stuck in the tarrr. It's a sirrren's call that promises you a golden dorrr, but it's all just a lurrrre, and not what you were looking forrrr. It's a bait and a hook with the promise of treasure, but inside each chest is just another chest, it's a cruise that is also a test, and if you fail the test you'll never rest, you'll be haunting that pit forever morrre, like the Flying Dutchman, never returning to shorrre.

      In conversation about 3 days ago permalink
      Linux Walt (@lnxw37j1) {3EB165E0-5BB1-45D2-9E7D-93B31821F864} likes this.

Feeds

  • Activity Streams
  • RSS 2.0
  • Atom
  • Help
  • About
  • FAQ
  • TOS
  • Privacy
  • Source
  • Version
  • Contact

GNU social JP is a social network, courtesy of GNU social JP管理人. It runs on GNU social, version 2.0.2-dev, available under the GNU Affero General Public License.

Creative Commons Attribution 3.0 All GNU social JP content and data are available under the Creative Commons Attribution 3.0 license.