GNU social JP
GNU social JP is a Japanese GNU social server.

    pistolero (p@fsebugoutzone.org)'s status on Friday, 04-Apr-2025 12:41:40 JST
    in reply to
    • Ignas Kiela
    @ignaloidas

    > they had a bunch of problems with dumb llm scrapers

    So have I. I've just kicked Amazon, Facebook, and Claudebot out of Fedilist. FSE has gotten DDoS'd and scraped and whatnot, sometimes by companies that went to lengths to conceal the scraping, once by a guy that was renting a bucket of machines from a massive cluster UU owns, etc. Unlike SourceHut, FSE doesn't generate revenue, and I still managed to solve this problem without interstitial pages or draining anyone's battery. (I'm a professional.)

    > Reportedly, they scraped every single git blame, for some fucking reason.

    Yeah, they do this with cgit, too. I ran into it when Google was doing it, but since then, Amazon/Facebook/Claudebot/etc. have been through: https://fsebugoutzone.org/notice/AqxQDFgjbxsTzXAwFM (Facebook uses a unique UA for its AI scraper versus its general-purpose crawler.)
    In conversation, about 2 months ago, from fsebugoutzone.org

    Attachments

    1. pistolero (@p@fsebugoutzone.org): “@amerika You know what those stupid LLM scrapers *love*? cgit. If you can afford the disk and the cycles on a VM, they will read the diffs between every two commits on every repo. Did you know t...”

GNU social JP is a social network, courtesy of GNU social JP管理人. It runs on GNU social, version 2.0.2-dev, available under the GNU Affero General Public License.

All GNU social JP content and data are available under the Creative Commons Attribution 3.0 license.