> they had a bunch of problems with dumb llm scrapers
So have I. I've just kicked Amazon and Facebook and Claudebot out of Fedilist, FSE has gotten DDoS'd and scraped and whatnot, sometimes by companies that went to lengths to conceal the scraping, once by a guy that was renting a bucket of machines from a massive cluster UU owns, etc. Unlike SourceHut, FSE doesn't generate revenue, and I still managed to solve this problem without interstitial pages or draining anyone's battery. (I'm a professional.)
> Reportedly, they scraped every single git blame, for some fucking reason.
Yeah, they do this with cgit, too. I ran into it when Google was doing it, but since then, Amazon/Facebook/Claudebot/etc. have been through: https://fsebugoutzone.org/notice/AqxQDFgjbxsTzXAwFM . (Facebook uses a unique UA for its AI-scraper versus its general-purpose crawler.)
@p@fsebugoutzone.org what they were seeing was a bunch of residential IPs doing that shit, with each IP only sending a single request and stuff. Hard stuff to block.
> what they were seeing was a bunch of residential IPs doing that shit,
When I say I have seen it, that's what I have seen. That's what Boardreader was doing and I still managed to hose Boardreader until I could get them to back off.