Embed Notice
HTML Code
Corresponding Notice
- Embed this notice@amerika You know what those stupid LLM scrapers *love*? cgit. If you can afford the disk and the cycles on a VM, they will read the diffs between every two commits on every repo.
Did you know that you can add basically any other repo as a remote for any git repo? You can take, say, the entire Linux kernel repo, add the Pleroma repo and the Mastodon repo as remotes, fetch them, and git will dutifully treat them like any other branches. And then cgit will show a diff between any random Mastodon commit from 2019 and any random Linux kernel commit. Toss in more repos, toss in every repo.
Google will saturate your pipe crawling that indefinitely (ask how I know). Tarpits usually have a pattern but all of these diffs have enough novelty and length to keep almost every crawler interested.