@amerika You know what those stupid LLM scrapers *love*? cgit. If you can afford the disk and the cycles on a VM, they will read the diffs between every two commits on every repo.
Did you know that you can add basically any other repo as a remote for any git repo? You can take, say, the entire Linux kernel repo, add the Pleroma repo and the Mastodon repo as remotes, fetch them, and git will dutifully treat them like any other branches. And then cgit will show a diff between any random Mastodon commit from 2019 and any random Linux kernel commit. Toss in more repos, toss in every repo.
Google will saturate your pipe crawling that indefinitely (ask how I know). Tarpits usually have a pattern but all of these diffs have enough novelty and length to keep almost every crawler interested.
@p Haha, even more makes me wish stagit would do branches now, would be really fun to have a tarpit repo that's basically all the various repos I have checked out (and I think with some hardlinks/reflinks the cost of that could be almost zero).