@tchambers OK, "straightforward" might have been a bit of an exaggeration. One approach would be to extend nodeinfo with this information, or to check the ToS and privacy policy as part of federation requests, and reject ones that don't fit the bill.
But even without ToS changes, since as we agree the legal situation is murky, given how much regulatory pressure Meta's under, there's significant legal risk for them to scrape fediverse sites -- and the gain is relatively small.