Did you see this? The whole thing with "the stack".
https://post.lurk.org/@emenel/112111014479288871
Some jerks mass-scraped open source projects and put them in a collection called "the stack", which they specifically recommend other people use as machine learning sources. If you look at their "GitHub opt-out repository" you'll find just page after page of people asking to have their stuff removed:
https://github.com/bigcode-project/opt-out-v2/issues
(1/2)