“Now that the seal is broken on scraping #Bluesky posts into datasets for machine learning, people are trolling users and one-upping each other by making increasingly massive datasets of non-anonymized, full-text Bluesky posts taken directly from the social media platform’s public firehose — including one that contains almost 300 million posts...”
#privacy #bigData #xitter #yourNameHere #infosec
https://www.404media.co/bluesky-posts-machine-learning-ai-datasets-hugging-face/