How does Stable Diffusion 1.5 know how to create CSAM? It turns out the model was trained on thousands of illegal images contained in the extremely popular LAION-5B image dataset.
I’m so incredibly proud of my friend and colleague @det
Story:
https://www.404media.co/laion-datasets-removed-stanford-csam-child-abuse/
Paper:
https://stacks.stanford.edu/file/druid:kh752sm9123/ml_training_data_csam_report-2023-12-20.pdf