Public
- Public
- Network
- Groups
- Featured
- Popular
- People

Conversation

Notices

Embed this notice
Alex Stamos (alex@cybervillains.com)'s status on Thursday, 21-Dec-2023 11:32:04 JST Alex Stamos
in reply to
- David Thiel
Lots of people have worried about CSAM in training sets, including the LAION team themselves, but David actually created a novel mechanism to detect it.
Hopefully this will change the way these training sets are created in the future.
@det
In conversation Thursday, 21-Dec-2023 11:32:04 JST from cybervillains.com permalink
Attachments
1. Untitled attachment
  https://assets.cybervillains.com/media_attachments/files/111/613/571/450/087/116/original/18359c48832f490d.png
- Embed this notice
  Alex Stamos (alex@cybervillains.com)'s status on Thursday, 21-Dec-2023 11:32:05 JST Alex Stamos
  - David Thiel
  How does Stable Diffusion 1.5 know how to create CSAM? It turns out it was trained on thousands of illegal images contained in the extremely popular LAION-5B image set.
  I’m so incredibly proud of my friend and colleague @det
  Story:
  https://www.404media.co/laion-datasets-removed-stanford-csam-child-abuse/
  Paper:
  https://stacks.stanford.edu/file/druid:kh752sm9123/ml_training_data_csam_report-2023-12-20.pdf
  In conversation Thursday, 21-Dec-2023 11:32:05 JST permalink
  Attachments
  1. Domain not in remote thumbnail source whitelist: www.404media.co
    
    Largest Dataset Powering AI Images Removed After Discovery of Child Sexual Abuse Material
    
    from @samleecole
    
    The model is a massive part of the AI-ecosystem, used by Google and Stable Diffusion. The removal follows discoveries made by Stanford researchers, who found thousands instances of suspected child sexual abuse material in the dataset.
  2. Untitled attachment
  Mike McCue repeated this.

Public

Conversation

Notices

Feeds