they claim "Zero exposure to original source" but surely all popular Open Source projects have been read and parsed by every LLM in existence many times over
@bagder So this is logical conclusion of the python chardet library re-license.
Getting a court to rule is probably the only way to get a answer one way or another, but getting the LLMs to say definitively what is in the training set will probably be hard (claimed trade secrets).