@thomasfuchs Serious question: how do we support smaller companies or individuals training LLMs (e.g. Open Source LLMs) if we have strict copyright enforcement and licensing? There are precious few up-to-date training data sets that are under an Open Source license or in the Public Domain.
Enforcing copyright and licensing for training data will 100% make training larger or up-to-date LLMs impossible for anyone without lawyers and tens of millions of dollars. I don't think that's a good long-term outcome.