Thinking about the OSI "Open Source AI" Definition and how to proceed. Like say they changed their definition to demand that all training data was "available" (right now you only need to describe it) meaning there are URLs that you can access. Think YouTube Videos or social media posts or whatnot. But not all content is under a free license, some explicitly copyrighted with "all rights reserved".
Would you consider a machine learning system trained on that data still "open source" in the intention of the Open Source definition (https://opensource.org/osd)?