OpenAI: wE cAnT cReAtE AI wItHoUt cOpYrIgHtEd wOrKs
Open License Corpus with 228B tokens: *exists* https://huggingface.co/datasets/kernelmachine/open-license-corpus
various open licensed AIs and datasets: *exists*
https://medium.com/@bnjmn_marie/silo-a-new-llm-exclusively-trained-on-open-text-a6f4e1bf5243
https://huggingface.co/search/full-text?q=open+license&type=dataset
Hell you can even get pure chat/roleplaying datasets if you want something to typefuck with in terrible grammar (oh god why): https://huggingface.co/datasets?sort=trending&search=roleplay
GNU social JP is a social network, courtesy of GNU social JP管理人. It runs on GNU social, version 2.0.2-dev, available under the GNU Affero General Public License.
All GNU social JP content and data are available under the Creative Commons Attribution 3.0 license.