I’m hoping that some Open Source project will start collecting and distributing _trusted data collections_ as raw material to train LLMs on. That’s where the value is, not in the models themselves, which are mostly trying to un-shittify the shit they got fed.
To me, the biggest issue with the whole thing is _I do not want something trained on the entirety of crap out there_. We _all_ know that most “content” available is biased, incorrect, racist, ignorant, hot garbage.