@inthehands I think LLMs mostly use https://commoncrawl.org/ rather than crawling the web themselves. The Internet Archive's Wayback Machine uses Common Crawl as a source and has Wookieepedia, so I think Wookieepedia is likely in Common Crawl already.