Given:
- The web is increasingly drowning in LLM spamblogs that generate any plausible text that maximizes SEO and drives clicks.
- An appreciable and increasing proportion of those clicks are LLMs crawling for input data
- The statistical language generation function of LLMs is different from human language by any value larger than none
- Many of these spambots have little if any active supervision
It must necessarily be true, then, that:
- language models partially drive the loss function for generated text
- language models make different words than we make
- language models like different words than we like
- there are some websites that only language models go on
- there are some websites that are very popular, only with language models
- there is an increasingly large shadow internet that is not dead internet, but a "live" internet, by language models, for language models, that will become increasingly untethered from human language and is entirely powered by grift