Back when I was a kid at school we had IT lessons. One of the first things we were taught was “GIGO: garbage in, garbage out.”
A computer supplied with rubbish can only output rubbish.
Anyway, if you train LLMs on “everything out there”, you’re going to get “general shit” out. The model can’t know what’s good or bad or right, just what’s most likely. Think of the most average person imaginable with the most average hot take.
Let me know when you can train LLMs locally on curated content.