@jxself Tricky 🤔 But I have some ideas.
Sites behind Cloudflare's reverse proxy don't load at all without #javascript. And it's easy to determine which sites are using #cloudflare from their IP addresses. Blacklisting Cloudflare sites would do half the job.
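A rough sketch of the IP check in Python, assuming a hard-coded partial snapshot of Cloudflare's published IPv4 ranges (the current list lives at https://www.cloudflare.com/ips-v4 and should be refreshed from there). The function names are made up for illustration:

```python
import ipaddress
import socket

# Partial snapshot of Cloudflare's published IPv4 ranges.
# Assumption: this may be incomplete or outdated; refresh it from
# https://www.cloudflare.com/ips-v4 before relying on it.
CLOUDFLARE_RANGES = [
    ipaddress.ip_network(cidr)
    for cidr in (
        "173.245.48.0/20",
        "103.21.244.0/22",
        "141.101.64.0/18",
        "108.162.192.0/18",
        "162.158.0.0/15",
        "104.16.0.0/13",
        "104.24.0.0/14",
        "172.64.0.0/13",
    )
]


def is_cloudflare_ip(ip: str) -> bool:
    """Return True if the address falls inside one of the listed ranges."""
    addr = ipaddress.ip_address(ip)
    return any(addr in net for net in CLOUDFLARE_RANGES)


def is_cloudflare_host(hostname: str) -> bool:
    """Resolve a hostname (IPv4 only) and check every returned address."""
    try:
        infos = socket.getaddrinfo(
            hostname, 443, family=socket.AF_INET, proto=socket.IPPROTO_TCP
        )
    except socket.gaierror:
        return False  # unresolvable hosts are treated as "not Cloudflare"
    return any(is_cloudflare_ip(info[4][0]) for info in infos)
```

Something like `is_cloudflare_host("some-site.example")` could then feed the blacklist. It only looks at IPv4, so sites reachable over IPv6 only would slip through.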
Next we could create a blacklist of the top garbage sites requiring JS that always come up in the search results. That mostly leaves indie sites, almost all of which are readable without JS (the few exceptions could be sacrificed).
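The domain blacklist could be applied to a list of result URLs roughly like this. It's only a sketch: the blocked domains are placeholders and the function names are hypothetical.

```python
from urllib.parse import urlsplit

# Placeholder entries; a real list would hold the JS-only sites
# that keep showing up in results.
BLOCKED_DOMAINS = {"example.com", "js-only.example"}


def is_blocked(url: str, blocked=BLOCKED_DOMAINS) -> bool:
    """True if the URL's host is a blocked domain or one of its subdomains."""
    host = (urlsplit(url).hostname or "").lower()
    return any(host == d or host.endswith("." + d) for d in blocked)


def filter_results(urls):
    """Keep only the result URLs that are not on the blacklist."""
    return [u for u in urls if not is_blocked(u)]
```

Matching on the full domain plus a leading dot keeps lookalike hosts such as `notexample.com` from being blocked by accident.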