@pinkdrunkenelephants @mcc In the EU, there actually is some legislation. Copyright explicitly *doesn't* protect works from being used in machine learning for academic research, but ML training for commercial products must respect a "machine-readable opt-out".
But that's easy enough to get around. That's why eg. Stability funded an "independent research lab" who did the actual data gathering for them.