@ajroach42 @sam doesn't that book actually support more my side of the argument?
As I understand the criticism around consent/plagiarism is that the training corpus for the pretraining comes from big piles of relatively indiscriminately scraped data including copyrighted works.
...