No joke. I just ran a test compressing the 100+ GB and it is compressing down to maybe 1/10th the size using 7z on the LZMA2 Ultra setting.
@Infoseepage I believe it. CSV data is usually incredibly repetitive and can often compress down quite a lot.
Maybe we should update the Internet Archive page for this dataset with the compressed files?