The insufficiency is simple: when an LLM trained on software can output portions of copyrighted software, which such models absolutely can and do, and that output gets used in proprietary software, all the provenance tracking of the dataset used to train it becomes irrelevant. At that point my license has been violated.
The silence on this topic in Software Heritage's statement, in their list of principles, is deafening.