The cool thing is that it’s impossible to find out the original licenses, or whether that code was used as a bad example: an inherent quality of LLMs is that they are black boxes of mystery.
It doesn’t help that the Big Tech companies that make them don’t disclose their training data (likely because of some mixture of copyright infringement and liability concerns).