@anildash if that model existed, it’d be worse than GPT-1, you need insane amount of data for good performance