Playing GPT-x at chess reveals a major limitation of Large Language Models: a complete lack of dynamic reasoning.
So many tasks require the ability to plan ahead and evaluate potential outcomes (e.g., is this a good chess move or a bad chess move?)
It's arguably the whole point of intelligence.
This is coupled with a complete lack of temporal reasoning. LLMs have no sense of history or ordering of events.