Like, GPT-3 failed questions of the form "I'm in the basement and look at the sky. What do I see?" GPT-4 fixed this by having humans correct its mistakes. I imagine if I were a kid getting this question for the first time, especially in a place where there aren't typically basements, what I'd do is probably imagine being in a basement.