... GPT enthusiast "Advent of AI" (henceforth enthusiast) who tried to solve the problems using ChatGPT Plus and then also wrote a blog about it using ChatGPT Plus. Since the blog is generated by ChatGPT it's absolute shit and potentially contains hallucinations, however the github repo with the transcripts is valuable. The enthusiast turned out to be completely useless: it resorted often to babystepping ChatGPT through to the result and he stopped on day 6 anyway. The youtuber was much more informative, for the most part he stuck to letting ChatGPT solve the problem on its own. However he did give it, on a few occasions, some big hints, either by debugging ChatGPT's solution for it or explaining it how to solve the problem. I have noted this in the results. Results part 1 part 2 notes day 1 OK FAIL day 2 OK OK day 3 FAIL N/A day 4 OK OK Uses brute force solution for part 2 day 5 OK FAIL day 6 FAIL N/A ChatGPT Plus solves both parts day 7 FAIL N/A day 8 OK FAIL ChatGPT Plus solves part 2 if you tell it what the solution is day 9 FAIL N/A ChatGPT Plus solves both parts day 10 FAIL N/A day 11 FAIL N/A ChatGPT Plus could solve part 1 with a big hint day 12 FAIL N/A The perofrmance of GPT-4 this year was a bit worse than GPT-3.5 last year. Last year GPT-3.5 could solve 3 days on its own (1, 2 and 4) while GPT-4 this year could only solve 2 full days (2 and 4). ChatGPT Plus however did a bit better, solving on its own 4 days (2, 4, 6 and 9).
https://files.mastodon.social/media_attachments/files/111/758/879/780/328/903/original/5773a1bf81e320b9.png