Embed Notice

HTML Code

<blockquote style="position: relative; padding-left: 55px;"><section><a href="https://tech.lgbt/users/nicuveo/statuses/110707156920117903">Antoine Leblanc (nicuveo@tech.lgbt)'s status on Saturday, 15-Jul-2023 10:40:25 JST</a><a href="https://tech.lgbt/@nicuveo" title="nicuveo@tech.lgbt"><img src="https://gnusocial.jp/avatar/149585-48-20230715013940.webp" width="48" height="48" alt="Antoine Leblanc" style="position: absolute; left: 0; top: 0;">Antoine Leblanc</a><div><a href="https://tech.lgbt/@nicuveo/110703091996013791" rel="in-reply-to">in reply to</a></div></section><article><p>oh my god i had missed this from the postmortem, but the way they validated the test queries they ran (asking the LLM for explanations of code samples) was... TO FEED THE RESULTS INTO GPT-4 AND ASK IT TO CLASSIFY THEM AS "ACCURATE" OR "INACCURATE". THEY ONLY REVIEWED SOME OF THE ONES LABELLED AS "INACCURATE".</p><p>i want to scream.</p><p>at least, i feel vindicated for engaging in the thread despite not being a heavy MDN user, because it is extremely apparent that they have no idea what they're doing.</p></article><footer><a rel="bookmark" href="https://gnusocial.jp/conversation/1795500#notice-3523304">In conversation</a><time datetime="2023-07-15T10:40:25+09:00" title="Saturday, 15-Jul-2023 10:40:25 JST">Saturday, 15-Jul-2023 10:40:25 JST</time> <span>from <span><a href="https://tech.lgbt/@nicuveo/110707156920117903" rel="external" title="Sent from tech.lgbt via ActivityPub">tech.lgbt</a></span></span><a href="https://tech.lgbt/@nicuveo/110707156920117903">permalink</a><h4>Attachments</h4><ol><li><label><a rel="external" href="https://gnusocial.jp/attachment/1260180">Untitled attachment</a></label><br></li></ol></footer></blockquote>

Corresponding Notice

Embed this notice
Antoine Leblanc (nicuveo@tech.lgbt)'s status on Saturday, 15-Jul-2023 10:40:25 JST Antoine Leblanc
in reply to
oh my god i had missed this from the postmortem, but the way they validated the test queries they ran (asking the LLM for explanations of code samples) was... TO FEED THE RESULTS INTO GPT-4 AND ASK IT TO CLASSIFY THEM AS "ACCURATE" OR "INACCURATE". THEY ONLY REVIEWED SOME OF THE ONES LABELLED AS "INACCURATE".
i want to scream.
at least, i feel vindicated for engaging in the thread despite not being a heavy MDN user, because it is extremely apparent that they have no idea what they're doing.
In conversationSaturday, 15-Jul-2023 10:40:25 JST from tech.lgbtpermalink
Attachments
1. Untitled attachment

Public

Embed Notice

HTML Code

Corresponding Notice