Embed Notice

HTML Code

<blockquote style="position: relative; padding-left: 55px;"><section><a href="https://ioc.exchange/users/invisv/statuses/110810705685199377">INVISV (invisv@ioc.exchange)'s status on Tuesday, 01-Aug-2023 06:26:36 JST</a><a href="https://ioc.exchange/@invisv" title="invisv@ioc.exchange"><img src="https://gnusocial.jp/avatar/106526-48-20230313105342.webp" width="48" height="48" alt="INVISV" style="position: absolute; left: 0; top: 0;">INVISV</a><div><a href="https://hachyderm.io/@inthehands/110810574440734278" rel="in-reply-to">in reply to</a><ul><li><li><a href="https://gnusocial.jp/user/53259" title="inthehands@hachyderm.io">Paul Cantrell</a></li></ul></div></section><article><p><a href="https://hachyderm.io/@inthehands">@inthehands</a> There was a good analysis by <a href="https://mastodon.social/@randomwalker">@randomwalker</a> that argued why such tests don't mean much when applied to AI (in addition to the fact that the tests themselves, even when applied to people, aren't exactly what they claim to be):</p><p><a href="https://aisnakeoil.substack.com/p/gpt-4-and-professional-benchmarks" rel="nofollow noreferrer">https://aisnakeoil.substack.com/p/gpt-4-and-professional-benchmarks</a></p></article><footer><a rel="bookmark" href="https://gnusocial.jp/conversation/1881626#notice-3690932">In conversation</a><time datetime="2023-08-01T06:26:36+09:00" title="Tuesday, 01-Aug-2023 06:26:36 JST">Tuesday, 01-Aug-2023 06:26:36 JST</time> <span>from <span><a href="https://ioc.exchange/@invisv/110810705685199377" rel="external" title="Sent from ioc.exchange via ActivityPub">ioc.exchange</a></span></span><a href="https://ioc.exchange/@invisv/110810705685199377">permalink</a></footer></blockquote>

Corresponding Notice

Embed this notice
INVISV (invisv@ioc.exchange)'s status on Tuesday, 01-Aug-2023 06:26:36 JSTINVISV
in reply to
- Arvind Narayanan
- Paul Cantrell
@inthehands There was a good analysis by @randomwalker that argued why such tests don't mean much when applied to AI (in addition to the fact that the tests themselves, even when applied to people, aren't exactly what they claim to be):
https://aisnakeoil.substack.com/p/gpt-4-and-professional-benchmarks
In conversationTuesday, 01-Aug-2023 06:26:36 JST from ioc.exchangepermalink

Public

Embed Notice

HTML Code

Corresponding Notice