@thomasfuchs @lisamelton Citation needed. The differences between GPT generations have been qualitative so far. More blocks and larger context windows mean more abstract features and more state. I don't see how you conclude that won't improve quality. The "tech bro" conceit is "scaling is _all_ you need," following "The Bitter Lesson" argument: meta-methods that can find and capture complexity > methods inserted manually. I don't know of any studies that strongly refute this.
https://www.cs.utexas.edu/~eunsol/courses/data/bitter_lesson.pdf
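A back-of-the-envelope sketch of the "more blocks, bigger windows" point, in plain Python. The configs are hypothetical stand-ins, not official GPT figures: adding blocks multiplies parameter count (capacity for more abstract features), while widening the context grows the per-token attention state (the KV cache) linearly.

```python
def params_per_block(d_model: int) -> int:
    """Rough parameters in one transformer block: attention
    (4 * d^2 for the Q, K, V, and output projections) plus a
    4x-wide MLP (8 * d^2), ignoring biases and layer norms."""
    return 4 * d_model**2 + 8 * d_model**2

def kv_cache_bytes(n_blocks: int, d_model: int, context_len: int,
                   bytes_per_value: int = 2) -> int:
    """State held while decoding: one key and one value vector
    per token, per block (assuming fp16 values)."""
    return 2 * n_blocks * d_model * context_len * bytes_per_value

# Hypothetical small vs. large configs, chosen only for illustration:
for name, n_blocks, d_model, ctx in [
    ("small", 12, 768, 2_048),
    ("large", 96, 12_288, 32_768),
]:
    total = n_blocks * params_per_block(d_model)
    state = kv_cache_bytes(n_blocks, d_model, ctx)
    print(f"{name}: ~{total / 1e9:.1f}B block params, "
          f"~{state / 2**30:.1f} GiB of attention state")
```

Both quantities grow multiplicatively across generations under these assumptions, which is the (unproven either way) basis for expecting quality to keep improving.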