@peterkaminski "LCMs employ a hierarchical structure, mirroring human reasoning processes"
Not a good sign if they think our reasoning is hierarchical.
"reduces sequence length compared to token-level processing, addressing the quadratic complexity of standard Transformers and enabling more efficient handling of long contexts"
Now seeing "concepts" as a kind of compression (of strings of tokens), which I've seen articulated before as a way of understanding much of what's happening with LLMs.
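To make the quadratic-complexity point concrete, here's a rough back-of-the-envelope sketch (mine, not from the paper) — the token count and the tokens-per-concept ratio are made-up assumptions, just to show that compressing a token sequence into sentence-level "concepts" cuts attention cost by roughly the square of the compression ratio:

```python
# Minimal sketch: self-attention cost grows ~quadratically with sequence length,
# so grouping ~k tokens into one "concept" shrinks that cost by roughly k^2.
# Numbers below are illustrative assumptions, not figures from the LCM paper.

def attention_cost(seq_len: int, dim: int = 1024) -> int:
    """Rough FLOP estimate for one self-attention layer: O(seq_len^2 * dim)."""
    return seq_len * seq_len * dim

n_tokens = 8192            # assumed token-level sequence length
tokens_per_concept = 16    # assumed average tokens per sentence/concept
n_concepts = n_tokens // tokens_per_concept

token_cost = attention_cost(n_tokens)
concept_cost = attention_cost(n_concepts)

print(f"token-level attention cost:   {token_cost:.3e}")
print(f"concept-level attention cost: {concept_cost:.3e}")
print(f"reduction factor: {token_cost / concept_cost:.0f}x")  # ~ tokens_per_concept ** 2
```

So under those assumptions the concept-level sequence pays about 1/256th of the attention cost, which is the "compression" framing in miniature.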