(Disclaimer: slightly out of my depth)
One thing researchers who were seriously studying LLMs pre-hypestravaganza were really excited about is the progression from completely bespoke model training (prep your problem-specific training data and train from scratch), to “transfer learning” (take a generic pretrained model and do a little extra training to specialize it, e.g. tuning a generic species classifier to recognize birds), to “zero-shot classification,” where you get a completely generic model to focus on a specific task described in the query itself, with no additional training at all. Serious people treat this kind of usage with the same empirical caution they’d apply to any other classifier system.
This really can work for the kind of use case Cat was talking about (slight increase in error rate for a huge increase in throughput), and in that usage pattern, LLMs are very much in the family of old-school classifiers.
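To make the usage pattern concrete, here is a toy sketch of zero-shot-style classification: the candidate labels arrive with the query, and nothing is trained on them. The scoring here is a deliberately dumb bag-of-words cosine similarity standing in for a real pretrained model; every name and the label descriptions are hypothetical, purely for illustration.

```python
# Toy illustration of the zero-shot *usage pattern*: candidate labels are
# supplied at query time, and the "model" (here a trivial bag-of-words
# cosine similarity, a stand-in for a real pretrained LLM) was never
# trained on those labels.
from collections import Counter
from math import sqrt


def bow(text: str) -> Counter:
    """Bag-of-words count vector for a string."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two count vectors."""
    dot = sum(a[w] * b[w] for w in a)
    na = sqrt(sum(v * v for v in a.values()))
    nb = sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def zero_shot_classify(query: str, label_descriptions: dict) -> str:
    """Return the label whose description best matches the query.
    A real system would swap the cosine score for an LLM's judgment."""
    q = bow(query)
    return max(label_descriptions,
               key=lambda lbl: cosine(q, bow(label_descriptions[lbl])))


labels = {
    "bird": "an animal with feathers wings and a beak that can fly",
    "fish": "an animal with fins and gills that swims in water",
}
print(zero_shot_classify("a small animal with feathers and a beak", labels))
```

The point is the shape of the interface, not the scoring: new labels can be swapped in per query with zero retraining, which is exactly the throughput trade Cat described, and exactly why the error rate still has to be measured like any classifier's.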
Jeremy, I’d argue the difference you’re talking about isn’t just in the model itself, but in how it’s used, how it’s presented, how the UI works, and how it’s marketed. When you change the mindset from “zero-shot classifiers with flexible queries” to “prompt engineering,” you have substantively changed the nature of the beast.