Embed Notice

HTML Code

<blockquote style="position: relative; padding-left: 55px;"><section><a href="https://mastodon.xyz/users/zacchiro/statuses/113429801651531486">Stefano Zacchiroli (zacchiro@mastodon.xyz)'s status on Tuesday, 05-Nov-2024 19:10:20 JST</a><a href="https://mastodon.xyz/@zacchiro" title="zacchiro@mastodon.xyz"><img src="https://gnusocial.jp/avatar/114272-48-20230423125528.webp" width="48" height="48" alt="Stefano Zacchiroli" style="position: absolute; left: 0; top: 0;">Stefano Zacchiroli</a><div><a href="https://gnusocial.jp/notice/7691541" rel="in-reply-to">in reply to</a><ul><li><li><a href="https://gnusocial.jp/user/289577" title="chaz@burn.capital">⍨</a></li></ul></div></section><article><p><a href="https://gnusocial.jp/lxo">@lxo</a> <a href="https://burn.capital/@chaz">@chaz</a> It's a good approach, but I don't think it's needed.</p><p>If we start from first principles, there's no doubt that to fully exercise freedoms of study and modify, you need the training data. (You can exercise *some* of those freedoms even without training data, but in a suboptimal way. I can give precise examples if you're curious.)</p><p>The "data is too big" problem is IMO a distraction. There are relevant ML systems that are small enough to make retraining them from scratch viable.</p></article><footer><a rel="bookmark" href="https://gnusocial.jp/conversation/3885435#notice-7697597">In conversation</a><time datetime="2024-11-05T19:10:20+09:00" title="Tuesday, 05-Nov-2024 19:10:20 JST">about 6 months ago</time> <span>from <span><a href="https://mastodon.xyz/@zacchiro/113429801651531486" rel="external" title="Sent from mastodon.xyz via ActivityPub">mastodon.xyz</a></span></span><a href="https://mastodon.xyz/@zacchiro/113429801651531486">permalink</a></footer></blockquote>

Corresponding Notice

Embed this notice
Stefano Zacchiroli (zacchiro@mastodon.xyz)'s status on Tuesday, 05-Nov-2024 19:10:20 JSTStefano Zacchiroli
in reply to
- Alexandre Oliva (moving to @lxo@snac.lx.oliva.nom.br)
- ⍨
@lxo @chaz It's a good approach, but I don't think it's needed.
If we start from first principles, there's no doubt that to fully exercise freedoms of study and modify, you need the training data. (You can exercise *some* of those freedoms even without training data, but in a suboptimal way. I can give precise examples if you're curious.)
The "data is too big" problem is IMO a distraction. There are relevant ML systems that are small enough to make retraining them from scratch viable.
In conversationabout 6 months ago from mastodon.xyzpermalink

Public

Embed Notice

HTML Code

Corresponding Notice