Embed Notice

HTML Code

<blockquote style="position: relative; padding-left: 55px;"><section><a href="https://iddqd.social/objects/ef461d25-d778-48b3-80b1-c9692bb31c53">NEETzsche (neetzsche@iddqd.social)'s status on Sunday, 03-Dec-2023 13:44:58 JST</a><a href="https://iddqd.social/users/NEETzsche" title="neetzsche@iddqd.social"><img src="https://gnusocial.jp/avatar/1298-48-20220725103139.webp" width="48" height="48" alt="NEETzsche" style="position: absolute; left: 0; top: 0;">NEETzsche</a><div><a href="https://gnusocial.jp/notice/4757236" rel="in-reply-to">in reply to</a><ul><li><li><a href="https://gnusocial.jp/user/5046" title="caekislove@gleasonator.com">Caek Islove ? ❤️</a></li><li><a href="https://gnusocial.jp/user/11461" title="crunklord420@clubcyberia.co">CrunkLord420</a></li><li><a href="https://gnusocial.jp/user/26959" title="soy_magnus@detroitriotcity.com">Soy_Magnus</a></li></ul></div></section><article><p>The real reason it is dropped is because when training models it’s all about crunching numbers. The underlying transformation algorithms aren’t that hard to comprehend, but being able to go through iterations faster adds up quickly. I could implement these algorithms to be correct, but could I implement them fast compared to someone who really knows the domain and has spent years tinkering with pointer tricks in C? Probably not. Not overnight, at least. I’d probably have to spend years, just like them, learning all about this shit.</p><p>And that’s before we even bring up TPUs and ASICs.</p></article><footer><a rel="bookmark" href="https://gnusocial.jp/conversation/2284635#notice-4757237">In conversation</a><time datetime="2023-12-03T13:44:58+09:00" title="Sunday, 03-Dec-2023 13:44:58 JST">Sunday, 03-Dec-2023 13:44:58 JST</time> <span>from <span><a href="https://iddqd.social/objects/ef461d25-d778-48b3-80b1-c9692bb31c53" rel="external" title="Sent from iddqd.social via ActivityPub">iddqd.social</a></span></span><a href="https://iddqd.social/objects/ef461d25-d778-48b3-80b1-c9692bb31c53">permalink</a></footer></blockquote>

Corresponding Notice

Embed this notice
NEETzsche (neetzsche@iddqd.social)'s status on Sunday, 03-Dec-2023 13:44:58 JSTNEETzsche
in reply to
The real reason it is dropped is because when training models it’s all about crunching numbers. The underlying transformation algorithms aren’t that hard to comprehend, but being able to go through iterations faster adds up quickly. I could implement these algorithms to be correct, but could I implement them fast compared to someone who really knows the domain and has spent years tinkering with pointer tricks in C? Probably not. Not overnight, at least. I’d probably have to spend years, just like them, learning all about this shit.
And that’s before we even bring up TPUs and ASICs.
In conversation2 years ago from iddqd.socialpermalink

Public

Embed Notice

HTML Code

Corresponding Notice