Embed Notice

HTML Code

<blockquote style="position: relative; padding-left: 55px;"><section><a href="https://blob.cat/objects/94101c40-4289-4569-bc2c-3dfa79f07208">iced depresso (icedquinn@blob.cat)'s status on Sunday, 18-Feb-2024 16:11:11 JST</a><a href="https://blob.cat/users/icedquinn" title="icedquinn@blob.cat"><img src="https://gnusocial.jp/avatar/1514-48-20240327163110.webp" width="48" height="48" alt="iced depresso" style="position: absolute; left: 0; top: 0;">iced depresso</a><div><a href="https://blob.cat/objects/a37102b7-4cbe-42f0-8d29-0347ae442243" rel="in-reply-to">in reply to</a><ul><li></ul></div></section><article>it basically takes all input parameters and adds a whole layer of neurons. then it runs that network through your standard gradient descent stuff. it then does a 'top k' pass so only the top k number of neurons (basically lowest error in testing) is kept and the rest are trashed. then it repeats the process again, until some stop condition.<br><br>wasn't thinking about it much but thats probably very similar to what this paper is doing, albeit in a way that is different and doesn't require changing how pytorch works.</article><footer><a rel="bookmark" href="https://gnusocial.jp/conversation/2731183#notice-5417651">In conversation</a><time datetime="2024-02-18T16:11:11+09:00" title="Sunday, 18-Feb-2024 16:11:11 JST">Sunday, 18-Feb-2024 16:11:11 JST</time> <span>from <span><a href="https://blob.cat/objects/94101c40-4289-4569-bc2c-3dfa79f07208" rel="external" title="Sent from blob.cat via ActivityPub">blob.cat</a></span></span><a href="https://blob.cat/objects/94101c40-4289-4569-bc2c-3dfa79f07208">permalink</a></footer></blockquote>

Corresponding Notice

Embed this notice
iced depresso (icedquinn@blob.cat)'s status on Sunday, 18-Feb-2024 16:11:11 JSTiced depresso
in reply to
- iced depresso
it basically takes all input parameters and adds a whole layer of neurons. then it runs that network through your standard gradient descent stuff. it then does a 'top k' pass so only the top k number of neurons (basically lowest error in testing) is kept and the rest are trashed. then it repeats the process again, until some stop condition.

wasn't thinking about it much but thats probably very similar to what this paper is doing, albeit in a way that is different and doesn't require changing how pytorch works.
In conversationSunday, 18-Feb-2024 16:11:11 JST from blob.catpermalink

Public

Embed Notice

HTML Code

Corresponding Notice