Huh.
Ok, so parameter size is determined by (base) model architecture (which makes sense).
So... my model, currently based on GPT-2 small, is about 124M parameters.
Could definitely go higher, and I probably will... just not sure whether what I need is moar data, moar training, or MOAR PARAMETERS.
So many little levers and buttons and things to toy with.
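For what it's worth, that 124M figure falls straight out of the architecture knobs. A quick sketch of the arithmetic, assuming the standard GPT-2 small settings (12 layers, 768-dim embeddings, 50257-token vocab, 1024 context) and a weight-tied LM head:

```python
def gpt2_param_count(n_layer=12, n_embd=768, vocab_size=50257, n_ctx=1024):
    """Count parameters for a GPT-2-style model (weight-tied LM head)."""
    # embeddings: token table + learned positional table
    emb = vocab_size * n_embd + n_ctx * n_embd
    # per transformer block:
    #   two LayerNorms (weight + bias each)
    ln = 2 * (2 * n_embd)
    #   attention: fused QKV projection + output projection, both with biases
    attn = (n_embd * 3 * n_embd + 3 * n_embd) + (n_embd * n_embd + n_embd)
    #   MLP: 4x expansion and back down, both with biases
    mlp = (n_embd * 4 * n_embd + 4 * n_embd) + (4 * n_embd * n_embd + n_embd)
    blocks = n_layer * (ln + attn + mlp)
    # final LayerNorm; the LM head reuses the token embedding, so no extra weights
    final_ln = 2 * n_embd
    return emb + blocks + final_ln

print(gpt2_param_count())  # 124439808 — the "124M" of GPT-2 small
```

Scaling up basically means turning `n_layer` and `n_embd` — the embedding dimension dominates, since attention and MLP weights grow quadratically with it.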