Embed Notice

HTML Code

<blockquote style="position: relative; padding-left: 55px;"><section><a href="https://mstdn.social/users/mattwilcox/statuses/111992157034183715">Matt Wilcox (mattwilcox@mstdn.social)'s status on Sunday, 25-Feb-2024 22:26:55 JST</a><a href="https://mstdn.social/@mattwilcox" title="mattwilcox@mstdn.social"><img src="https://gnusocial.jp/avatar/21188-48-20230409115024.webp" width="48" height="48" alt="Matt Wilcox" style="position: absolute; left: 0; top: 0;">Matt Wilcox</a><div><a href="https://mstdn.social/@mattwilcox/111992139533144409" rel="in-reply-to">in reply to</a></div></section><article><p>I’m hoping that some Open Source project will start collecting and distributing _trusted data collections_ as raw material to train LLM’s on. Thats where the value is; not the models themselves. Which are mostly trying to un-shittify the shit they got fed.</p><p>To me, the biggest issue with the whole thing is _I do not want something trained on the entirety of crap out there_. We _all_ know that most “content” available is biased, incorrect, racist, ignorant, hot garbage.</p></article><footer><a rel="bookmark" href="https://gnusocial.jp/conversation/2759723#notice-5471844">In conversation</a><time datetime="2024-02-25T22:26:55+09:00" title="Sunday, 25-Feb-2024 22:26:55 JST">about a year ago</time> <span>from <span><a href="https://mstdn.social/@mattwilcox/111992157034183715" rel="external" title="Sent from mstdn.social via ActivityPub">mstdn.social</a></span></span><a href="https://mstdn.social/@mattwilcox/111992157034183715">permalink</a><h4>Attachments</h4><ol><li><label><a rel="external" href="https://gnusocial.jp/attachment/2304249">Untitled attachment</a></label><br></li></ol></footer></blockquote>

Corresponding Notice

Embed this notice
Matt Wilcox (mattwilcox@mstdn.social)'s status on Sunday, 25-Feb-2024 22:26:55 JST Matt Wilcox
in reply to
I’m hoping that some Open Source project will start collecting and distributing _trusted data collections_ as raw material to train LLM’s on. Thats where the value is; not the models themselves. Which are mostly trying to un-shittify the shit they got fed.
To me, the biggest issue with the whole thing is _I do not want something trained on the entirety of crap out there_. We _all_ know that most “content” available is biased, incorrect, racist, ignorant, hot garbage.
In conversationabout a year ago from mstdn.socialpermalink
Attachments
1. Untitled attachment

Public

Embed Notice

HTML Code

Corresponding Notice