Embed Notice

HTML Code

<blockquote style="position: relative; padding-left: 55px;"><section><a href="https://pleroma.shunderdo.me/objects/27a4f3d2-34b7-451d-8131-d8a741f241a1">Cheetah Meld (pingviini@pleroma.shunderdo.me)'s status on Friday, 01-May-2026 20:34:07 JST</a><a href="https://pleroma.shunderdo.me/users/pingviini" title="pingviini@pleroma.shunderdo.me"><img src="https://gnusocial.jp/avatar/4939-48-20220813081536.webp" width="48" height="48" alt="Cheetah Meld" style="position: absolute; left: 0; top: 0;">Cheetah Meld</a><div><a href="https://gnusocial.jp/notice/12534922" rel="in-reply-to">in reply to</a><ul><li><li><a href="https://gnusocial.jp/user/376108" title="kaia@pleroma.soykaf.com">kaia</a></li></ul></div></section><article><a href="https://minidisc.tokyo/@SuperDicq">@SuperDicq</a> <a href="https://pleroma.soykaf.com/users/kaia">@kaia</a> <br><br>&gt; These local models are still proprietary and you have no control over them.<br><br>Simply not true. Many many models out there (and not just academic, irrelevant models, I mean models that people actually use) are licensed freely (Apache 2.0 or MIT are popular licenses) and that includes the parameters, i.e. all those numbers in the gigabytes-large .safetensors file that you download and then stuff into your GPU's VRAM. The software used for inferencing and (additional) training with these models is also freely licensed.<br><br>What's true is that, more often (but not always!) the training data used to produce those parameters is proprietary and/or undisclosed. The computing power required to train a model *completely from scratch* is also prohibitively time-consuming and/or expensive for any single user with consumer hardware.<br><br>So it's easy to assume that the big blob file used for inferencing is completely opaque and immutable, regardless of licensing, and effectively people are still using a black box, but that's not true - just like an executable binary program isn't really a black box. It's on your computer, you can look at the binary, you can patch it, you can disassemble it, you can run it in a debugger, you can run it in a sandbox, you can inject your own code into it, etc.<br><br>You can do a lot of similar things with models on your computer, like training Low-Rank Adaptations for these models using small datasets on consumer hardware to adapt and modify the output of the model, or modify the parameters directly to modify the output of a model. People routinely do both of these things to teach models new tricks (example: Generate images of things that the base model does not know about) or to make the model do things the original creator trained it not to do (example: Do ERP or use politically incorrect language).</article><footer><a rel="bookmark" href="https://gnusocial.jp/conversation/6365366#notice-12535370">In conversation</a><time datetime="2026-05-01T20:34:07+09:00" title="Friday, 01-May-2026 20:34:07 JST">about 2 months ago</time> <span>from <span><a href="https://pleroma.shunderdo.me/objects/27a4f3d2-34b7-451d-8131-d8a741f241a1" rel="external" title="Sent from pleroma.shunderdo.me via ActivityPub">pleroma.shunderdo.me</a></span></span><a href="https://pleroma.shunderdo.me/objects/27a4f3d2-34b7-451d-8131-d8a741f241a1">permalink</a></footer></blockquote>

Corresponding Notice

Embed this notice
Cheetah Meld (pingviini@pleroma.shunderdo.me)'s status on Friday, 01-May-2026 20:34:07 JSTCheetah Meld
in reply to
- SuperDicq
- kaia
@SuperDicq @kaia

> These local models are still proprietary and you have no control over them.

Simply not true. Many many models out there (and not just academic, irrelevant models, I mean models that people actually use) are licensed freely (Apache 2.0 or MIT are popular licenses) and that includes the parameters, i.e. all those numbers in the gigabytes-large .safetensors file that you download and then stuff into your GPU's VRAM. The software used for inferencing and (additional) training with these models is also freely licensed.

What's true is that, more often (but not always!) the training data used to produce those parameters is proprietary and/or undisclosed. The computing power required to train a model *completely from scratch* is also prohibitively time-consuming and/or expensive for any single user with consumer hardware.

So it's easy to assume that the big blob file used for inferencing is completely opaque and immutable, regardless of licensing, and effectively people are still using a black box, but that's not true - just like an executable binary program isn't really a black box. It's on your computer, you can look at the binary, you can patch it, you can disassemble it, you can run it in a debugger, you can run it in a sandbox, you can inject your own code into it, etc.

You can do a lot of similar things with models on your computer, like training Low-Rank Adaptations for these models using small datasets on consumer hardware to adapt and modify the output of the model, or modify the parameters directly to modify the output of a model. People routinely do both of these things to teach models new tricks (example: Generate images of things that the base model does not know about) or to make the model do things the original creator trained it not to do (example: Do ERP or use politically incorrect language).
In conversationabout 2 months ago from pleroma.shunderdo.mepermalink

Public

Embed Notice

HTML Code

Corresponding Notice