@jannem @mntmn
AFAIK the way these LLM tools work is roughly this: they have an embedding of words into a vector space. To index text, they convert every word in every document to a vector and store it in a database together with the ID of the document it came from. When you search, they turn each query word into a vector and look up the K nearest neighbors in the vector space for each one.
Then they feed the documents they found to an LLM.
What if you skipped the last step?
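To make the idea concrete, here's a minimal sketch of that index-then-kNN pipeline with the LLM step dropped, so the search results come straight from the vector lookup. The embedding here is just a deterministic pseudo-random vector per word (a stand-in for a trained model, so only exact word matches score highly), and names like `index_docs` and `search` are made up for the sketch:

```python
import numpy as np
from collections import Counter

DIM = 64

def embed_word(word):
    # Stand-in for a learned embedding model: a deterministic
    # pseudo-random unit vector per word. A real model would place
    # related words near each other; this one only matches exact words.
    rng = np.random.default_rng(abs(hash(word)) % (2**32))
    v = rng.standard_normal(DIM)
    return v / np.linalg.norm(v)

def index_docs(docs):
    # Store one vector per word occurrence, tagged with its doc ID.
    vectors, doc_ids = [], []
    for doc_id, text in docs.items():
        for word in text.lower().split():
            vectors.append(embed_word(word))
            doc_ids.append(doc_id)
    return np.array(vectors), doc_ids

def search(query, vectors, doc_ids, k=3):
    # For each query word, take the K nearest stored vectors
    # (cosine similarity, since everything is unit length) and
    # let their doc IDs vote.
    votes = Counter()
    for word in query.lower().split():
        sims = vectors @ embed_word(word)
        for i in np.argsort(sims)[-k:]:
            votes[doc_ids[i]] += 1
    return [d for d, _ in votes.most_common()]

docs = {
    "doc1": "the cat sat on the mat",
    "doc2": "dogs chase the mailman",
    "doc3": "my cat ignores the dog",
}
vectors, doc_ids = index_docs(docs)
results = search("cat", vectors, doc_ids)
# results contains the documents whose words landed nearest the query
```

Skipping the LLM step just means returning `results` (or the documents themselves) directly, which is plain semantic search.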