UX idea for local LLMs:
Speed and responsiveness are highly desirable when chatting with LLMs, but edge devices don't have that kind of computing horsepower at their disposal. So why not borrow the tactics humans use in normal conversation: linguistic fillers, signals that we're formulating our thoughts (like the … typing animation in chat), even asking clarifying questions? These tactics cost very little compute and can run while the real answer is generated in the background.
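
A minimal sketch of the idea, assuming an async chat loop: `slow_llm_answer` is a hypothetical stand-in for the expensive local-model call, and a cheap filler plus a typing indicator are shown instantly while it runs in the background.

```python
import asyncio
import random

FILLERS = [
    "Hmm, let me think about that...",
    "Good question...",
    "One moment...",
]

async def slow_llm_answer(prompt: str) -> str:
    # Stand-in for the expensive local-LLM call; assume it takes seconds.
    await asyncio.sleep(3.0)
    return f"(real answer to: {prompt!r})"

async def respond(prompt: str) -> None:
    # Kick off the real generation in the background immediately.
    answer_task = asyncio.create_task(slow_llm_answer(prompt))

    # Emit a near-free filler right away so the user sees an instant
    # reaction, then keep a "typing" signal alive until the answer lands.
    print(random.choice(FILLERS))
    while not answer_task.done():
        print("...", flush=True)
        await asyncio.sleep(0.5)

    print(await answer_task)

if __name__ == "__main__":
    asyncio.run(respond("Why is the sky blue?"))
```

The same pattern could pick a clarifying question instead of a filler when the prompt is ambiguous, buying even more background time.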