Embed Notice

HTML Code

<blockquote style="position: relative; padding-left: 55px;"><section><a href="https://mastodon.social/users/gvwilson/statuses/111774776723804995">Greg Wilson (gvwilson@mastodon.social)'s status on Friday, 19-Jan-2024 00:10:54 JST</a><a href="https://mastodon.social/@gvwilson" title="gvwilson@mastodon.social"><img src="https://gnusocial.jp/avatar/53907-48-20240425104008.webp" width="48" height="48" alt="Greg Wilson" style="position: absolute; left: 0; top: 0;">Greg Wilson</a></section><article><p>Irugalbandara et al 2024: "A Trade-off Analysis of Replacing Proprietary LLMs with Open Source SLMs in Production" <a href="https://arxiv.org/abs/2312.14972" rel="nofollow noreferrer">https://arxiv.org/abs/2312.14972</a> "We present a systematic evaluation methodology for…Small Language Models (SLMs) and their tradeoffs when replacing a proprietary LLM APIs…across 9 SLMs and 29 variants, we observe competitive quality-of-results for our use case, significant performance consistency improvement, and a cost reduction of 5x-29x when compared to OpenAI GPT-4." 
<a href="https://mastodon.social/tags/nwit" rel="tag">#nwit</a></p></article><footer><a rel="bookmark" href="https://gnusocial.jp/conversation/2604956#notice-5158820">In conversation</a><time datetime="2024-01-19T00:10:54+09:00" title="Friday, 19-Jan-2024 00:10:54 JST">Friday, 19-Jan-2024 00:10:54 JST</time> <span>from <span><a href="https://mastodon.social/@gvwilson/111774776723804995" rel="external" title="Sent from mastodon.social via ActivityPub">mastodon.social</a></span></span><a href="https://mastodon.social/@gvwilson/111774776723804995">permalink</a><h4>Attachments</h4><ol><li><article><header><div>Domain not in remote thumbnail source whitelist: arxiv.org</div><h5><a href="https://arxiv.org/abs/2312.14972">A Trade-off Analysis of Replacing Proprietary LLMs with Open Source SLMs in Production</a></h5><div></div></header><div>Many companies rely on APIs of managed AI models such as OpenAI's GPT-4 to create AI-enabled experiences in their products. Along with the benefits of ease of use and shortened time to production, this reliance on proprietary APIs has downsides in terms of model control, performance reliability, up-time predictability, and cost. At the same time, there has been a flurry of open source small language models (SLMs) that have been made available for commercial use. However, their readiness to replace existing capabilities remains unclear, and a systematic approach to test these models is not readily available. In this paper, we present a systematic evaluation methodology for, and characterization of, modern open source SLMs and their trade-offs when replacing a proprietary LLM APIs for a real-world product feature. We have designed SLaM, an automated analysis tool that enables the quantitative and qualitative testing of product features utilizing arbitrary SLMs. Using SLaM, we examine both the quality and the performance characteristics of modern SLMs relative to an existing customer-facing OpenAI-based implementation. 
We find that across 9 SLMs and 29 variants, we observe competitive quality-of-results for our use case, significant performance consistency improvement, and a cost reduction of 5x-29x when compared to OpenAI GPT-4.</div><footer></footer></article></li></ol></footer></blockquote>

Corresponding Notice

Greg Wilson (gvwilson@mastodon.social)'s status on Friday, 19-Jan-2024 00:10:54 JST
Irugalbandara et al 2024: "A Trade-off Analysis of Replacing Proprietary LLMs with Open Source SLMs in Production" https://arxiv.org/abs/2312.14972 "We present a systematic evaluation methodology for…Small Language Models (SLMs) and their tradeoffs when replacing a proprietary LLM APIs…across 9 SLMs and 29 variants, we observe competitive quality-of-results for our use case, significant performance consistency improvement, and a cost reduction of 5x-29x when compared to OpenAI GPT-4." #nwit
In conversation, Friday, 19-Jan-2024 00:10:54 JST, from mastodon.social
Attachments
1. A Trade-off Analysis of Replacing Proprietary LLMs with Open Source SLMs in Production
  Many companies rely on APIs of managed AI models such as OpenAI's GPT-4 to create AI-enabled experiences in their products. Along with the benefits of ease of use and shortened time to production, this reliance on proprietary APIs has downsides in terms of model control, performance reliability, up-time predictability, and cost. At the same time, there has been a flurry of open source small language models (SLMs) that have been made available for commercial use. However, their readiness to replace existing capabilities remains unclear, and a systematic approach to test these models is not readily available. In this paper, we present a systematic evaluation methodology for, and characterization of, modern open source SLMs and their trade-offs when replacing proprietary LLM APIs for a real-world product feature. We have designed SLaM, an automated analysis tool that enables the quantitative and qualitative testing of product features utilizing arbitrary SLMs. Using SLaM, we examine both the quality and the performance characteristics of modern SLMs relative to an existing customer-facing OpenAI-based implementation. We find that across 9 SLMs and 29 variants, we observe competitive quality-of-results for our use case, significant performance consistency improvement, and a cost reduction of 5x-29x when compared to OpenAI GPT-4.
