We've launched! After months of work, MLCommons has released v1.0 of our benchmark, which measures an LLM's (aka "AI's") propensity to give hazardous responses.
Here are the results for 15 common models: https://ailuminate.mlcommons.org/benchmarks/
And here's the overview: https://mlcommons.org/ailuminate/
I was the tech lead for the software, and I want to give a shout-out to my excellent team of developers and the many experts we worked closely with to make this happen.