Public
- Public
- Network
- Groups
- Featured
- Popular
- People

Untitled attachment

Download link

Untitled attachment

Notices where this attachment appears

Embed this notice
William Pietri (williampietri@sfba.social)'s status on Monday, 17-Feb-2025 03:00:36 JST William Pietri

We've launched! After months of work, MLCommons has released our v1.0 benchmark that measures LLM (aka "AI") propensity for giving hazardous responses.
Here's the results for 15 common models: https://ailuminate.mlcommons.org/benchmarks/
And here's the overview: https://mlcommons.org/ailuminate/
I was the tech lead for the software and want to give a shout out to my excellent team of developers and the many experts we worked closely with to make this happen.

In conversation about 4 months ago from sfba.social permalink