We've launched! After months of work, MLCommons has released our v1.0 benchmark that measures LLM (aka "AI") propensity for giving hazardous responses.
Here are the results for 15 common models: https://ailuminate.mlcommons.org/benchmarks/
And here's the overview: https://mlcommons.org/ailuminate/
I was the tech lead for the software, and I want to give a shout-out to my excellent team of developers and the many experts we worked closely with to make this happen.