GNU social JP
GNU social JP is a Japanese GNU social server.

  1. The Markup (themarkup@mastodon.themarkup.org)'s status on Saturday, 20-Jul-2024 20:54:00 JST

    The tests that big tech companies use to benchmark their AI tools have many issues, and high scores might be misleading.

    Here’s why: https://themarkup.org/artificial-intelligence/2024/07/17/everyone-is-judging-ai-by-these-tests-but-experts-say-theyre-close-to-meaningless

    Attachments

    1. Everyone Is Judging AI by These Tests. But Experts Say They’re Close to Meaningless – The Markup
      from @themarkup
      Benchmarks used to rank AI models are several years old, often sourced from amateur websites, and, experts worry, lending automated systems a dubious sense of authority

GNU social JP is a social network, courtesy of the GNU social JP administrator (GNU social JP管理人). It runs on GNU social, version 2.0.2-dev, available under the GNU Affero General Public License.

All GNU social JP content and data are available under the Creative Commons Attribution 3.0 license.