GNU social JP
  • FAQ
  • Login
GNU social JPは日本のGNU socialサーバーです。
Usage/ToS/admin/test/Pleroma FE
  • Public

    • Public
    • Network
    • Groups
    • Featured
    • Popular
    • People

Conversation

Notices

  1. Embed this notice
    The Markup (themarkup@mastodon.themarkup.org)'s status on Saturday, 20-Jul-2024 20:54:00 JST The Markup The Markup

    The tests that big techs use to benchmark their AI tools have many issues, and high scores might be misleading.

    Here’s why: https://themarkup.org/artificial-intelligence/2024/07/17/everyone-is-judging-ai-by-these-tests-but-experts-say-theyre-close-to-meaningless

    In conversation about a year ago from mastodon.themarkup.org permalink

    Attachments

    1. Domain not in remote thumbnail source whitelist: mrkp-static-production.themarkup.org
      Everyone Is Judging AI by These Tests. But Experts Say They’re Close to Meaningless – The Markup
      from @themarkup
      Benchmarks used to rank AI models are several years old, often sourced from amateur websites, and, experts worry, lending automated systems a dubious sense of authority

    Feeds

    • Activity Streams
    • RSS 2.0
    • Atom
    • Help
    • About
    • FAQ
    • TOS
    • Privacy
    • Source
    • Version
    • Contact

    GNU social JP is a social network, courtesy of GNU social JP管理人. It runs on GNU social, version 2.0.2-dev, available under the GNU Affero General Public License.

    Creative Commons Attribution 3.0 All GNU social JP content and data are available under the Creative Commons Attribution 3.0 license.