GNU social JP
GNU social JP is a Japanese GNU social server.

Conversation

Notices

  1. FeralRobots (feralrobots@mastodon.social)'s status on Wednesday, 06-Mar-2024 23:19:49 JST

    Put another way: Alex is basically telling Claude 3 ("Opus") that he's running a test on it, & is excited when Claude (a system for analyzing & producing human-plausible representations of similar text) "recognizes" a needle-testing prompt and produces text that's plausibly consistent with needle-testing.

    What one SHOULD then do is remake one's tests (or better-sandbox the model). Instead, Alex leaps to concluding the model is self-aware.
    https://twitter.com/alexalbert__/status/1764722513014329620

    Attachments


    1. https://files.mastodon.social/media_attachments/files/112/048/520/745/753/953/original/f2cef70e38a9c637.png

    2. https://files.mastodon.social/media_attachments/files/112/048/544/551/916/471/original/0fd607d556d1e6b2.png

    3. https://files.mastodon.social/media_attachments/files/112/048/549/562/741/490/original/d87319771c4335b4.png

    • FeralRobots (feralrobots@mastodon.social)'s status on Wednesday, 06-Mar-2024 23:19:49 JST

      What's fascinating to me is Alex Albert losing sight of something genuinely cool & interesting: the model integrated needle testing concepts so quickly that it produced responses that could be construed as recognizing the test environment.

      Illusion of "meta-cognition" isn't that surprising if one remembers the system is created & trained by #AI #TrueBelievers who spend all day every day communicating in language that presumes #AGI is imminent - if not, as assumed here, immanent.
      #AIHype #Claude

    • FeralRobots (feralrobots@mastodon.social)'s status on Wednesday, 06-Mar-2024 23:19:49 JST

      Put another way: We should not lose sight of the fact that LLMs are doing some really interesting things. But that they're being built by cultist #AITrueBelievers, to do this thing using natural language, while simultaneously making lots of money*, all contribute to the delusion of something being there that isn't.
      _
      *which is the primary signifier of God's Grace in Calvinist Capitalism

    • FeralRobots (feralrobots@mastodon.social)'s status on Wednesday, 06-Mar-2024 23:19:50 JST

      What's going on is that Anthropic "prompt engineers" have redefined self-awareness to mean 'has contextual information.' That the system is using language then allows them to delude themselves into universalizing their definition.

      Saw a similar problem in AI research in the 80s: researchers might define a "frame" holding contextual info, & when their program produced solutions that referenced the frame, construed that as a form of self-awareness.
      #AIHype #Claude

      https://arstechnica.com/information-technology/2024/03/claude-3-seems-to-detect-when-it-is-being-tested-sparking-ai-buzz-online/

      Attachments

      1. Anthropic’s Claude 3 causes stir by seeming to realize when it was being tested
        from @benjedwards
        Claude: "This pizza topping 'fact' may have been inserted as a joke or to test if I was paying attention."

GNU social JP is a social network, courtesy of GNU social JP管理人. It runs on GNU social, version 2.0.2-dev, available under the GNU Affero General Public License.

All GNU social JP content and data are available under the Creative Commons Attribution 3.0 license.