Conversation

Notices

  1. Emma Stamm (emma@assemblag.es)'s status on Saturday, 10-May-2025 02:12:49 JST

    #AI / #ML people here:

    I'm working on an article about whether reasoning models' outputs, or "chains of thought," faithfully reflect their internal processes. I want to know how researchers evaluate "faithfulness." How can they be sure chains of thought aren't hallucinations?

    Any resources you could point me towards would be helpful, including articles, people to talk to, etc.

    (& this is yet another request where boosts would go a long way! 🙏 thank you thank you ) #genAI #LLM #LLMs

    • JP (jplebreton@mastodon.social)'s status on Saturday, 10-May-2025 02:13:02 JST
      in reply to
      • Prof. Emily M. Bender(she/her)

      @emma I don't have a specific link handy, but @emilymbender has probably spelled out at some point exactly what "reasoning" means for these systems. My understanding is that it is just a pattern for building successive understanding-free prompts on the basis of an original human prompt. The main benefit is a slightly longer paper trail that humans can scrutinize, rather than the complete black box of the classic prompt-response loop with a tiny context window.

    • Prof. Emily M. Bender(she/her) (emilymbender@dair-community.social)'s status on Saturday, 10-May-2025 02:13:13 JST
      in reply to
      • JP

      @jplebreton @emma

      Hi Emma & JP, I don't think I have anything in writing, but indeed "chain of thought" is still just repeated responses to "what's a likely next word", based on additional training data that looks like "chain of thought".

      A couple of relevant MAIHT3k episodes:

      https://www.buzzsprout.com/2126417/episodes/16095327-episode-44-openai-s-ridiculous-reasoning-october-28-2024

      https://www.buzzsprout.com/2126417/episodes/15438320-episode-36-about-that-dangerous-capabilities-fanfiction-feat-ali-alkhatib-june-24-2024

GNU social JP is a social network, courtesy of GNU social JP管理人. It runs on GNU social, version 2.0.2-dev, available under the GNU Affero General Public License.

Creative Commons Attribution 3.0 All GNU social JP content and data are available under the Creative Commons Attribution 3.0 license.