I'm working on an article about whether reasoning models' outputs, or "chains of thought," faithfully reflect the models' internal processes. I want to know how researchers evaluate "faithfulness": how can they be sure a chain of thought isn't just a hallucination?
Any resources you could point me towards would be helpful: articles, papers, people to talk to, etc.
(& this is yet another request where boosts would go a long way! 🙏 thank you, thank you!) #genAI #LLM #LLMs