Public
- Public
- Network
- Groups
- Featured
- Popular
- People

Screenshot: The researchers started by assuming that there exists a hypothetical bipartite graph that corresponds to an LLM’s behavior on test data. To explain the change in the LLM’s loss on test data, they imagined a way to use the graph to describe how the LLM gains skills. Take, for instance, the skill “understands irony.” This idea is represented with a skill node, so the researchers look to see what text nodes this skill node connects to. If almost all of these connected text nodes are successful — meaning that the LLM’s predictions on the text represented by these nodes are highly accurate — then the LLM is competent in this particular skill. But if more than a certain fraction of the skill node’s connections go to failed text nodes, then the LLM fails at this skill. Source: https://www.quantamagazine.org/new-theory-suggests-chatbots-can-understand-text-20240122/

Download link

Screenshot: The researchers started by assuming that there exists a hypothetical bipartite graph that corresponds to an LLM’s behavior on test data. To explain the change in the LLM’s loss on test data, they imagined a way to use the graph to describe how the LLM gains skills. Take, for instance, the skill “understands irony.” This idea is represented with a skill node, so the researchers look to see what text nodes this skill node connects to. If almost all of these connected text nodes are successful — meaning that the LLM’s predictions on the text represented by these nodes are highly accurate — then the LLM is competent in this particular skill. But if more than a certain fraction of the skill node’s connections go to failed text nodes, then the LLM fails at this skill. Source: https://www.quantamagazine.org/new-theory-suggests-chatbots-can-understand-text-20240122/
https://cdn.masto.host/daircommunitysocial/media_attachments/files/111/802/007/667/604/512/original/8f750a2c60fd6162.png

Notices where this attachment appears

Embed this notice
Prof. Emily M. Bender(she/her) (emilymbender@dair-community.social)'s status on Tuesday, 23-Jan-2024 23:17:37 JST Prof. Emily M. Bender(she/her)

I feel very vindicated for not making time to answer journalist's queries about papers that "prove" things based on hypothesized graphs and fabricated data.

In conversation about 10 months ago from dair-community.social permalink