GNU social JP
  • FAQ
  • Login
GNU social JPは日本のGNU socialサーバーです。
Usage/ToS/admin/test/Pleroma FE
  • Public

    • Public
    • Network
    • Groups
    • Featured
    • Popular
    • People

Screenshot: The researchers started by assuming that there exists a hypothetical bipartite graph that corresponds to an LLM’s behavior on test data. To explain the change in the LLM’s loss on test data, they imagined a way to use the graph to describe how the LLM gains skills. Take, for instance, the skill “understands irony.” This idea is represented with a skill node, so the researchers look to see what text nodes this skill node connects to. If almost all of these connected text nodes are successful — meaning that the LLM’s predictions on the text represented by these nodes are highly accurate — then the LLM is competent in this particular skill. But if more than a certain fraction of the skill node’s connections go to failed text nodes, then the LLM fails at this skill. Source: https://www.quantamagazine.org/new-theory-suggests-chatbots-can-understand-text-20240122/

Download link

https://cdn.masto.host/daircommunitysocial/media_attachments/files/111/802/007/667/604/512/original/8f750a2c60fd6162.png

Notices where this attachment appears

  1. Embed this notice
    Prof. Emily M. Bender(she/her) (emilymbender@dair-community.social)'s status on Tuesday, 23-Jan-2024 23:17:37 JST Prof. Emily M. Bender(she/her) Prof. Emily M. Bender(she/her)

    I feel very vindicated for not making time to answer journalist's queries about papers that "prove" things based on hypothesized graphs and fabricated data.

    In conversation Tuesday, 23-Jan-2024 23:17:37 JST from dair-community.social permalink
  • Help
  • About
  • FAQ
  • TOS
  • Privacy
  • Source
  • Version
  • Contact

GNU social JP is a social network, courtesy of GNU social JP管理人. It runs on GNU social, version 2.0.2-dev, available under the GNU Affero General Public License.

Creative Commons Attribution 3.0 All GNU social JP content and data are available under the Creative Commons Attribution 3.0 license.