GNU social JP
GNU social JP is a GNU social server in Japan.

Conversation

Notices

  1. Peter Kaminski (peterkaminski@mastodon.social), Tuesday, 17-Dec-2024 07:34:18 JST

    This seems like a big deal.

    Meta AI proposes "Large Concept Models" to complement "Large Language Models".

    (i.e., reasoning in an embedding space of concepts rather than words)

    https://github.com/facebookresearch/large_concept_model

    https://arxiv.org/abs/2412.08821

    popularized article: https://www.marktechpost.com/2024/12/15/meta-ai-proposes-large-concept-models-lcms-a-semantic-leap-beyond-token-based-language-modeling/

    • John Abbe (aka Slow) (slowenough@mastodon.social), Tuesday, 17-Dec-2024 07:34:17 JST

      @peterkaminski (There are hierarchies among our ways of thinking, sure. But ultimately they are not clean or coherent hierarchies; that is, they branch in all kinds of weird ways that defy exclusively top-down control, organization, or even understanding.)

    • anderbill (band@hachyderm.io), Tuesday, 17-Dec-2024 07:34:17 JST, in reply to John Abbe (aka Slow)

      @slowenough @peterkaminski and that resistance to coherence and understanding is very good.

    • John Abbe (aka Slow) (slowenough@mastodon.social), Tuesday, 17-Dec-2024 07:34:18 JST

      @peterkaminski "LCMs employ a hierarchical structure, mirroring human reasoning processes"

      Not a good sign if they think our reasoning is hierarchical.

      "reduces sequence length compared to token-level processing, addressing the quadratic complexity of standard Transformers and enabling more efficient handling of long contexts"

      Now seeing "concepts" as a kind of compression (of strings of tokens), which I've seen articulated before as a way of understanding much of what's happening with LLMs.

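
The quoted claim about sequence length and quadratic complexity can be illustrated with a little arithmetic. The following is a minimal sketch, not anything taken from the linked paper or repository: it assumes full self-attention compares every position with every other position, and it assumes one concept stands in for a fixed-size group of tokens. The function names, the 2000-token document, and the 20 tokens per concept are all hypothetical values chosen for illustration.

```python
def attention_pairs(seq_len: int) -> int:
    """Count query-key pairs in full self-attention over seq_len positions."""
    return seq_len * seq_len


def concept_sequence_length(num_tokens: int, tokens_per_concept: int) -> int:
    """Sequence length after grouping tokens into concepts (ceiling division)."""
    return -(-num_tokens // tokens_per_concept)


# Hypothetical numbers, chosen only to make the arithmetic easy to follow.
num_tokens = 2000          # assumed document length in tokens
tokens_per_concept = 20    # assumed average tokens per concept

num_concepts = concept_sequence_length(num_tokens, tokens_per_concept)
token_cost = attention_pairs(num_tokens)
concept_cost = attention_pairs(num_concepts)

print(num_concepts)                 # 100
print(token_cost // concept_cost)   # 400
```

Under these assumptions, a 20x shorter sequence gives a 400x reduction in attention pairs, which is the sense in which "concepts" act as a compression of token strings. The sketch ignores the cost of encoding tokens into concepts and decoding them back, so it is likely an upper bound on any real saving.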


GNU social JP is a social network, courtesy of the GNU social JP administrator (GNU social JP管理人). It runs on GNU social, version 2.0.2-dev, available under the GNU Affero General Public License.

All GNU social JP content and data are available under the Creative Commons Attribution 3.0 license.