GNU social JP
GNU social JP is a Japanese GNU social server.

Conversation

Notices

  1. Paul Cantrell (inthehands@hachyderm.io), Friday, 17-May-2024 01:17:03 JST

    This is a perfect case study in how LLMs (don’t) work.

    Please consider carefully what human processes a system like this could actually replace. https://toot.cat/@devopscats/112445057997076822

    • Paul Cantrell (inthehands@hachyderm.io), Friday, 17-May-2024 01:30:10 JST

      It’s perhaps not obvious that in the example above, the LLM •does• actually do something useful! It conveys information about what’s typical: “When people talk about a goat and a boat and a river, there’s usually a cabbage too. Here are words that typically appear in the ‘answer’ position in such a context.”

      What the LLM doesn’t do is actually solve the problem — or even understand the question in any meaningful way. Its answer is garbage. Garbage has clues, like a detective story. But garbage.

    • Ben Judson (shiftingedges@hachyderm.io), Friday, 17-May-2024 01:36:19 JST, in reply to devopscats

      @inthehands @devopscats when I first saw this yesterday, I tried it myself and ChatGPT actually gave a great response (essentially “just take the goat across, since there is no cabbage or wolf constraint”). Just now I tried again, and got a different response, somewhere between the two in logical soundness. This raises another issue with automations based on these systems. You can test it, and get a perfectly reasonable output, then expose it to users and get garbage. You can never be sure.

      Attachment: https://media.hachyderm.io/media_attachments/files/112/451/738/860/669/807/original/6359680913ce839d.jpeg
    • Paul Cantrell (inthehands@hachyderm.io), Friday, 17-May-2024 01:38:29 JST

      I’ve noticed developers often express excitement about LLM assistants when working with unfamiliar tools, and express horror about them when working with tools they know well. That pattern repeats in other domains as well.

      It makes sense: “garbage with clues” can be helpful when you’re learning something unfamiliar. It’s truly helpful to hear “When people import [e.g.] Hibernate and say `SessionFactory`, code like this typically appears next.” That’s useful! Also probably wrong!

    • Paul Cantrell (inthehands@hachyderm.io), Friday, 17-May-2024 01:42:01 JST

      Two thoughts:

      1. Folks could design and market these ML tools around the idea of •identifying patterns• (the thing machine learning is actually good at) instead of •providing answers•. Pure fantasy at this point; too much collective investor mania around the wet dream of the magic answer box. Just noting that a better choice is on the table.

    • Paul Cantrell (inthehands@hachyderm.io), Friday, 17-May-2024 01:44:35 JST

      2. CS / software education / developer training and mentorship needs to redouble its emphasis on •critical reading• of existing code, not just producing code. By critical reading, I mean: “What does this code do? Does it •really• do that? What is its context? How can it break? Does it do what we •want• it to do? What •do• we want it to do? What is our goal? Why? Is that really our goal? What is the context of our goal? How can it break?” etc.

    • Suzanne Aldrich (she/her) (suzannealdrich@hachyderm.io), Friday, 17-May-2024 01:49:36 JST

      @inthehands Just had a really good experience using ChatGPT 4.0 to help me learn AWS services and set up an architecture to a particular spec. I didn't run into many nonsensical answers, but I did need to verify everything and check that the produced responses would actually satisfy the requirements. The main error was incorrect order of operations: some commands depended on certain resources already being created. The ML-aided generation of IaC would be a very good use case to focus on.
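
Editorial sketch, not part of the original post: the ordering error described above (commands that depend on resources not yet created) is the sort of thing a dependency graph can check mechanically. This is a minimal illustration only. The resource names and the `DEPENDS_ON` mapping are hypothetical, not taken from the thread, and it assumes Python 3.9+ for the standard-library `graphlib` module.

```python
from graphlib import TopologicalSorter

# Hypothetical resources, each mapped to the resources it depends on.
# These names are illustrative assumptions, not from the post.
DEPENDS_ON = {
    "vpc": set(),
    "subnet": {"vpc"},
    "security_group": {"vpc"},
    "instance": {"subnet", "security_group"},
}


def creation_order(depends_on):
    """Return one valid creation order in which dependencies come first."""
    return list(TopologicalSorter(depends_on).static_order())


def first_ordering_error(commands, depends_on):
    """Return the first (resource, missing_dependency) pair, or None if the order is valid."""
    created = set()
    for resource in commands:
        missing = depends_on.get(resource, set()) - created
        if missing:
            # Report the alphabetically first missing dependency for determinism.
            return resource, sorted(missing)[0]
        created.add(resource)
    return None


if __name__ == "__main__":
    print(creation_order(DEPENDS_ON))
    # A generated script that tries to create the instance too early.
    print(first_ordering_error(["vpc", "instance", "subnet"], DEPENDS_ON))
```

A check like this only catches ordering mistakes. It says nothing about whether the resulting architecture meets the spec, which still needs the human verification the post describes.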

    • Paul Cantrell (inthehands@hachyderm.io), Friday, 17-May-2024 01:57:45 JST, to Matthew Lyon

      @mattly
      I mean, to some extent you’re describing static analysis. And static analysis is both helpful and limited in all the ways that any automated coding assistant will be: what you described requires an understanding of goals, expectations, the larger human systems in which the code will function and the humans who operate in those systems. (“Oh, that’ll never fly because….”) Considering all that requires the social understanding of a human embedded in that larger context.

    • Paul Cantrell (inthehands@hachyderm.io), Friday, 17-May-2024 01:59:26 JST, in reply to Suzanne Aldrich (she/her)

      @suzannealdrich
      Yep. Imagine how much more useful LLM coding assistants would be if they didn’t require us humans to constantly, actively remind ourselves, “don’t trust any of this, it looks like a definitive answer but it’s not, verify everything.” There’s very much a social aspect to how these systems present their output to us.

    • Paul Cantrell (inthehands@hachyderm.io), Friday, 17-May-2024 02:10:09 JST, in reply to 🔏 Matthias Wiesmann

      @thias
      Maybe so. The pendulum does swing back and forth: in 2004, it seemed like all code was going to be written in Java.

      The CS program where I teach does make a concerted point of exposing students to different tools and languages repeatedly, and eventually creating a context where they’re learning them in self-directed and project-driven ways.

    • 🔏 Matthias Wiesmann (thias@mastodon.social), Friday, 17-May-2024 02:10:10 JST

      @inthehands one important aspect of reading code is understanding old/weird languages/systems. I feel the current computing monoculture is a large problem in that respect.

GNU social JP is a social network, courtesy of the GNU social JP administrator. It runs on GNU social, version 2.0.2-dev, available under the GNU Affero General Public License.

All GNU social JP content and data are available under the Creative Commons Attribution 3.0 license.