GNU social JP
  • FAQ
  • Login
GNU social JPは日本のGNU socialサーバーです。
Usage/ToS/admin/test/Pleroma FE
  • Public

    • Public
    • Network
    • Groups
    • Featured
    • Popular
    • People

Notices by Colin Gordon (csgordon@discuss.systems)

  1. Embed this notice
    Colin Gordon (csgordon@discuss.systems)'s status on Wednesday, 14-May-2025 11:25:24 JST Colin Gordon Colin Gordon

    When you submit a paper to an ACM journal, it gets run through TurnItIn (yes, really) and the editors in chief have to look at the report and decide if there are plagiarism concerns. Most submissions have a small percentage (~5%) of verbatim-matching text, from a wide variety of sources. The matches are usually small turns of phrase, technical phrases, affiliations, or ACM copyright text 😛 The exceptions are generally extended versions of conference papers, where obviously large chunks of the extension match the original publication.

    But recently I've noticed an up-tick, so far only in the wildly-out-of-scope papers that get desk rejected (mostly papers about using LLMs for NLP) of a high percentage of the paper's text (~30%) being flagged as matching, still from a wide variety of sources, but much larger chunks. A long phrase from here, most of a sentence from there, etc., from very scattered sources across different far-ranging fields. This seems unlikely to be from authors picking up phrases they like from papers they actually encountered. I can't help but think these papers have a high fraction of LLM-generated text, and that LLM-generated text on similar topics tends to output a lot of phrases and sentences repeatedly in aggregate, and these patterns are now getting picked up by traditional plagiarism checkers since there's so much LLM-generated text in the world now.

    In conversation about a month ago from discuss.systems permalink
  2. Embed this notice
    Colin Gordon (csgordon@discuss.systems)'s status on Saturday, 03-May-2025 11:54:47 JST Colin Gordon Colin Gordon
    in reply to
    • Adrianna Tan

    @skinnylatte a friend of mine once informed me that their mother considered *ketchup* to be "spicy"

    In conversation about 2 months ago from discuss.systems permalink
  3. Embed this notice
    Colin Gordon (csgordon@discuss.systems)'s status on Monday, 27-Nov-2023 23:39:14 JST Colin Gordon Colin Gordon

    A new totally bizarre twist in the AI bubble: today I was sent a call for research proposals for a large telecom company, which is only interested in proposals involving AI, and really only generative AI, not other kinds. So if you have an idea that could fundamentally improve their networking, they literally don't want to hear about it if it doesn't use generative AI 🤦

    In conversation Monday, 27-Nov-2023 23:39:14 JST from discuss.systems permalink

User actions

    Colin Gordon

    Colin Gordon

    Programming languages professor, kernel hacker, aspiring linguist (syntax & compositional semantics).Currently figuring out how to combine all of my interests by mechanically translating English into formal specifications of a formally verified OS kernel for RISC-V.:freebsd_logo: :debian: :openbsd: :clang: :csharp: :racket: :rust:

    Tags
    • (None)

    Following 0

      Followers 0

        Groups 0

          Statistics

          User ID
          217993
          Member since
          27 Nov 2023
          Notices
          3
          Daily average
          0

          Feeds

          • Atom
          • Help
          • About
          • FAQ
          • TOS
          • Privacy
          • Source
          • Version
          • Contact

          GNU social JP is a social network, courtesy of GNU social JP管理人. It runs on GNU social, version 2.0.2-dev, available under the GNU Affero General Public License.

          Creative Commons Attribution 3.0 All GNU social JP content and data are available under the Creative Commons Attribution 3.0 license.