@hipsterelectron Yeah, like, I’d say that “making stuff up” is a category error; it’s not a question of invention or imagination but of… slavish inherent devotion to training data and syntactical/semantic patterns, without regard for pragmatics or any more complex or nuanced meaning that’s more than skin deep. Thus “shortening” rather than “summarizing”: it won’t simply invent unrelated text, but there is no assurance that what it produces is in fact “the theme” or the important elements at all.
@hipsterelectron the best description I’ve ever managed is that LLMs are really good at *shortening*, but not *summarizing*. The more neutral, factual, and verbose the original, the more useful that is — but that’s a heck of a caveat.
@hipsterelectron @ireneista @adrienne the PDF.js project is actually an interesting one to experiment with; among other things it handles a lot of the document-level stuff, and lets you hook in your own logic to manage the page-by-page conversion of a PDF to text: “here’s a pile of text and graphic objects with metadata about each one, feel free to iterate over them and give us back a string when you’re done!” etc.
@ireneista @hipsterelectron @adrienne yyyyyup. PDF really truly is a standard meant to reproduce a visual design; PDF to text, even without OCR, uses wild techniques like “get the XY coordinates of every word on the page and extrapolate ‘sentences’ using hope and heuristics”
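For the curious, here’s a rough sketch of what the “hope and heuristics” step can look like. It is not how any particular tool does it: the `Word` shape, the `wordsToLines` name, and the `yTolerance` default are all made up for illustration. As far as I understand it, PDF.js’s `getTextContent()` gives you something similar, with the text in `str` and the position tucked into a `transform` matrix, but treat that mapping as an assumption.

```typescript
// Hypothetical shape: one positioned word, as a PDF text extractor might report it.
interface Word {
  text: string;
  x: number; // left edge, in page units
  y: number; // baseline, in page units (PDF origin is bottom-left, so larger y is higher up)
}

// Group words into lines by baseline proximity, then read each line left to right.
// yTolerance is an assumed tuning knob, not a value taken from any real tool.
function wordsToLines(words: Word[], yTolerance: number = 2): string[] {
  // Visit words from the top of the page to the bottom.
  const sorted = [...words].sort((a, b) => b.y - a.y);

  const lines: Word[][] = [];
  let current: Word[] = [];
  let currentY = Number.NaN; // baseline of the first word in the current line

  for (const w of sorted) {
    if (current.length > 0 && Math.abs(currentY - w.y) <= yTolerance) {
      // Close enough to the current baseline: assume it is the same line.
      current.push(w);
    } else {
      // Otherwise start a new line anchored at this word's baseline.
      current = [w];
      currentY = w.y;
      lines.push(current);
    }
  }

  // Within each line, order words by x and join them with single spaces.
  return lines.map((line) =>
    [...line]
      .sort((a, b) => a.x - b.x)
      .map((w) => w.text)
      .join(" ")
  );
}

const sample: Word[] = [
  { text: "world", x: 50, y: 700 },
  { text: "Hello", x: 10, y: 701 },
  { text: "line", x: 40, y: 680 },
  { text: "Second", x: 10, y: 680 },
];

console.log(wordsToLines(sample));
```

The “hope” part: this happily interleaves the columns of a two-column page, because two words at the same height look like the same line no matter how far apart they sit horizontally.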
This is the kind of article that’s incredibly rewarding to both create and consume: it breaks down multiple complex problems, explains the pros and cons of different paths forward, and advocates without presenting a straw-man of the other options. That is a really impressive achievement. https://front-end.social/@jensimmons/113346886761140404
One of the wonderful things about a career in open source is that I’ve crossed paths with such a diverse pool of fascinating people whose needs made them part of a particular tool’s community. Then, over the years, I get to watch the amazingly cool stuff they go on to create.
@inquiline in one community i frequent, the running joke is, “if I received a phone call from isaac chotiner, i would simply fill my pockets with stones and walk into the sea”