Hey #fediverse people, do you know a #CLI tool to extract highlighted text from a PDF or EPUB?
Conversation
📷 🖋 ~hyde (hyde@lazybear.social)'s status on Saturday, 30-Nov-2024 01:36:28 JST 📷 🖋 ~hyde
culpaplex (hp@social.sdf.org)'s status on Saturday, 30-Nov-2024 01:36:27 JST culpaplex @hyde I would probably run pdftops or pdftohtml over it, see if and how they mark up highlighted parts, then extract those with the usual plaintext filters.
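A rough sketch of that approach in Python, assuming Poppler's `pdftohtml` is installed and that its XML output marks highlighted passages in some recognizable way. That second part is not a given: highlights are often stored as annotations and may not appear in the converted text at all. The helper names and the `marker` pattern are hypothetical; the pattern has to be found by inspecting the dump first.

```python
import re
import shutil
import subprocess
import sys


def pdf_to_xml(pdf_path):
    """Dump the PDF as XML text using Poppler's pdftohtml."""
    if shutil.which("pdftohtml") is None:
        raise RuntimeError("pdftohtml (Poppler) was not found on PATH")
    result = subprocess.run(
        # -xml: XML output, -i: skip images, -stdout: write to standard output
        ["pdftohtml", "-xml", "-i", "-stdout", pdf_path],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


def lines_matching(markup, marker):
    """Return the tag-stripped text of every markup line that matches `marker`.

    `marker` is a regular expression; which attribute or tag (if any)
    identifies a highlight is an assumption to verify against the real dump.
    """
    tag = re.compile(r"<[^>]+>")
    hits = []
    for line in markup.splitlines():
        if re.search(marker, line):
            text = tag.sub("", line).strip()
            if text:
                hits.append(text)
    return hits


if __name__ == "__main__" and len(sys.argv) == 3:
    # Usage: python extract.py FILE.pdf 'REGEX'
    for hit in lines_matching(pdf_to_xml(sys.argv[1]), sys.argv[2]):
        print(hit)
```

Running `pdftohtml -xml -i -stdout FILE.pdf` by hand first and comparing a highlighted passage with an unhighlighted one should show whether there is any marker to filter on.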
culpaplex (hp@social.sdf.org)'s status on Saturday, 30-Nov-2024 03:18:33 JST culpaplex @hyde If you have any trouble with it, send me a sample PDF with highlights and I'll fiddle around a bit; I enjoy it.
📷 🖋 ~hyde (hyde@lazybear.social)'s status on Saturday, 30-Nov-2024 03:18:35 JST 📷 🖋 ~hyde @hp thanks I'll try that too