Today I used #Emacs Lisp to parse Deepgram's #speech recognition JSON output with utterances, punctuation, and smart format turned on and the #Whisper Large model selected. I turned the words array into a VTT subtitle file with speaker identification (handy for EmacsConf Q&A) and captions limited to roughly 45 characters with punctuation preferred for splitting. It's way faster than waiting for a CPU-only computer to run Whisper Large on the files. Looking forward to experimenting with this for my personal braindumping too.
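The caption-splitting step might look something like this minimal Emacs Lisp sketch (not the actual code, just an illustration of the idea). It assumes each entry of the words array has been parsed into an alist with a `punctuated_word` key, as Deepgram returns when punctuation is enabled and the JSON is read with something like `json-parse-buffer` using `:object-type 'alist`; the function name and the exact splitting heuristic are mine:

```elisp
;; Sketch: group Deepgram word entries into captions of roughly
;; MAX-LENGTH characters, preferring to break after punctuation.
;; Assumes each WORD is an alist containing `punctuated_word'.
(defun my/group-words-into-captions (words &optional max-length)
  "Group WORDS into caption-sized lists of at most MAX-LENGTH characters.
Prefer ending a caption at a word that ends with punctuation."
  (let ((max-length (or max-length 45))
        captions current (len 0))
    (dolist (word words)
      (let* ((text (alist-get 'punctuated_word word))
             ;; Account for the space between words within a caption.
             (new-len (+ len (length text) (if current 1 0))))
        ;; Start a new caption when adding this word would overflow.
        (when (and current (> new-len max-length))
          (push (nreverse current) captions)
          (setq current nil len 0 new-len (length text)))
        (push word current)
        (setq len new-len)
        ;; Prefer splitting right after sentence-ending punctuation.
        (when (string-match-p "[.?!]\\'" text)
          (push (nreverse current) captions)
          (setq current nil len 0))))
    (when current (push (nreverse current) captions))
    (nreverse captions)))
```

Each resulting group can then be turned into one VTT cue, taking the `start` time of its first word and the `end` time of its last, plus the speaker number for identification.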