GNU social JP
  • FAQ
  • Login
GNU social JPは日本のGNU socialサーバーです。
Usage/ToS/admin/test/Pleroma FE
  • Public

    • Public
    • Network
    • Groups
    • Featured
    • Popular
    • People

Conversation

Notices

  1. Embed this notice
    13 barn owls in a trenchcoat (hauntedowlbear@eldritch.cafe)'s status on Thursday, 28-Mar-2024 03:01:25 JST 13 barn owls in a trenchcoat 13 barn owls in a trenchcoat

    I have an #accessibility question for #blind and #PartiallySighted users who use text-to-speech tools.

    Has the recent development of machine-learning based synthetic voices such as Piper, Coqui, Mimic3 and Amazon Polly had any useful impact for you, or do you still use e-speak style voices?

    Do the tools you regularly use support "Neural Voices" of the kind mentioned above?

    I've noticed that integration is limited - for example, I can get transcription and reading tool Speech Note (https://flathub.org/apps/net.mkiol.SpeechNote) to read me anything I like in a range of up-to-date voices, but Gnome Orca - my OS-wide screen reader - remains intensely hard to customise.

    But I'm a Linux user, and thus aware that accessibility remains an embarrassingly low priority in practical open source development, even where the component parts exist.

    For example, few of the projects linked by the Piper team (https://github.com/rhasspy/piper?tab=readme-ov-file#people-using-piper) are focused on accessibility.

    In conversation about a year ago from eldritch.cafe permalink

    Attachments

    1. Domain not in remote thumbnail source whitelist: dl.flathub.org
      Install Speech Note on Linux | Flathub
      Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine Translation
    2. Domain not in remote thumbnail source whitelist: opengraph.githubassets.com
      GitHub - rhasspy/piper: A fast, local neural text to speech system
      A fast, local neural text to speech system. Contribute to rhasspy/piper development by creating an account on GitHub.
    • Embed this notice
      13 barn owls in a trenchcoat (hauntedowlbear@eldritch.cafe)'s status on Thursday, 28-Mar-2024 03:09:19 JST 13 barn owls in a trenchcoat 13 barn owls in a trenchcoat
      in reply to

      Holy crap, some awesome person has made a Piper integration for Orca

      https://github.com/rhasspy/piper/issues/285

      In conversation about a year ago permalink
    • Embed this notice
      13 barn owls in a trenchcoat (hauntedowlbear@eldritch.cafe)'s status on Thursday, 28-Mar-2024 03:20:12 JST 13 barn owls in a trenchcoat 13 barn owls in a trenchcoat
      in reply to
      • D.Hamlin.Music

      @dhamlinmusic It's outside the scope of what I'm currently looking at (which is desktop Linux flavoured) but I see that the sherpa-onnx project has been releasing APKs for a few months - I'll give them a go and report back.

      GitHub link:
      https://github.com/k2-fsa/sherpa-onnx

      In conversation about a year ago permalink

      Attachments

      1. Domain not in remote thumbnail source whitelist: opengraph.githubassets.com
        GitHub - k2-fsa/sherpa-onnx: Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift
        Speech-to-text, text-to-speech, and speaker recongition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 serve...
    • Embed this notice
      D.Hamlin.Music (dhamlinmusic@dragonscave.space)'s status on Thursday, 28-Mar-2024 03:20:13 JST D.Hamlin.Music D.Hamlin.Music
      in reply to

      @HauntedOwlbear I would love one of these for #android, the TTS engine options are rather limited.

      In conversation about a year ago permalink
    • Embed this notice
      13 barn owls in a trenchcoat (hauntedowlbear@eldritch.cafe)'s status on Thursday, 28-Mar-2024 03:33:16 JST 13 barn owls in a trenchcoat 13 barn owls in a trenchcoat
      in reply to
      • D.Hamlin.Music

      @dhamlinmusic yeah, a lot of these projects are very much "sideload these six things", which is less than entirely helpful sigh

      In conversation about a year ago permalink
    • Embed this notice
      D.Hamlin.Music (dhamlinmusic@dragonscave.space)'s status on Thursday, 28-Mar-2024 03:33:17 JST D.Hamlin.Music D.Hamlin.Music
      in reply to

      @HauntedOwlbear Ah yeah I’m talking through the store, and knowing for sure up front that it'll be able to be set as engine for #talkback.

      In conversation about a year ago permalink

Feeds

  • Activity Streams
  • RSS 2.0
  • Atom
  • Help
  • About
  • FAQ
  • TOS
  • Privacy
  • Source
  • Version
  • Contact

GNU social JP is a social network, courtesy of GNU social JP管理人. It runs on GNU social, version 2.0.2-dev, available under the GNU Affero General Public License.

Creative Commons Attribution 3.0 All GNU social JP content and data are available under the Creative Commons Attribution 3.0 license.