They never got VOCALOID to do English well but whatever TTS Neuro-sama is using has cracked it.
The job of a singer is over.
@kalleboo@bitbang.social There are people using Miku's voice to train AI models to make a "better" and more human-sounding Miku that can actually speak English.
I personally think this is heresy. The bad English is part of the Vocaloid aesthetic I say.
@kalleboo@bitbang.social Also I'm pretty sure Neuro-sama just takes the vocals from the original song and puts an (AI-generated) filter on top of them to make it sound like Neuro-sama is singing it.
She can't sing songs from scratch without having a human singer to "learn" from.
Vocaloid is different because you can make songs from scratch with it. Vocaloid is more of an "instrument" than a "filter", if you can even use those words in this context.
@kalleboo@bitbang.social Sometimes you can hear little mistakes or quirks of the original singer in Neuro-sama's cover, which makes it kinda unnatural, because nobody covers a song with that kind of precision, copying the original singer's flaws without adding any of their own.
However, Neuro-sama's singing did take the author a lot of work. You can hear Neuro-sama's voice algorithm being improved with every karaoke video they make. The early ones sound really bad (almost funny) in comparison to this.
@kalleboo@bitbang.social I don't believe most vtubers are just random dudes with voice changers tho, most of them use their real voice.
@SuperDicq Yeah I think the neurosama singing is basically an advanced version of the “make a Donald Trump voice say this thing” voice changers, but that still means that you can replace all the commercially viable conventionally attractive pop singers with cute anime grills backed by some random dude singing in his basement (which is what all vtubers are already anyway am I right)
@kalleboo@bitbang.social I do miss original Neuro-sama singing tho.
It was really buggy and inaccurate to the point of actually being hilarious.
https://yewtu.be/watch?v=Msm_Vasv0kA
@SuperDicq Yeah I was joking lol
@SuperDicq Haha that’s great
@kalleboo@bitbang.social So yeah you can see all the progress that they made getting Neuro to sound natural.