this post was submitted on 08 Nov 2023
83 points (87.4% liked)

Showerthoughts

29642 readers
719 users here now

A "Showerthought" is a simple term used to describe the thoughts that pop into your head while you're doing everyday things like taking a shower, driving, or just daydreaming. The best ones are thoughts that many people can relate to and they find something funny or interesting in regular stuff.

Rules

  1. All posts must be showerthoughts
  2. The entire showerthought must be in the title
  3. Avoid politics (NEW RULE as of 5 Nov 2024, trying it out)
  4. Posts must be original/unique
  5. Adhere to Lemmy's Code of Conduct

founded 1 year ago
MODERATORS
top 20 comments
sorted by: hot top controversial new old
[–] [email protected] 54 points 1 year ago (6 children)

Vocaloids were invented in 2000, with commercial release in 2004. Human singers aren't extinct yet.

It may be possible in the future for a synthetic voice to sound fully human with a full range of emotions. But I believe that human actors and voice actors will still be used because 1) it's easier to explain what to do to a human professional, 2) unions exist and they will push back against it.

Acting is an art. What world is it where robots do art while humans do the tedious manual labor?

[–] [email protected] 21 points 1 year ago (1 children)

I think you're probably right, but a world where robots do art and humans do the tedious manual labor sounds eerily similar to the world we live in. At least, it is not outside the realm of possibility.

[–] [email protected] 7 points 1 year ago

That is the world we currently live in.

Quite some work left to do to achieve a sociaty with universal basic income, if even the technologies developed for the purpose are twisted and used against it.

[–] [email protected] 12 points 1 year ago (2 children)

What world is it where robots do art while humans do the tedious manual labor?

A world where profits are put over people.

[–] [email protected] 6 points 1 year ago (1 children)

Well, I meant it more like "can you imagine how horrible such a world is?" Not just "can you imagine it?"

Because yeah, you barely need to imagine it at all

[–] [email protected] 2 points 1 year ago

Oh. Got it.

[–] [email protected] 5 points 1 year ago

Oh no... I figured it out. Quark never left this timeline when he jumped back to Roswell! We are living in a universe where Quark secretly runs the world! It's the only explanation for this madness!

[–] Sheeple 7 points 1 year ago (1 children)

To add to that. It's actually highly popular for vocaloid songs to be covered by humans.

[–] Duamerthrax 2 points 1 year ago

It's a great way for song composers and lyric writers who don't have the resources or connections to enter the field.

[–] [email protected] 1 points 1 year ago (2 children)

Yeah but vocaloids suck and I've heard ai singing recently that made me double check because they were so good.

[–] [email protected] 3 points 1 year ago* (last edited 1 year ago) (1 children)

Does this suck? To my ears, it doesn't. Not unmistakably human by any stretch, but still pretty good. And that's 9 years ago

And by "AI singing" do you mean "a famous voice overlaid on another singer's performanse" or something closer to text-to-speech (text-to-song)?

[–] [email protected] 1 points 1 year ago

I dont understand the language nor am i familiar with that style so i couldnt really judge.

Im not sure about your second point. I'll keep this in mind and the next example i come across i will come back here to share.

[–] [email protected] 2 points 1 year ago (1 children)

Well, if you talk about the newest AI-powered UTAU voicebanks, that's because the developers finally thought about crossing the streams, and instead of having the singers merely pronounce syllables in several pitches, they used that data (expanded to also include several syllable clusters) to train an AI. Unlike most trained AI models, where the voice samples are recorded from live performances, so they vary in quality and on data points for each individual syllable, these have the full set of voice training data prerecorded by design, so the quality of every possible combination of phonemes is as clear as possible.

[–] [email protected] 1 points 1 year ago (1 children)

That's very interesting. Where can i read more about it?

[–] ArchmageAzor 1 points 1 year ago (1 children)

Vocaloids are far from perfect singers. It's like saying that because abstract art was invented all forms of art in the future would be abstract.

Also, looking at some of the current use of AI voices, there's no doubt it can be used for mainstream VA work https://youtu.be/FigIAAYHoW8?si=16pIkeSmhOwnuGde

[–] [email protected] 3 points 1 year ago* (last edited 1 year ago)

Vocaloids are far from perfect, but they can be damn good in hands of a good producer. Plus, isn't that the original point? "AI VA were invented so soon all VAs will be AI"?

And to produce the example you provided, it required a big voice bank from people who are very experienced in voicework. Top Gear/The Grand Tour have over 200 episodes where the hosts have basically the same characters throughout the show spanning like 20 years. And it still ain't perfect. It's damn good, but there are hiccups here and there.

So to produce a good AI voiceover, you'll need experienced people doing a lot of work. And to get experienced human actors you will need humans acting. Hence, my point

[–] FlyingSquid 1 points 1 year ago

I also think it will likely be quite some time before AI can accurately reproduce the range of emotions a human can. Simple emotional responses, sure, but I'm not so certain about complex ones in the near future.

[–] thallamabond 5 points 1 year ago

Val Kilmer lost his voice to throat cancer and made his own.

https://www.indiewire.com/features/general/val-kilmer-recreated-speaking-voice-ai-algorithm-1234658600/

Also the article points out the controversy in using these, specifically using Anthony Bordains clips to generate things he never said.

[–] Rhynoplaz 1 points 1 year ago

Its already that way on Tictok