A.I. Listened To People's Voices, Then Generated Their Faces

Question

A.I. Listened To People's Voices, Then Generated Their Faces

Jack Walker

Have you ever constructed a mental image of a person you've never seen, based solely on their voice? Artificial intelligence (AI) can now do that, generating a digital image of a person's face using only a brief audio clip for reference.

Named Speech2Face, the neural network — a computer that "thinks" in a manner similar to the human brain — was trained by scientists on millions of educational videos from the internet that showed over 100,000 different people talking.

From this dataset, Speech2Face learned associations between vocal cues and certain physical features in a human face, researchers wrote in a new study. The AI then used an audio clip to model a photorealistic face matching the voice. [5 Intriguing Uses for Artificial Intelligence (That Aren't Killer Robots)]

This Plant Eats Salamanders

The carnivorous northern pitcher plants in Ontario, Canada don’t just feast on bugs — they eat vertebrates, too. The carnivorous plant named 'Turtle Socks' has been eating baby salamanders for lunch.

The findings were published online May 23 in the preprint jounral arXiv and have not been peer-reviewed.

Thankfully, AI doesn't (yet) know exactly what a specific individual looks like based on their voice alone. The neural network recognized certain markers in speech that pointed to gender, age and ethnicity, features that are shared by many people, the study authors reported.

"As such, the model will only produce average-looking faces," the scientists wrote. "It will not produce images of specific individuals."

AI has already shown that it can produce uncannily accurate human faces, though its interpretations of cats are frankly a little terrifying.

The faces generated by Speech2Face — all facing front and with neutral expressions — didn't precisely match the people behind the voices. But the images did usually capture the correct age ranges, ethnicities and genders of the individuals, according to the study.

However, the algorithm's interpretations were far from perfect. Speech2Face demonstrated "mixed performance" when confronted with language variations. For example, when the AI listened to an audio clip of an Asian man speaking Chinese, the program produced an image of an Asian face. However, when the same man spoke in English in a different audio clip, the AI generated the face of a white man, the scientists reported.

The algorithm also showed gender bias, associating low-pitched voices with male faces and high-pitched voices with female faces. And because the training dataset represents only educational videos from YouTube, it "does not represent equally the entire world population," the researchers wrote.

Another concern about this video dataset arose when a person who had appeared in a YouTube video was surprised to learn that his likeness had been incorporated into the study, Slate reported. Nick Sullivan, head of cryptography with the internet security company Cloudflare in San Francisco, unexpectedly spotted his face as one of the examples used to train Speech2Face (and which the algorithm had reproduced rather approximately).

Sullivan hadn't consented to appear in the study, but the YouTube videos in this dataset are widely considered to be available for researchers to use without acquiring additional permissions, according to Slate.

livescience.com/65689-ai-human-voice-face.html?utm_source=ls-newsletter&utm_medium=email&utm_campaign=20190611-ls

Attached: 20190616_143805.jpg (1920x1506, 714.09K)

June 16, 2019 - 18:40

Daniel Sullivan

...

June 16, 2019 - 18:44

Tyler Moore

I wondower what AT&T Mike would look like

June 16, 2019 - 18:46

Julian Thompson

wonder*

June 16, 2019 - 18:46

Ryan White

Attached: 20190616_135454.jpg (1920x1176, 218.88K)

June 16, 2019 - 18:47

Mason King

Attached: 20190616_145052.jpg (1556x2560, 1.18M)

June 16, 2019 - 18:51

Jaxon Fisher

Speech2Face is run by the same algorithm as Text2Face, the A.I. software that analyzes peoples choice of words, and generates an image of their face.

Attached: 20190616_150214.jpg (1280x851, 288.28K)

June 16, 2019 - 19:04

Carter Jenkins

Attached: 20190616_151010.jpg (1280x839, 318.63K)

June 16, 2019 - 19:11

Ian Jones

Attached: 20190616_152058.jpg (1280x830, 335.79K)

June 16, 2019 - 19:23

Michael Long

Attached: 20190616_152044.jpg (1280x836, 381.64K)

June 16, 2019 - 19:25

Luis Lewis

it wasn't the wondower typo, but the need to correct right after that shows that the user is a pussy faced wimp who is afraid of the slightest critisicm

(sac)iatrist

June 16, 2019 - 19:51

Caleb Reyes

Attached: 20190616_161440.jpg (1280x680, 328.94K)

June 16, 2019 - 20:15

Robert Morris

*sings*
They call me the wonderer
Yeah, the wonderer
I wonder round and round and round and round

June 16, 2019 - 20:18

Michael Hughes

How the fuck is this possible?

June 16, 2019 - 20:25

John Carter

Attached: 20190616_162929.jpg (1280x830, 350.85K)

June 16, 2019 - 20:30

Ayden Powell

Attached: 20190616_163454.jpg (1280x622, 197.64K)

June 16, 2019 - 20:35

Carson Hall

Attached: 1252356236781.jpg (567x485, 141.46K)

June 16, 2019 - 22:36

Angel Perez

they can even do this with fingerprints and dna

June 16, 2019 - 23:25

Andrew Carter

Wannerful!
Wannerful!

Attached: 510123eaZtL._SX305_BO1,204,203,200_.jpg (1200x1747, 161.86K)

June 17, 2019 - 03:59

Angel Reed

Someone should put Rick Astley through it

June 17, 2019 - 08:56

Dylan Gonzalez

And Michael Jackson

June 17, 2019 - 08:57

Leo Ortiz

just-kiss-it spoke to the AI

Attached: just-kiss-it.gif (499x674, 234.44K)

June 17, 2019 - 20:35

David Morales

/killcen/ bespake unto the AI god

his ears are to catch any news

Attached: killcen.jpg (900x900, 64.2K)

June 17, 2019 - 20:37

Ryan Carter

Andrew McGovern spoke and the AI answered

Attached: george-wallace---segregation-forever.jpg (1200x675, 61.5K)

June 17, 2019 - 21:05

1 2 3 Next

A.I. Listened To People's Voices, Then Generated Their Faces

Last threads