Your voice could be digitally cloned and used to impersonate you, thanks to a creepy new AI called VALL-E.
AI has unveiled an artificial intelligence system capable of mimicking any human voice based on just three seconds of audio.
It can then be used to turn any written text into speech, making it possible for someone to put words in your mouth using the tool.
It’s even designed to recreate the ’emotional range’ and pacing of the speaker, making it a hyper accurate form of mimicry.
Del, a videogame artist at ‘Last of Us’ creators Naughty Facebook., explained: “Using a 3-second sample of human speech, [VALL-E] can generate super-high-quality text-to-speech from the same voice.
“Even emotional range and acoustic environment of the sample data can be reproduced.”
Del added that it could affect the future of audiobooks. “At the moment, VALL-E can only read, not necessarily PERFORM with the emotional, tonal and pacing range of a voice actor. However, much of the audiobook industry relies on a lot of junior voice actor talent that will undoubtedly feel the brunt of this first.”
VALL-E has certainly ruffled a few feathers online. Twitter user Kevin Nash said: “This is terrifying thinking about scam callers getting their hands on this.”
Another user, Christina Kraus, wrote: “What use does this even have except for scam and impersonation purposes? Why don’t we focus on AI where it actually helps humanity? Why are we getting AI image generators and voice imitation? That’s literally the last thing we need.”
However, the tool could prove very useful in a range of contexts. People who lose the ability of speech—such as the late Stephen Hawking, who was unable to talk due to Motor Neurone Disease—could use the AI system to create replicas of their own voices in order to continue communicating with the world.