Train your own software
Key Takeaways
CoWomen / Unsplash
Computer-generated speech might soon sound a lot more human.
The software can also deliver one speaker's words using another person's voice.
It's part of a burgeoning push to make computer speech more realistic.
To make artificial speech sound more natural, NVIDIA's text-to-speech research team developed a RAD-TTS model.
The company used its new model to build more conversational-sounding voice narration for its I Am AI video series.
Soundtrap / Unsplash
Harder Than It Sounds
Making computer-generated speech sound natural is a tricky problem, experts say.
“And the recording must be of high quality, recorded in a professional studio.
The more hours of quality speech loaded and processed, the better the result.”
Intonation, emotion, and musicality are the features that computer voices still lack, Ragimov said.
“That's a work in progress.
Other voices will be able to compete with radio hosts.
Soon you'll see voices that can sing and read audiobooks.”
Speech technology is becoming more popular in a wide range of businesses.
SoundHound's approach combines these two steps into one process to track speech in real time.
NVIDIA said its new AI models go beyond voiceover work.