How AI Could Make Computer Speech More Natural

Train your own software

If you select links we provide, we may receive compensation.

Key Takeaways

CoWomen / Unsplash

Computer-generated speech might soon sound a lot more human.

The software also can deliver one speakers words using another persons voice.

Someone working with a voice recording on a laptop computer.

CoWomen / Unsplash

Its part of a burgeoning push to make computer speech more realistic.

To make artificial speech sound more natural, NVIDIAs text-to-speech research team developed a RAD-TTS model.

The company used its new model to build more conversational-sounding voice narration for its I Am AI video series.

Someone recording voice audio in a home studio.

Soundtrap / Unsplash

Making computer-generated speech sound natural is a tricky problem, experts say.

“And the recording must be of high quality, recorded in a professional studio.

The more hours of quality speech loaded and processed, the better the result.”

Intonation, emotion, and musicality are the features that computer voices still lack, Ragimov said.

“Thats a work in progress.

Other voices will be able to compete with radio hosts.

Soon youll see voices that can sing and read audiobooks.”

Speech technology is becoming more popular in a wide range of businesses.

Soundtrap / Unsplash

SoundHounds approach combines these two steps into one process to track speech in real-time.

NVIDIA said its news AI models go beyond voiceover work.