
OpenAI releases Voice Engine text-to-speech model

Published March 29, 2024

OpenAI today released Voice Engine, a text-to-speech model that generates natural-sounding speech from text input.

The company has released a small-scale preview of the model to show how closely the generated speech resembles the original speaker.

The company says the small model can create emotive and realistic voices from a single 15-second audio sample. However, the generated samples still sound like they need some polish.

OpenAI first worked on Voice Engine in late 2022 and has integrated the model into its text-to-speech API and ChatGPT’s voice and read-aloud features. Here are a few objectives the company hopes to accomplish with the new model:

Reading assistance
Content translation
Supporting people who are non-verbal
Helping patients recover their voice

The Voice Engine model is in an early phase of testing with a small group of internal partners. The preview is not yet available to a wider range of testers.

A few improvements are expected in future builds of Voice Engine. You can listen to all of the speech samples on OpenAI’s official website, linked below.

(source)


Sophia Garner

Sophia says technology is raising the bar of human living, and she actively promotes awareness of the latest changes in social media platforms. She believes social media has the power to make many positive impacts and continuously shares the latest updates with fellow readers. In her spare time, she likes to tag along with friends for a walk.

EONMSK News
