Some Paying Users Can Now Use ChatGPT’s Advanced Voice Mode

OpenAI has begun rolling out ChatGPT’s Advanced Voice Mode, giving users their first experience with GPT-4o’s hyperrealistic audio responses. Initially available to a select group of ChatGPT Plus users, the feature will gradually extend to all Plus users by fall of 2024. When OpenAI first showcased GPT-4o’s voice in May, it impressed audiences with its quick responses and realistic human-like voice, which resembled that of actress Scarlett Johansson. Johansson later denied involvement and hired legal counsel after OpenAI’s demo. OpenAI has since removed the voice from its demo and delayed the feature’s release to improve safety measures.

While the voice feature is now being released in alpha, the video and screen-sharing capabilities presented during OpenAI’s Spring Update will launch at a later date. For now, some premium users will experience the advanced voice feature.

Although you may have already tried the Voice Mode that ChatGPT currently offers, Advanced Voice Mode, according to OpenAI, is different. The previous version of ChatGPT’s audio solution used three different models: one for voice-to-text conversion, another for GPT-4 prompt processing, and a third for ChatGPT text-to-voice conversion. However, since GPT-4o is multimodal, it can handle these tasks on its own without the assistance of auxiliary models, resulting in conversations with much-reduced latency. GPT-4o is also said by OpenAI to be able to recognize emotional intonations in your voice, such as singing, melancholy, or joy.

Users of ChatGPT Plus will be able to witness firsthand the hyperrealism of OpenAI’s Advanced Voice Mode in this pilot. OpenAI is gradually releasing ChatGPT’s new voice feature to monitor its usage closely. Alpha group members will receive an alert in the ChatGPT app, followed by an email with usage instructions. Since OpenAI’s demo, the company has tested GPT-4o’s voice capabilities with over 100 external red teamers speaking 45 different languages. A report on these safety efforts is expected in early August.

OpenAI’s Advanced Voice Mode will be limited to four preset voices – Juniper, Breeze, Cove, and Ember – created in collaboration with paid voice actors. The Sky voice from OpenAI’s May demo is no longer available. According to spokesperson Lindsay McCallum, ChatGPT is designed to prevent the impersonation of other people’s voices, including those of both individuals and public figures, and will restrict outputs to the preset voices. OpenAI, thus, aims to prevent deepfake controversies.

Author
Recent Posts

Beema Basheer

Some Paying Users Can Now Use ChatGPT’s Advanced Voice Mode, Says OpenAI

Leave a Reply Cancel reply

Unlearning Trained Data will Impair AI Model Performance

Meta is Letting Users Create Their Own AI Clones on Instagram

Some Paying Users Can Now Use ChatGPT’s Advanced Voice Mode, Says OpenAI

Related Articles

Meta’s AI Video Editor Reimagines Short Videos Instantly

Google Search Gets Smarter with Real-Time Data Visualisation Tools

Bing’s New AI Video Creator Uses OpenAI’s Sora – And It’s Free

Leave a Reply Cancel reply

Unlearning Trained Data will Impair AI Model Performance

Meta is Letting Users Create Their Own AI Clones on Instagram