Some Paying Users Can Now Use ChatGPT’s Advanced Voice Mode, Says OpenAI

August 9, 2024
ChatGPT’s Advanced Voice Mode
682
Views

OpenAI has begun rolling out ChatGPT’s Advanced Voice Mode, giving users their first experience with GPT-4o’s hyperrealistic audio responses. Initially available to a select group of ChatGPT Plus users, the feature will gradually extend to all Plus users by fall of 2024. When OpenAI first showcased GPT-4o’s voice in May, it impressed audiences with its quick responses and realistic human-like voice, which resembled that of actress Scarlett Johansson. Johansson later denied involvement and hired legal counsel after OpenAI’s demo. OpenAI has since removed the voice from its demo and delayed the feature’s release to improve safety measures.

While the voice feature is now being released in alpha, the video and screen-sharing capabilities presented during OpenAI’s Spring Update will launch at a later date. For now, some premium users will experience the advanced voice feature.

Although you may have already tried the Voice Mode that ChatGPT currently offers, Advanced Voice Mode, according to OpenAI, is different. The previous version of ChatGPT’s audio solution used three different models: one for voice-to-text conversion, another for GPT-4 prompt processing, and a third for ChatGPT text-to-voice conversion. However, since GPT-4o is multimodal, it can handle these tasks on its own without the assistance of auxiliary models, resulting in conversations with much-reduced latency. GPT-4o is also said by OpenAI to be able to recognize emotional intonations in your voice, such as singing, melancholy, or joy.

Users of ChatGPT Plus will be able to witness firsthand the hyperrealism of OpenAI’s Advanced Voice Mode in this pilot. OpenAI is gradually releasing ChatGPT’s new voice feature to monitor its usage closely. Alpha group members will receive an alert in the ChatGPT app, followed by an email with usage instructions. Since OpenAI’s demo, the company has tested GPT-4o’s voice capabilities with over 100 external red teamers speaking 45 different languages. A report on these safety efforts is expected in early August.

OpenAI’s Advanced Voice Mode will be limited to four preset voices – Juniper, Breeze, Cove, and Ember – created in collaboration with paid voice actors. The Sky voice from OpenAI’s May demo is no longer available. According to spokesperson Lindsay McCallum, ChatGPT is designed to prevent the impersonation of other people’s voices, including those of both individuals and public figures, and will restrict outputs to the preset voices. OpenAI, thus, aims to prevent deepfake controversies.

Article Tags:
· · · · ·
Article Categories:
Tech News

Leave a Reply

Your email address will not be published. Required fields are marked *

The maximum upload file size: 256 MB. You can upload: image, audio, video, document, spreadsheet, interactive, text, archive, code, other. Links to YouTube, Facebook, Twitter and other services inserted in the comment text will be automatically embedded. Drop file here