Meet Gemini Live: Google’s Rival to ChatGPT’s Voice Mode

August 23, 2024
Gemini Live
363
Views

At its Made By Google event, Google unveiled a range of new Pixel devices. In addition to these hardware announcements, the company highlighted several new artificial intelligence (AI) advancements for its Gemini chatbot. The standout upgrade of the evening was Gemini Live, which introduced voice interaction capabilities to the chatbot. This feature allows users to engage in real-time conversations with the AI without the need to type or read messages. From its description, Gemini Live seems to closely resemble ChatGPT’s recently introduced Advanced Voice Mode made available to select paid subscribers.

At the event, the company debuted Gemini Live as a mobile conversational tool. This feature enables Gemini to engage in fluid and natural voice interactions, complete with varied modulations and emotional tones. The goal is to make the AI’s responses feel almost human. Google emphasized that users will have access to ten distinct voices, each offering unique energy levels, pitches, and tones.

In a recent blog post, the company revealed that Gemini Live will provide a truly hands-free experience. This means the feature can listen to users and offer verbal responses even when the device is locked or running in the background. Google compared it to the convenience of a standard phone call, allowing for seamless interaction without the need to actively engage with the screen. 

The new mode enables users to engage in detailed, interactive conversations with the AI, allowing for deeper exploration of topics and more nuanced responses. Users can provide additional context or ask follow-up questions to refine the AI’s answers. Gemini Live also offers the flexibility to interrupt the AI during its response if more information needs to be added. Furthermore, users can pause the conversation and resume it at a later time, making the interaction more adaptable to their needs. 

First revealed at Google I/O, the Gemini Live feature shares similarities with OpenAI’s ChatGPT Advanced Voice Mode, which was unveiled just a day earlier at OpenAI’s Spring event. Google’s offering stands out with a broader selection of voices and a larger context window—up to one million tokens, with developers having access to two million tokens. Although these features are promising, they are still in the early stages, and widespread availability is pending. Currently, Gemini Live is being rolled out to Gemini Advanced subscribers on Android devices, initially in English. Google plans to add more languages and expand to iOS soon. 

Article Categories:
Tech News

Leave a Reply

Your email address will not be published. Required fields are marked *

The maximum upload file size: 256 MB. You can upload: image, audio, video, document, spreadsheet, interactive, text, archive, code, other. Links to YouTube, Facebook, Twitter and other services inserted in the comment text will be automatically embedded. Drop file here