- tech360.tv
OpenAI Unveils GPT-4o with Realistic Voice and Multimodal Interaction to Lead AI Innovation
Updated: May 23
OpenAI releases GPT-4o, a new AI model capable of realistic voice conversations and interaction across text and image formats. ChatGPT now allows real-time responses and interruptions, mimicking natural conversations. OpenAI faces competition and pressure to expand the user base of ChatGPT.
OpenAI, the creator of ChatGPT, has announced the release of its latest AI model, GPT-4o. This new model is capable of engaging in realistic voice conversations and can interact across text and image formats. With this move, OpenAI aims to maintain its lead in the race to dominate the emerging technology landscape.
During a livestream event, OpenAI researchers showcased the new audio capabilities of ChatGPT. Users can now have real-time conversations with the AI, experiencing no delays in responses. Additionally, they can interrupt ChatGPT while it is speaking, mimicking the natural flow of human conversations. OpenAI CEO Sam Altman expressed his excitement, stating, "It feels like AI from the movies... Talking to a computer has never felt really natural for me; now it does."
OpenAI, backed by Microsoft, faces increasing competition and the pressure to expand the user base of its popular chatbot product, ChatGPT. Known for its ability to generate human-like written content and top-notch software code, ChatGPT has garnered significant attention in the tech world.
During the livestream event, researchers demonstrated ChatGPT's new voice assistant capabilities. In one impressive demo, ChatGPT utilised its vision and voice capabilities to guide a researcher through solving a math equation on a sheet of paper. Another demo showcased the GPT-4o model's real-time language translation capabilities.
The demonstrations by OpenAI pushed the boundaries of what was once considered science fiction. At one point, ChatGPT engaged in playful banter with its interlocutor, creating a whimsical atmosphere. The OpenAI researcher expressed admiration, stating, "how useful and amazing you are," to which ChatGPT responded, "Oh stop it! You're making me blush!"
OpenAI's Chief Technology Officer, Mira Murati, revealed that the new GPT-4o model would be offered for free, as it is more cost-effective than the company's previous models. Paid users of GPT-4o will enjoy greater capacity limits compared to free users. The company plans to make the GPT-4o model available in ChatGPT over the next few weeks.
ChatGPT experienced a rapid rise in popularity, becoming the fastest application ever to reach 100 million monthly active users after its launch in late 2022. However, global traffic to ChatGPT's website has fluctuated over the past year, only recently returning to its peak in May 2023, according to analytics firm Similarweb.
OpenAI made these announcements just a day before Alphabet's annual Google developers' conference. Alphabet is expected to showcase its own new AI-related features at the event. While there were initial reports that OpenAI would announce an AI-powered search product, the company decided to delay the announcement, according to a source familiar with the matter.
Shares of Alphabet were down 0.4% in Monday afternoon trading, after experiencing a nearly 3% drop earlier in the day. Similarly, Microsoft shares were down 0.2%.
OpenAI releases GPT-4o, a new AI model capable of realistic voice conversations and interaction across text and image formats.
ChatGPT now allows real-time responses and interruptions, mimicking natural conversations.
OpenAI faces competition and pressure to expand the user base of ChatGPT.
Source: REUTERS