top of page

Alibaba Group Holding on Tuesday unveiled a new suite of artificial intelligence models, including Qwen3-Omni, a flagship multimodal system. Developers stated two variants of Qwen3-Omni outperformed OpenAI’s GPT-4o and Google’s Gemini 2.5-Flash in benchmark tests.


Credit: QWEN.AI
Credit: QWEN.AI

The new model rivals OpenAI’s GPT-4o, launched in May 2024, and Google’s popular "Nano Banana" image editor, intensifying competition domestically and internationally. Qwen3-Omni processes text, audio, image, and video inputs, responding with text and audio.



The development team stated on social media that Qwen3-Omni was the first native end-to-end multimodal system to unify text, images, audio, and video in one model. It competes directly with offerings such as OpenAI’s GPT-4o and Google’s Gemini 2.5-Flash, also known as "Nano Banana."


Benchmark tests assessed audio recognition and comprehension, along with image and video understanding. These tests showed Qwen3-Omni variants surpassed their predecessor, Qwen2.5-Omni-7B, as well as GPT-4o and Gemini 2.5-Flash.


Lin Junyang, a researcher on the Qwen team under Alibaba’s cloud unit, attributed the improvements to various foundational audio and image projects. Lin stated, "This year, our audio team has spent great efforts on building large-scale audio data sets for both pretraining and post-training," adding that they "combined everything … to build our Qwen3-Omni."


Diagram of Qwen3-Omni framework with components including MTP Module, MoE Talker, and Vision Encoder. Shows text, icons, and color coding.
Credit: QWEN.AI

Qwen3-Omni supports inputs in 119 text languages and understands 19 spoken languages, including English, Chinese, Japanese, Spanish, Arabic, and Urdu. It can generate spoken responses in 10 languages, among them English, Chinese, French, German, Russian, Italian, Spanish, Portuguese, Japanese, and Korean.


The model’s multilingual and multimodal capabilities extend its utility beyond text-based conversations. According to an Alibaba video demonstration, hardware equipped with Qwen3-Omni, cameras, microphones, and speakers could perceive images and sounds, generating vocal responses.


Three variants of the Qwen3-Omni series are now available on open-source hosting platforms, including Hugging Face and GitHub. Alibaba also launched an updated open-source image tool, Qwen-Image-Edit-2509, on Tuesday.


Additionally, a proprietary speech model, Qwen3-TTS-Flash, was released, available exclusively through the company’s cloud computing platform. The team noted the new image tool improved image consistency during editing, while the speech model could produce "expressive voices" with humanlike timbres and adapt its tone to match input text.


These model releases precede Alibaba Cloud’s annual Apsara Conference, taking place from Wednesday to Friday in Hangzhou, eastern Zhejiang province.

  • Alibaba unveiled Qwen3-Omni, a new multimodal artificial intelligence model.

  • Two Qwen3-Omni variants reportedly outperformed OpenAI’s GPT-4o and Google’s Gemini 2.5-Flash in benchmark tests.

  • Qwen3-Omni processes text, audio, image, and video inputs, and supports 119 text and 19 spoken languages.


Source: SCMP

Google is integrating its Gemini artificial intelligence assistant into televisions, with TCL’s new QM9K series becoming the first models to feature the large language model technology built-in. This marks a significant upgrade from basic voice commands, allowing users to have full, conversational interactions with their television sets.


Smart TV interface showing "Lilo & Stitch" on YouTube. Various app icons at the bottom. Text bubble asks about a new hospital drama.
Credit: GOOGLE

The Gemini AI assistant enables users to articulate preferences for shows or films, request recaps of previous seasons, and inquire about critical reviews. Google provided examples illustrating its capacity for vague, conversational queries, such as, "Find me something to watch with my wife. I like dramas, but she likes lighthearted comedies."



Other examples include asking, "What happened in the last season of ‘Outlander’?", or “What's the new hospital drama everyone's talking about?” Gemini extends beyond entertainment, functioning as a valuable learning and lifestyle tool, according to Google.


On the television screen, Gemini can guide a child through a volcano science project, assist in preparing a last-minute dessert, or offer YouTube tutorials for learning a new skill like guitar. These interactions are designed to feel like chatting with a knowledgeable friend, accommodating natural follow-up questions.


The TCL QM9K is the initial television to launch with Gemini, and it also includes a presence sensor. This sensor can transform the television into an information panel, similar to a Nest Hub, displaying weather, calendar events, and camera feeds when someone enters the room.


Later this year, Gemini will also expand to the Google TV Streamer, Walmart Onn 4K Pro, and Hisense’s 2025 U7, U8, and UX models. Additionally, it will be available on TCL’s QM7K, QM8K, and X11K sets.


Text on a screen explains volcanic eruptions to a child. Below, colorful educational video icons are shown, featuring volcano graphics.
Credit: GOOGLE

With over 300 million devices currently operating Google TV or Android TV OS, this extensive rollout could signify a pivotal moment for voice control on large screens. This extensive rollout could signify a pivotal moment, making voice interaction a genuinely useful feature for daily use.


This development follows the announcement of Gemini for Home, a comprehensive replacement for Google Assistant on Nest speakers and displays. An early access rollout for Gemini for Home is scheduled to begin in October, coinciding with a hardware event slated for Oct. 1.


Gemini for Home will introduce more natural language controls, smarter routines, and proactive automation suggestions across smart home devices. Capabilities range from summarising security camera alerts to compiling lunch recipes based on available refrigerator ingredients.

  • Google Gemini AI assistant is now integrated into televisions, beginning with the TCL QM9K series.

  • Gemini allows for natural, conversational interactions, moving beyond rigid voice commands.

  • It functions as an entertainment guide, a learning aid, and a lifestyle tool on the big screen.


Source: FORBES

Walt Disney announced Tuesday that prices for its Disney+ streaming service in the United States will increase next month. The changes, effective Oct. 21, mark the fourth consecutive year of price adjustments for the platform.


Collage of Disney+ shows/movies including "Snow White", "Captain America", and "LaLiga". Featured banners and logos on a dark blue background.
Credit: DISNEY

The ad-supported Disney+ plan will see a USD 2 increase, bringing its monthly cost to USD 11.99. The ad-free premium tier will rise by USD 3, reaching USD 18.99 per month.


Annual premium subscriptions will also jump by USD 30, with the new cost set at USD 189.99. Bundled packages, which combine Disney+ with Hulu, and ESPN+, are also slated for price increases, according to information on the company's website.


These adjustments are part of Walt Disney’s strategy to bolster profits from its digital platforms and transform its streaming operations into a growth engine. The streaming business achieved profitability for the first time last year.


Disney+ originally launched in Nov. 2019 at a monthly rate of USD 6.99. Previous price hikes included a 38% increase in Dec. 2022, followed by further rises in Oct. 2023, and Oct. 2024.


Man in a suit smiling, holding papers. Vanity mirror with lights in the background. Text: "Jimmy Kimmel Live!" and "ABC" on a blue wall.
Credit: Jimmy Kimmel Live

The company is currently facing heightened public scrutiny. This follows recent controversy over the temporary removal of "Jimmy Kimmel Live!" from ABC, which sparked calls to boycott Disney's services.

  • Disney+ prices in the United States will increase starting Oct. 21.

  • The ad-supported plan will cost USD 11.99 per month, and the ad-free premium tier will be USD 18.99 monthly.

  • Annual premium subscriptions will rise to USD 189.99, and bundled packages will also see increases.


Source: REUTERS

Tech360tv is Singapore's Tech News and Gadget Reviews platform. Join us for our in depth PC reviews, Smartphone reviews, Audio reviews, Camera reviews and other gadget reviews.

  • YouTube
  • Facebook
  • TikTok
  • Instagram
  • Twitter
  • LinkedIn

© 2021 tech360.tv. All rights reserved.

bottom of page