DeepSeek Enhances AI Model with Significant Token Expansion as Zhipu AI Launches GLM-5

  • Writer: tech360.tv
  • 3 hours ago

Chinese artificial intelligence start-up DeepSeek has upgraded its flagship AI model, adding support for a much larger context window along with more up-to-date knowledge. The upgrade has fueled anticipation for the model's next major release. DeepSeek's chatbot confirmed that its context window has grown from 128,000 tokens to over 1 million, a roughly eightfold increase. The larger window is expected to improve the model's handling of user queries.


Image: DeepSeek. Credit: SCMP

A larger context window allows an AI model to retain and process more information during a single conversation or task. This capability enables the model to engage in more complex reasoning and to work more efficiently with data and code. The upgrade is particularly timely, coinciding with the unveiling of Zhipu AI's next flagship model, GLM-5, which is expected to intensify competition within the AI sector.
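To make the scale of the change concrete, here is a minimal sketch that estimates whether a piece of text fits in each window size and computes the ratio between the two reported figures. The helper names are hypothetical, and the four-characters-per-token ratio is only a rough rule of thumb for English text, not a property of DeepSeek's actual tokenizer.

```python
# Illustrative sketch only. CHARS_PER_TOKEN is an assumed rule of thumb,
# not a figure taken from DeepSeek; real token counts depend on the tokenizer.
CHARS_PER_TOKEN = 4

OLD_WINDOW = 128_000    # tokens, as reported in the article
NEW_WINDOW = 1_000_000  # tokens, lower bound ("over 1 million")


def estimate_tokens(text: str) -> int:
    """Estimate the token count of text from its character length."""
    return -(-len(text) // CHARS_PER_TOKEN)  # ceiling division


def fits(text: str, window: int) -> bool:
    """Return True if the estimated token count fits in the given window."""
    return estimate_tokens(text) <= window


# Ratio between the two reported window sizes.
growth = NEW_WINDOW / OLD_WINDOW

# A hypothetical 2-million-character document (about 500,000 estimated tokens).
document = "x" * 2_000_000

print(f"Growth factor: {growth:.1f}x")
print("Fits old window:", fits(document, OLD_WINDOW))
print("Fits new window:", fits(document, NEW_WINDOW))
```

Under these assumptions, a document of that size would overflow the 128,000-token window but fit comfortably within a 1-million-token one.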


Zhipu AI's GLM-5 offers improved coding and agentic capabilities, which the company attributes to a twofold increase in its parameter count. It also incorporates DeepSeek's Sparse Attention technique, which aims to balance model performance with efficiency. The release highlights how quickly artificial intelligence is advancing, particularly among Chinese tech companies.


As the AI landscape continues to evolve, the enhancements made by DeepSeek and Zhipu AI reflect a broader trend of increasing capabilities and competition among leading firms. The ability to process larger amounts of information not only enhances user experience but also opens up new possibilities for applications in various sectors, from customer service to complex data analysis. The race to develop more sophisticated AI models is heating up, with both companies poised to play pivotal roles in shaping the future of technology.


  • DeepSeek has expanded its AI model's context window from 128,000 tokens to over 1 million, enhancing its information processing capabilities.

  • The upgrade allows for more complex reasoning and improved handling of human queries.

  • Zhipu AI's GLM-5 model features increased parameters and incorporates DeepSeek's Sparse Attention technique, boosting its performance and efficiency.

  • The advancements signal intensified competition in the AI sector among Chinese tech firms.

As technology advances and has a greater impact on our lives than ever before, being informed is the only way to keep up.  Through our product reviews and news articles, we want to be able to aid our readers in doing so. All of our reviews are carefully written, offer unique insights and critiques, and provide trustworthy recommendations. Our news stories are sourced from trustworthy sources, fact-checked by our team, and presented with the help of AI to make them easier to comprehend for our readers. If you notice any errors in our product reviews or news stories, please email us at editorial@tech360.tv.  Your input will be important in ensuring that our articles are accurate for all of our readers.

Tech360tv is Singapore's Tech News and Gadget Reviews platform. Join us for our in-depth PC reviews, Smartphone reviews, Audio reviews, Camera reviews and other gadget reviews.

© 2021 tech360.tv. All rights reserved.