DeepSeek Enhances AI Model with Significant Token Expansion as Zhipu AI Launches GLM-5

tech360.tv
Feb 12

Chinese artificial intelligence start-up DeepSeek has upgraded its flagship AI model, adding support for a much larger context window along with more up-to-date knowledge. The upgrade has fuelled anticipation for the model's forthcoming major release. DeepSeek's chatbot confirmed that its context window has grown from 128,000 tokens to over 1 million, a roughly eightfold increase. The larger window is expected to improve the AI's ability to handle user queries.

A larger context window allows an AI model to retain and process more information during a single conversation or task. This capability enables the model to engage in more complex reasoning and to work more efficiently with data and code. The upgrade is particularly timely, coinciding with the unveiling of Zhipu AI's next flagship model, GLM-5, which is expected to intensify competition within the AI sector.
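To make the idea of a context window concrete, here is a minimal Python sketch of how a chat application might trim conversation history to fit a token budget. It is an illustration only, not DeepSeek's or Zhipu AI's implementation: the function name fit_to_window is hypothetical, the window sizes are toy values, and tokens are approximated by splitting on whitespace, whereas real models use their own tokenizers.

```python
def fit_to_window(messages, window_tokens, count_tokens=lambda text: len(text.split())):
    """Keep the most recent messages whose combined token count fits the window.

    Assumption: tokens are approximated by whitespace-separated words.
    """
    kept = []
    used = 0
    # Walk backwards so the newest messages are retained first.
    for message in reversed(messages):
        cost = count_tokens(message)
        if used + cost > window_tokens:
            break  # Older messages no longer fit and are dropped.
        kept.append(message)
        used += cost
    kept.reverse()  # Restore chronological order.
    return kept


history = ["alpha beta gamma", "delta epsilon", "zeta eta theta iota"]

# A small window forces the oldest message out of the conversation.
small = fit_to_window(history, window_tokens=6)

# A larger window retains the whole history.
large = fit_to_window(history, window_tokens=48)
```

With the small budget, the oldest message is dropped; with the larger one, the model would see the full history, which is the practical benefit a bigger context window offers.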

Zhipu AI's GLM-5 boasts improved coding and agentic capabilities, attributed to a twofold increase in its parameters. Additionally, it incorporates DeepSeek's innovative Sparse Attention technique, which aims to balance model performance with efficiency. This development highlights the rapid advancements being made in the field of artificial intelligence, particularly among Chinese tech companies.

As the AI landscape continues to evolve, the enhancements made by DeepSeek and Zhipu AI reflect a broader trend of increasing capabilities and competition among leading firms. The ability to process larger amounts of information not only enhances user experience but also opens up new possibilities for applications in various sectors, from customer service to complex data analysis. The race to develop more sophisticated AI models is heating up, with both companies poised to play pivotal roles in shaping the future of technology.

DeepSeek has expanded its AI model's context window from 128,000 tokens to over 1 million, enhancing its information processing capabilities.
The upgrade allows for more complex reasoning and improved handling of human queries.
Zhipu AI's GLM-5 model features increased parameters and incorporates DeepSeek's Sparse Attention technique, boosting its performance and efficiency.
The advancements signal intensified competition in the AI sector among Chinese tech firms.