top of page

A new artificial intelligence (AI) model training technique, co-authored by DeepSeek founder Liang Wenfeng and Peking University researchers, aims to bypass graphics processing unit (GPU) memory constraints. This development underscores the Hangzhou start-up’s focus on maximising cost efficiency amid a computational power deficit compared to US industry leaders.


Credit: DEEPSEEK
Credit: DEEPSEEK

The technical paper introduces a "conditional memory" technique called Engram, designed to address a key bottleneck in scaling AI models: the limited capacity of GPU high-bandwidth memory (HBM). Existing large language models (LLMs) often waste valuable sequential depth on trivial operations, which could be used for higher-level reasoning.


HBM represents one of China’s biggest AI hardware gaps with the US. According to Ray Wang, a Seoul-based analyst at SemiAnalysis, China’s memory champion ChangXin Memory Technologies (CXMT) lags several years behind industry leaders such as South Korea’s Samsung Electronics, SK Hynix, and Micron Technology of the US.


Engram addresses this by "decoupling" compute and memory, enabling models to "look up" basic information more efficiently. This new technique also purports to improve models’ efficiency in handling long context, a significant challenge for turning AI chatbots into useful AI agents.


Researchers validated Engram in a 27 billion parameter model, finding it boosted performance on major industry benchmarks by several percentage points. Crucially, it also left more capacity for the model to perform computationally demanding complex reasoning.


They envision conditional memory as "an indispensable modelling primitive for next-generation sparse models," likening its potential impact to their own variant of the Mixture-of-Experts technique. This technique allowed scaling of model size without proportional increases in compute.


DeepSeek has been a poster child for China’s AI innovation over the past year. Elie Bakouch, a research engineer at open-source developer platform Hugging Face, praised the paper for validating the technique "with hardware at inference and training."


The paper lists 14 co-authors, including Huishuai Zhang, an assistant professor of computer science at Peking University and a former researcher at Microsoft Research Asia. The lead author is Cheng Xin, a Peking University student who also contributed to DeepSeek’s landmark V3 and R1 models.


Credit: BLOOMBERG
Credit: BLOOMBERG

Anticipation is high for a new major DeepSeek model on the one-year anniversary of the release of its R1 model. US tech media outlet The Information reported on Friday that DeepSeek was expected to launch a new V4 model with strong coding capabilities in mid-February.

  • DeepSeek founder Liang Wenfeng co-authored a paper proposing Engram, an AI training technique.

  • Engram aims to bypass GPU memory constraints and improve efficiency in handling long context.

  • The technique was validated in a 27 billion parameter model, showing performance boosts and more capacity for complex reasoning.


Source: SCMP

AstraZeneca will acquire Boston-based Modella AI to accelerate oncology drug research and the discovery of new medicines. The companies did not disclose the financial terms of the agreement.


Credit: AstraZeneca
Credit: AstraZeneca

Modella AI's foundation models and artificial intelligence agents will integrate into oncology research and development. This integration aims to support clinical development and biomarker discovery.


Gabi Raia, Modella AI’s chief commercial officer, stated that oncology drug development is becoming more complex, data-rich, and time-sensitive. Raia added that joining AstraZeneca would allow Modella to deploy its tools in global trials and clinical settings.


AstraZeneca announced this marks the first acquisition of an artificial intelligence firm by a major pharmaceutical organisation. This move aligns with the drug industry’s increasing use of AI to speed up drug discovery.


Aradhana Sarin, AstraZeneca chief financial officer, said the acquisition would “supercharge” the company’s quantitative pathology and biomarker discovery efforts. Sarin explained that this brings more data and AI capabilities in-house for AstraZeneca.


This deal is among several pacts between major drug firms and AI companies announced at a healthcare conference. Another notable collaboration involves Nvidia and Eli Lilly, a USD 1 billion partnership to build a new research laboratory using Nvidia’s latest-generation AI chips.


Credit: Modella AI
Credit: Modella AI

Modella will accelerate AstraZeneca’s efforts to make pathology more quantitative. This involves using computers to analyse biopsies for relevant proteins and correlate them with clinical data.


Sarin noted this process allows AstraZeneca to develop “highly targeted biomarkers and then highly targeted therapeutics.” The current acquisition expands on a multi-year collaboration the companies unveiled in July.


Sarin described that prior partnership as a “test drive.” She added that AstraZeneca ultimately sought to bring Modella’s data, foundation models, and AI talent in-house.


AI tools can also be used to select patients for drug trials more rapidly. This could potentially increase the odds of clinical success and reduce related costs.


Separately, Sarin stated that 2026 is anticipated to be another catalyst year for AstraZeneca. Multiple high-value late-stage readouts are expected across various therapy areas. The company is currently working towards a target of USD 80 billion in annual revenue by 2030.

  • AstraZeneca is acquiring Boston-based Modella AI to accelerate oncology drug research.

  • Modella AI’s technology will integrate into AstraZeneca’s oncology research and development to support biomarker discovery and clinical development.

  • AstraZeneca Chief Financial Officer Aradhana Sarin highlighted the acquisition will boost quantitative pathology and bring AI capabilities in-house.


Source: REUTERS

Apple has introduced Apple Creator Studio, a new subscription bundle of professional creative software. This initiative aims to boost the company's services business, targeting creators, students, and professionals.


Credit: APPLE
Credit: APPLE

The subscription is priced at USD 12.99 monthly or USD 129 annually. It is scheduled to launch Jan. 28 on the App Store.


Apple Creator Studio includes well-known creative tools such as Final Cut Pro, Logic Pro, and Pixelmator Pro. These applications will be available across Mac and iPad devices.


Credit: APPLE
Credit: APPLE

The package also incorporates premium content and new artificial intelligence-powered features into Apple's productivity applications, Keynote, Pages, and Numbers. The digital whiteboarding application Freeform will gain enhanced features at a later date.


Final Cut Pro will feature new tools for faster video editing, including transcript-based search, visual search, and beat detection. Logic Pro introduces AI-powered capabilities like Synth Player and Chord ID to assist with music creation.



Credit: APPLE
Credit: APPLE

Pixelmator Pro, a Photoshop alternative, will be available on iPad for the first time. It will also offer Apple Pencil support.



Credit: APPLE
Credit: APPLE

Apple has utilised its services business, which encompasses Apple Music and iCloud, to drive recent growth. This strategy helps offset slower hardware growth and generates consistent recurring revenue.

  • Apple introduced Apple Creator Studio, a new subscription bundle for creative software.

  • The service costs USD 12.99 monthly or USD 129 annually.

  • Bundled applications include Final Cut Pro, Logic Pro, and Pixelmator Pro; Logic Pro features new AI capabilities.


Source: REUTERS

Tech360tv is Singapore's Tech News and Gadget Reviews platform. Join us for our in depth PC reviews, Smartphone reviews, Audio reviews, Camera reviews and other gadget reviews.

  • YouTube
  • Facebook
  • TikTok
  • Instagram
  • Twitter
  • LinkedIn

© 2021 tech360.tv. All rights reserved.

bottom of page