By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
World of SoftwareWorld of SoftwareWorld of Software
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Search
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
Reading: DeepSeek shows enterprises model distillation opportunity | Computer Weekly
Share
Sign In
Notification Show More
Font ResizerAa
World of SoftwareWorld of Software
Font ResizerAa
  • Software
  • Mobile
  • Computing
  • Gadget
  • Gaming
  • Videos
Search
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Have an existing account? Sign In
Follow US
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
World of Software > News > DeepSeek shows enterprises model distillation opportunity | Computer Weekly
News

DeepSeek shows enterprises model distillation opportunity | Computer Weekly

News Room
Last updated: 2025/08/08 at 5:33 AM
News Room Published 8 August 2025
Share
SHARE

Model distillation is one of the technology trends that has reached a level of maturity identified in Gartner’s 2025 Hype Cycle for artificial intelligence (AI) as “the slope of enlightenment”.

However, while it was recently put into the spotlight at the start of the year with China’s DeepSeek demonstrating how model distillation can be used to train a large language model (LLM) that rivals models from OpenAI, it is not a new development, with Haritha Khandabattu, senior director analyst at Gartner, saying: “I was actually researching model distillation in 2017.”

In fact, the technique dates back to the 2006 Cornell university Model compression paper by Cristian Bucilă, Rich Caruana and Alexandru Niculescu-Mizil. Nine years later, in 2015, Cornell university’s Distilling the knowledge in a neural network paper by Geoffery Hinton, Oriol Vinyals and Jeff Dean used the term distillation to describe a technique to improve the performance of AI models.  

Although it is not considered a new technological development by Gartner, Khandabattu said: “Model distillation has been re-emphasised. The foundation models are compute hungry and extremely expensive to run, and enterprises have started asking how they can get 80% of the performance at 10% of the cost.”

She said DeepSeek has led to a downward pricing trend for pricing over the past six to 12 months. But rather than adapt to these price changes, Khandabattu recommended that CIOs “plan their use cases and prioritise with the expectation that training and inference costs will continue to decline”.

Khandabattu said that even the large AI technology providers recognise the usefulness of model distillation to enable more deployable, more tunable and more governable AI, adding: “Model distillation is finally gaining commercial traction.”

She describes model distillation as a bridge between innovation and scalability: “Model distillation unlocks both technical merit and access. It offers lower inference cost and IT infrastructure expenses are also a bit lower, which makes model distillation cost-effective for certain AI deployments.”

But Khandabattu also noted that there are other costs IT leaders need to consider beyond the IT infrastructure needed to run inference workloads. “CIOs need to be extremely careful and recognise that the total cost of deploying GenAI [generative AI] applications is not limited to the cost of the models.”

There are engineering costs and costs associated with integrating the AI system with enterprise IT, she said, adding: “Fine-tuning an AI model costs a lot of money. If the model provider decides to change the model, then you have to change all of the things that you’ve built on the older model to the newer one, which is very expensive.”

Beyond model distillation, she said: “With AI investment remaining strong this year, a sharper emphasis is being placed on using AI for operational scalability and real-time intelligence.”

According to Gartner, this has led to a gradual pivot from generative AI as a central focus, toward the foundational enablers that support sustainable AI delivery, such as AI-ready data and AI agents.

“Despite the enormous potential business value of AI, it isn’t going to materialise spontaneously,” said Khandabattu. “Success will depend on tightly business aligned pilots, proactive infrastructure benchmarking, and coordination between AI and business teams to create tangible business value.”

Among the AI innovations Gartner has forecast will reach mainstream adoption in the next five years are multimodal AI and AI trust, risk and security management (TRiSM).

Multimodal AI models are trained with multiple types of data simultaneously, such as images, video, audio and text. TRiSM is focused on layers of technical capabilities that support enterprise policies for all AI use cases and help assure AI governance, trustworthiness, fairness, safety, reliability, security, privacy and data protection. Gartner has predicted that, in combination, these developments will enable more robust, innovative and responsible AI applications, transforming how businesses and organisations operate.

Gartner also expects AI agents are at least two to five years away from becoming mainstream. 

“To reap the benefits of AI agents, organisations need to determine the most relevant business contexts and use cases, which is challenging given no AI agent is the same and every situation is different,” said Khandabattu. “Although AI agents will continue to become more powerful, they can’t be used in every case, so use will largely depend on the requirements of the situation at hand.”

Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.
By signing up, you agree to our Terms of Use and acknowledge the data practices in our Privacy Policy. You may unsubscribe at any time.
Share This Article
Facebook Twitter Email Print
Share
What do you think?
Love0
Sad0
Happy0
Sleepy0
Angry0
Dead0
Wink0
Previous Article Launch of GPT-5 reinforces AI governance gap, UK body warns – UKTN
Next Article Today's NYT Strands Hints, Answer and Help for Aug. 8 #523 – CNET
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Stay Connected

248.1k Like
69.1k Follow
134k Pin
54.3k Follow

Latest News

Streamlining Go Concurrency Using a Worker Pool | HackerNoon
Computing
iOS 26: Friends Can't Decide What to Eat? Here's How to Create a Poll in Messages
News
AI and the Prospect of a Post-Big Tech Internet | HackerNoon
Computing
Microsoft’s new Copilot 3D feature is great for Ikea, bad for my dog
News

You Might also Like

News

iOS 26: Friends Can't Decide What to Eat? Here's How to Create a Poll in Messages

6 Min Read
News

Microsoft’s new Copilot 3D feature is great for Ikea, bad for my dog

4 Min Read
News

Rocket Lab eyes big defense opportunities with new acquisition | News

3 Min Read
News

Air Aware Labs founder: Investing in people – UKTN

1 Min Read
//

World of Software is your one-stop website for the latest tech news and updates, follow us now to get the news that matters to you.

Quick Link

  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact

Topics

  • Computing
  • Software
  • Press Release
  • Trending

Sign Up for Our Newsletter

Subscribe to our newsletter to get our newest articles instantly!

World of SoftwareWorld of Software
Follow US
Copyright © All Rights Reserved. World of Software.
Welcome Back!

Sign in to your account

Lost your password?