By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
World of SoftwareWorld of SoftwareWorld of Software
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Search
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
Reading: DeepSeek-V3.2 Outperforms GPT-5 on Reasoning Tasks
Share
Sign In
Notification Show More
Font ResizerAa
World of SoftwareWorld of Software
Font ResizerAa
  • Software
  • Mobile
  • Computing
  • Gadget
  • Gaming
  • Videos
Search
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Have an existing account? Sign In
Follow US
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
World of Software > News > DeepSeek-V3.2 Outperforms GPT-5 on Reasoning Tasks
News

DeepSeek-V3.2 Outperforms GPT-5 on Reasoning Tasks

News Room
Last updated: 2026/01/06 at 12:32 PM
News Room Published 6 January 2026
Share
DeepSeek-V3.2 Outperforms GPT-5 on Reasoning Tasks
SHARE

DeepSeek released DeepSeek-V3.2, a family of open-source reasoning and agentic AI models. The high compute version, DeepSeek-V3.2-Speciale, performs better than GPT-5 and comparably to Gemini-3.0-Pro on several reasoning benchmarks.

DeepSeek applied three new techniques in the development of DeepSeek-V3.2. First, they used a more efficient attention mechanism called DeepSeek Sparse Attention (DSA) that reduces the computational complexity of the model. They also scaled the reinforcement learning phase, which consumed more compute budget than did pre-training. Finally, they developed an agentic task synthesis pipeline to improve the models’ tool use. The result was a model that outperforms most other open models on a range of coding, reasoning, and agentic benchmarks, and performs as well as or better than closed frontier models such as GPT-5 and Gemini-3.0-Pro. However, the DeepSeek team pointed out:

Despite these achievements, we acknowledge certain limitations when compared to frontier closed-source models…First, due to fewer total training FLOPs, the breadth of world knowledge in DeepSeek-V3.2 still lags behind that of leading proprietary models. We plan to address this knowledge gap in future iterations by scaling up the pre-training compute. Second, token efficiency remains a challenge…Future work will focus on optimizing the intelligence density of the model’s reasoning chains to improve efficiency. Third, solving complex tasks is still inferior to frontier models, motivating us to further refine our foundation model and post-training recipe.

InfoQ covered several of DeepSeek’s previous releases, including the initial DeepSeek-V3 launch and DeepSeek-R1, their first reasoning model; both were released in early 2025. Later in 2025, InfoQ covered DeepSeek-V3.1, a hybrid reasoning model that combines thinking and non-thinking modes in a single system.

DeepSeek-V3.2 Benchmark Performance. Image Source: DeepSeek Tech Report

DeepSeek-V3.2 uses the same architecture as DeepSeek-V3.1, except using the new DSA attention mechanism. The team started with a checkpoint of DeepSeek-V3.1 and extended the context length to 128K before continuing pre-training to produce DeepSeek-V3.2. The new attention mechanism reduces the computational complexity from O(^2) to O(), where L is context length and k<

For post-training, the team used specialist distillation. They trained a set of specialist models dedicated to a particular domain: coding, math, and several agent tasks. Then these specialist models produce synthetic training data that is used to fine-tune the main model.


In a Hacker News discussion about DeepSeek-V3.2, several users pointed out the advantages of a high-performing open model. One user wrote:


If you’re trying to build AI based applications you can and should compare the costs between vendor based solutions and hosting open models with your own hardware…Then you compare that to the cost of something like GPT-5, which is a bit simpler because the cost per (million) token is something you can grab off of a website. You’d be surprised how much money running something like DeepSeek (or if you prefer a more established company, Qwen3) will save you over the cloud systems…DeepSeek and Qwen will function on cheap GPUs that other models will simply choke on.


The DeepSeek-V3.2 model files are available to download from Huggingface. However, the high-compute DeepSeek-V3.2-Speciale variant is currently only available via DeepSeek’s API.





Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.
By signing up, you agree to our Terms of Use and acknowledge the data practices in our Privacy Policy. You may unsubscribe at any time.
Share This Article
Facebook Twitter Email Print
Share
What do you think?
Love0
Sad0
Happy0
Sleepy0
Angry0
Dead0
Wink0
Previous Article Best SSPs for Publishers: Comprehensive Comparison Guide Best SSPs for Publishers: Comprehensive Comparison Guide
Next Article If your robot vacuum has ever gotten stuck between rooms, this one’s for you If your robot vacuum has ever gotten stuck between rooms, this one’s for you
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Stay Connected

248.1k Like
69.1k Follow
134k Pin
54.3k Follow

Latest News

US Senators Urge Apple, Google to Pull X, Grok Apps Over Sexualized Imagery
US Senators Urge Apple, Google to Pull X, Grok Apps Over Sexualized Imagery
News
Ex-Tesla engineer shares thoughts on China’s E2E self-driving tech · TechNode
Ex-Tesla engineer shares thoughts on China’s E2E self-driving tech · TechNode
Computing
Best headphone deal: Save 0 on the bmani ANC Headphones
Best headphone deal: Save $100 on the bmani ANC Headphones
News
Geely-affiliated EV maker Polestar reportedly lays off 30% of China staff · TechNode
Geely-affiliated EV maker Polestar reportedly lays off 30% of China staff · TechNode
Computing

You Might also Like

US Senators Urge Apple, Google to Pull X, Grok Apps Over Sexualized Imagery
News

US Senators Urge Apple, Google to Pull X, Grok Apps Over Sexualized Imagery

7 Min Read
Best headphone deal: Save 0 on the bmani ANC Headphones
News

Best headphone deal: Save $100 on the bmani ANC Headphones

3 Min Read
Security Think Tank: Stop buying AI, start buying outcomes | Computer Weekly
News

Security Think Tank: Stop buying AI, start buying outcomes | Computer Weekly

10 Min Read
9 Signs Your RAM Is About To Fail – BGR
News

9 Signs Your RAM Is About To Fail – BGR

21 Min Read
//

World of Software is your one-stop website for the latest tech news and updates, follow us now to get the news that matters to you.

Quick Link

  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact

Topics

  • Computing
  • Software
  • Press Release
  • Trending

Sign Up for Our Newsletter

Subscribe to our newsletter to get our newest articles instantly!

World of SoftwareWorld of Software
Follow US
Copyright © All Rights Reserved. World of Software.
Welcome Back!

Sign in to your account

Lost your password?