By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
World of SoftwareWorld of SoftwareWorld of Software
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Search
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
Reading: Intel oneDNN 3.8 Brings More CPU & GPU Performance Optimizations
Share
Sign In
Notification Show More
Font ResizerAa
World of SoftwareWorld of Software
Font ResizerAa
  • Software
  • Mobile
  • Computing
  • Gadget
  • Gaming
  • Videos
Search
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Have an existing account? Sign In
Follow US
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
World of Software > Computing > Intel oneDNN 3.8 Brings More CPU & GPU Performance Optimizations
Computing

Intel oneDNN 3.8 Brings More CPU & GPU Performance Optimizations

News Room
Last updated: 2025/05/11 at 12:10 AM
News Room Published 11 May 2025
Share
SHARE

Intel software engineers released oneDNN 3.8 to end out the week with various new performance optimizations and more.

The Intel oneDNN library that is now part of the UXL Foundation serves as the building blocks for AI / deep learning applications. This library provides basic building blocks for deep learning applications and is aggressively optimized for Intel’s hardware offerings but with time has also developed robust support for competitor hardware platforms too.

Intel Xeon Granite Rapids

With oneDNN 3.8 there are continued Intel AMX enhancements, better Panther Lake Xe3 integrated graphics performance, refinements for existing Xe2 graphics support, and other optimizations to benefit Intel’s recent and upcoming CPU and GPU products.

“Intel Architecture Processors

– Improved matmul and inner product primitives performance on processors with Intel AMX instruction set support.
– Improved performance of convolution and inner product primitives on processors with Intel AVX2 instruction set support.
– Improved performance of int8 convolution support with zero points.
– Improved fp32 convolution performance with fp16 and bf16 compressed weights on processors with Intel AVX2 or Intel AVX-512 instruction set support.
– Improved fp16/bf16 depthwise convolution performance with fp32 bias or sum post-ops or dilation.
– Improved bf16 pooling backpropagation performance.
– Improved binary post-ops performance with per_w broadcast.

Intel Graphics Products

– Improved performance on Intel Arc graphics for future Intel Core Ultra processors (code name Panther Lake).
– Improved convolution performance on:
Intel Arc Graphics for Intel Core Ultra processor series 2 (formerly Lunar Lake).
Intel Arc B-series discrete graphics (formerly Battlemage).
– Improved int8 matmul performance with zero-points support for source and weight tensors.
– Improved f4_e2m1 and f4_e3m0 matmul and reorder performance.
– Improved performance of the following subgraphs with Graph API:
Scaled Dot Product Attention (SDPA) with int4 and int8 compressed key and value.
fp16/bf16 SDPA with fp32 intermediate data types. Using fp32 intermediate data types is recommended.
SDPA with head size 512 and 576.
Grouped Query Attention (GQA) with 5D input tensors.”

The oneDNN 3.8 release also has FP16, INT8, and BF16 optimizations for AArch64 processors, Graph API support for NVIDIA GPUs, ROCm 6 support on AMD CPUs, and a variety of other smaller enhancements.

Downloads and more information on the oneDNN 3.8 library release for building out deep learning applications via GitHub. New oneDNN benchmarks soon for upcoming hardware releases.

Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.
By signing up, you agree to our Terms of Use and acknowledge the data practices in our Privacy Policy. You may unsubscribe at any time.
Share This Article
Facebook Twitter Email Print
Share
What do you think?
Love0
Sad0
Happy0
Sleepy0
Angry0
Dead0
Wink0
Previous Article M4 MacBook Pro might be one of the first worthy upgrades for Apple Silicon Mac users – 9to5Mac
Next Article Ford quietly kills multi-billion dollar software-defined vehicle plans
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Stay Connected

248.1k Like
69.1k Follow
134k Pin
54.3k Follow

Latest News

Save over $300 on a robot vacuum that’s probably smarter than you
News
Free Script Writing Templates for Professional Screenwriting
Computing
Apple’s latest MacBook Air just fell to $837 for Mother’s Day
News
3 underrated CarPlay features everyone should be using
News

You Might also Like

Computing

Free Script Writing Templates for Professional Screenwriting

23 Min Read
Computing

Top 10 Clockify Alternatives For Time Tracking & Productivity

26 Min Read
Computing

TikTok faces large-scale content removal after major falling out with Universal Music Group · TechNode

3 Min Read
Computing

Ad Hoc Meeting Essentials: 7 Key Steps for Success |

28 Min Read
//

World of Software is your one-stop website for the latest tech news and updates, follow us now to get the news that matters to you.

Quick Link

  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact

Topics

  • Computing
  • Software
  • Press Release
  • Trending

Sign Up for Our Newsletter

Subscribe to our newsletter to get our newest articles instantly!

World of SoftwareWorld of Software
Follow US
Copyright © All Rights Reserved. World of Software.
Welcome Back!

Sign in to your account

Lost your password?