Copyright © All Rights Reserved. World of Software.

PyTorch 2.10 Released With More Improvements For AMD ROCm & Intel GPUs

News Room
Last updated: 2026/01/21 at 1:08 PM
News Room Published 21 January 2026

PyTorch 2.10 is out today as the latest feature update to this widely used deep learning library. The new release continues improving support for Intel GPUs and the AMD ROCm compute stack while also bringing more enhancements for NVIDIA CUDA.

PyTorch 2.10 for AMD ROCm now enables grouped GEMM via a regular GEMM fallback and via CK. There is also better ROCm support for PyTorch on Microsoft Windows, torch.cuda._compile_kernel support, load_inline support, the addition of GFX1150/GFX1151 RDNA 3.5 GPUs to the hipblaslt-supported GEMM lists, scaled_mm v2 support, AOTriton scaled_dot_product_attention, improved heuristics for pointwise kernels on ROCm, code generation support for fast_tanhf on ROCm, and other improvements.
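
For readers unfamiliar with the term, a grouped GEMM computes several independent matrix multiplications, possibly of different shapes, as one operation, and a "regular GEMM fallback" can be pictured as running one ordinary GEMM per group. The pure-Python sketch below only illustrates that idea; the function names are hypothetical and this is not PyTorch's or ROCm's actual implementation.

```python
def gemm(a, b):
    """Plain matrix multiply of two list-of-lists matrices."""
    inner = len(b)
    if any(len(row) != inner for row in a):
        raise ValueError("inner dimensions do not match")
    cols = len(b[0])
    return [
        [sum(row[k] * b[k][j] for k in range(inner)) for j in range(cols)]
        for row in a
    ]


def grouped_gemm_fallback(groups):
    """Hypothetical fallback: run one regular GEMM per (A, B) pair.

    Each group may have its own shape, which is what separates a grouped
    GEMM from a batched GEMM with a single shared shape.
    """
    return [gemm(a, b) for a, b in groups]


if __name__ == "__main__":
    groups = [
        ([[1, 2]], [[3], [4]]),                    # 1x2 times 2x1
        ([[1, 0], [0, 1]], [[5, 6], [7, 8]]),      # 2x2 identity times 2x2
    ]
    print(grouped_gemm_fallback(groups))
```

A fused grouped kernel, such as one provided via CK, would aim to handle all groups in a single launch rather than looping as this sketch does.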

Intel GPU support also sees a number of improvements in PyTorch 2.10. Several additional Torch XPU APIs are now in place for Intel GPUs, along with support for the ATen operators scaled_mm and scaled_mm_v2 and for _weight_int8pack_mm. The SYCL support in the PyTorch CPP Extension API now also allows for implementing new custom operators on Windows. There are also some Intel performance optimizations and other improvements.
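
With three GPU vendors covered, a script may want to pick a device at run time. The sketch below is one possible approach, not an official recipe: it assumes that ROCm builds expose GPUs through the torch.cuda namespace and set torch.version.hip, and that Intel GPUs appear under torch.xpu. The pick_device and fake_torch helpers are hypothetical names introduced here, and fake_torch is only a stand-in so the logic can be tried without any GPU or PyTorch install.

```python
from types import SimpleNamespace


def pick_device(torch_module):
    """Return (device_string, vendor_label) for a torch-like module."""
    if torch_module.cuda.is_available():
        # ROCm builds reuse the "cuda" device string; version.hip tells them apart.
        is_rocm = getattr(torch_module.version, "hip", None) is not None
        return "cuda", "rocm" if is_rocm else "nvidia"
    xpu = getattr(torch_module, "xpu", None)
    if xpu is not None and xpu.is_available():
        return "xpu", "intel"
    return "cpu", "cpu"


def fake_torch(cuda=False, hip=None, xpu=False):
    """Hypothetical stand-in for the torch module, for GPU-less dry runs."""
    return SimpleNamespace(
        cuda=SimpleNamespace(is_available=lambda: cuda),
        xpu=SimpleNamespace(is_available=lambda: xpu),
        version=SimpleNamespace(hip=hip),
    )


if __name__ == "__main__":
    # "placeholder" stands in for whatever HIP version string a ROCm build reports.
    print(pick_device(fake_torch(cuda=True, hip="placeholder")))
    print(pick_device(fake_torch(xpu=True)))
    print(pick_device(fake_torch()))
```

With PyTorch installed, the same function can be called as pick_device(torch) after import torch, and the returned device string passed to tensor constructors via their device argument.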

The NVIDIA CUDA support in PyTorch 2.10 also gains more features: templated kernels, pre-compiled kernel support, automatic inclusion of CUDA headers, support for the cuda-python CUDA stream protocol, CUDA 13 compatibility improvements, support for nested memory pools, CUTLASS MATMULs on Thor, and other features.

PyTorch 2.10 also brings Python 3.14 support for torch.compile() as well as experimental support for the Python 3.14 free-threaded build. There is also lower kernel launch overhead with combo-kernels horizontal fusion in Torch Inductor, debugging improvements, and various quantization enhancements.
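
As a rough illustration of what this means in practice, the sketch below reports the interpreter details relevant to these notes and wraps a toy function with torch.compile(). The helper names are hypothetical; sys._is_gil_enabled() is assumed to be present only on Python 3.13 and later, and compiled_demo() assumes PyTorch plus a working compiler backend are installed.

```python
import importlib.util
import sys


def describe_environment():
    """Summarize the interpreter details relevant to the PyTorch 2.10 notes."""
    gil_check = getattr(sys, "_is_gil_enabled", None)
    return {
        "python": f"{sys.version_info.major}.{sys.version_info.minor}",
        "torch_installed": importlib.util.find_spec("torch") is not None,
        # Interpreters without the check predate free-threading, so the GIL is on.
        "gil_enabled": gil_check() if gil_check is not None else True,
    }


def compiled_demo():
    """Compile a tiny function and return (compiled, eager) results."""
    import torch  # imported lazily so describe_environment() works without torch

    def scaled_sum(x):
        return (x * 2.0 + 1.0).sum()

    fast_scaled_sum = torch.compile(scaled_sum)
    x = torch.ones(4)
    return float(fast_scaled_sum(x)), float(scaled_sum(x))


if __name__ == "__main__":
    print(describe_environment())
```

On a machine with PyTorch installed, calling compiled_demo() should return two matching values, one from the compiled function and one from eager execution.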

Downloads and more details on PyTorch 2.10 are available via GitHub.
