Huawei’s Zurich Computing Systems Laboratory has released SINQ (Sinkhorn Normalization Quantization), an open-source quantization method that reduces the memory requirements of large language models (LLMs) by up to 70%. The breakthrough allows workloads that once required enterprise GPUs such as Nvidia’s A100 or H100 to run efficiently on consumer-grade cards like the RTX 4090, cutting both hardware and cloud compute costs.
The Apache 2.0–licensed project is now available on GitHub and Hugging Face for free use and commercialization. Huawei says SINQ achieves accuracy close to data-calibrated approaches while outperforming other calibration-free methods such as round-to-nearest (RTN) and Half-Quadratic Quantization (HQQ) in both speed and precision, according to TechNode.
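For intuition on what Sinkhorn-style normalization before quantization can look like, here is a minimal conceptual sketch: per-row and per-column scale factors are found iteratively so the weight matrix's spread is balanced, and the normalized weights are then quantized with plain round-to-nearest. This is an illustration under stated assumptions, not the SINQ repository's actual API; the function names, iteration count, and bit width are all choices made for the example.

```python
import numpy as np

def sinkhorn_normalize(W, n_iters=10):
    """Illustrative Sinkhorn-style normalization: alternately rescale
    rows and columns by their standard deviations so the matrix's
    spread is balanced, accumulating the scales for reconstruction.
    (Hypothetical helper for this example, not SINQ's real code.)"""
    W = W.astype(np.float64).copy()
    row_scale = np.ones(W.shape[0])
    col_scale = np.ones(W.shape[1])
    for _ in range(n_iters):
        r = W.std(axis=1) + 1e-8      # per-row spread
        W /= r[:, None]
        row_scale *= r
        c = W.std(axis=0) + 1e-8      # per-column spread
        W /= c[None, :]
        col_scale *= c
    return W, row_scale, col_scale

def rtn_quantize(W, bits=4):
    """Plain round-to-nearest (RTN) quantization to signed integers,
    using a single per-tensor scale for simplicity."""
    qmax = 2 ** (bits - 1) - 1
    scale = np.abs(W).max() / qmax
    q = np.clip(np.round(W / scale), -qmax - 1, qmax)
    return q.astype(np.int8), scale

# Quantize a random stand-in "weight matrix", then reconstruct it.
W = np.random.randn(128, 256)
W_norm, row_s, col_s = sinkhorn_normalize(W)
q, s = rtn_quantize(W_norm)
W_hat = (q * s) * row_s[:, None] * col_s[None, :]
print("mean abs reconstruction error:", np.abs(W - W_hat).mean())
```

The point of the dual row/column scaling is that a single outlier weight no longer forces a coarse quantization grid on its entire row or column, which is why such normalization can recover accuracy without any calibration data.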