By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
World of SoftwareWorld of SoftwareWorld of Software
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Search
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
Reading: Report: Nvidia is working on a top secret AI inference chip that could debut next month – News
Share
Sign In
Notification Show More
Font ResizerAa
World of SoftwareWorld of Software
Font ResizerAa
  • Software
  • Mobile
  • Computing
  • Gadget
  • Gaming
  • Videos
Search
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Have an existing account? Sign In
Follow US
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
World of Software > News > Report: Nvidia is working on a top secret AI inference chip that could debut next month – News
News

Report: Nvidia is working on a top secret AI inference chip that could debut next month – News

News Room
Last updated: 2026/03/01 at 8:28 PM
News Room Published 1 March 2026
Share
Report: Nvidia is working on a top secret AI inference chip that could debut next month –  News
SHARE

Nvidia Corp. is reportedly working on a dedicated inference processor that will be used by OpenAI Group PBC and other artificial intelligence companies to develop faster and more efficient models, according to a report late Friday in the Wall Street Journal.

The new inference platform is expected to be launched at Nvidia’s annual GTC developer conference in San Jose later this month, and will integrate technology the company acquired from the chip startup Groq Inc. in December.

Inference, which refers to the process of running trained AI models in production, has emerged as a key area of focus in the AI industry. Nvidia rivals such as Google LLC and Amazon Web Services Inc. have both developed specialized inference chips that compete with its graphics processing units, and it also faces competition from dedicated inference chip startups such as Cerebras Systems Inc. and SambaNova Systems Inc.

The Journal said OpenAI has had early access to Nvidia’s new inference chip and will become one of its earliest adopters, in what amounts to a significant win for the chipmaker. Though OpenAI has been shopping for more efficient alternatives to Nvidia’s GPUs in order to diversify its computing stack, it received $30 billion in funding from the world’s top chipmaker last week in a deal that reaffirms its commitment to the company.

Nvidia is the world’s most dominant maker of GPUs, which are specialized processors that can perform billions of tasks simultaneously. But although the company continues to insist that they’re useful for both training and inference, its GPUs are no longer considered the most efficient option for powering AI applications. Many companies have found that Nvidia’s chips consume too much energy, making them extremely costly for applications such as AI agents, which carry out tasks autonomously on behalf of human users and require immense computing power.

That’s why OpenAI signed a multibillion-dollar contract with Cerebras last month to access its dinner plate-sized inference-focused chips. Cerebras claims that its silicon is much faster than Nvidia’s GPUs when it comes to inference tasks.

Nvidia’s inference chip is reportedly going to integrate technology developed by Groq. Nvidia paid $20 billion to license Groq’s technology on a nonexclusive basis in December, and as part of that deal it also hired its founding Chief Executive Officer Jonathan Ross and its President Sunny Madra. It was billed at the time as one of the largest-ever “acquihires” in Silicon Valley’s history.

Groq’s inference chips are known as “language processing units,” and they’re based on an entirely novel architecture that enables them to perform inference with much lower energy usage. However, Nvidia hasn’t said how it plans to use the startup’s technology.

OpenAI reportedly wants to use Nvidia’s new inference chip to power its Codex programming tool, which is a rival to Anthropic PBC’s Claude Code. Coding applications have emerged as one of the most powerful and profitable use cases for generative AI, and it’s an area where OpenAI is only second-best, for Claude Code is widely considered to be the market leader.

Nvidia is also pushing its central processing units as another alternative for running inference workloads. Traditionally, most companies pair its GPUs with CPUs, using the two chips in tandem to compensate for the inefficiencies of the other.

But Nvidia says some agentic AI workloads can actually run more efficiently on its most advanced Grace CPUs alone. Last month, Meta Platforms Inc. became the first company to commit to making the first sizable CPU-only deployment to support its ad-targeting agents in production.

Image: News/Microsoft Designer

Support our mission to keep content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.

  • 15M+ viewers of theCUBE videos, powering conversations across AI, cloud, cybersecurity and more
  • 11.4k+ theCUBE alumni — Connect with more than 11,400 tech and business leaders shaping the future through a unique trusted-based network.

About News Media

News Media is a recognized leader in digital media innovation, uniting breakthrough technology, strategic insights and real-time audience engagement. As the parent company of News, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — with flagship locations in Silicon Valley and the New York Stock Exchange — News Media operates at the intersection of media, technology and AI.

Founded by tech visionaries John Furrier and Dave Vellante, News Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our new proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.

Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.
By signing up, you agree to our Terms of Use and acknowledge the data practices in our Privacy Policy. You may unsubscribe at any time.
Share This Article
Facebook Twitter Email Print
Share
What do you think?
Love0
Sad0
Happy0
Sleepy0
Angry0
Dead0
Wink0
Previous Article Lenovo’s Modular, Two-Screen ThinkBook Is the Futuristic Laptop I’m Rooting For Lenovo’s Modular, Two-Screen ThinkBook Is the Futuristic Laptop I’m Rooting For
Next Article Lenovo's Legion Go Fold Concept Introduces Very Vertical Gaming on the Go Lenovo's Legion Go Fold Concept Introduces Very Vertical Gaming on the Go
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Stay Connected

248.1k Like
69.1k Follow
134k Pin
54.3k Follow

Latest News

Linux 7.0 Shows Off Nice Performance Gains For Databases In Small AMD EPYC Servers
Linux 7.0 Shows Off Nice Performance Gains For Databases In Small AMD EPYC Servers
Computing
Apple Refreshes iPad Air With M4 Chip, 12GB RAM
Apple Refreshes iPad Air With M4 Chip, 12GB RAM
News
Google Antigravity: 20 Game-Changing Prompts for Complete Automation | HackerNoon
Google Antigravity: 20 Game-Changing Prompts for Complete Automation | HackerNoon
Computing
An existing iPhone 16e case will still fit the iPhone 17e
An existing iPhone 16e case will still fit the iPhone 17e
News

You Might also Like

Apple Refreshes iPad Air With M4 Chip, 12GB RAM
News

Apple Refreshes iPad Air With M4 Chip, 12GB RAM

4 Min Read
An existing iPhone 16e case will still fit the iPhone 17e
News

An existing iPhone 16e case will still fit the iPhone 17e

1 Min Read
Tecno’s got the most modular phone ever
News

Tecno’s got the most modular phone ever

3 Min Read
5 Warning Signs Your Lithium-Ion Battery Could Catch Fire (And What To Do) – BGR
News

5 Warning Signs Your Lithium-Ion Battery Could Catch Fire (And What To Do) – BGR

11 Min Read
//

World of Software is your one-stop website for the latest tech news and updates, follow us now to get the news that matters to you.

Quick Link

  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact

Topics

  • Computing
  • Software
  • Press Release
  • Trending

Sign Up for Our Newsletter

Subscribe to our newsletter to get our newest articles instantly!

World of SoftwareWorld of Software
Follow US
Copyright © All Rights Reserved. World of Software.
Welcome Back!

Sign in to your account

Lost your password?