World of Software
Copyright © All Rights Reserved. World of Software.

IBM releases small open-source Granite 4 models for mobile devices and browsers

News Room
Last updated: 2025/10/29 at 12:24 PM
News Room Published 29 October 2025

IBM Corp. today announced the release of Granite 4 Nano, a family of extremely small generative artificial intelligence models designed to run at the edge, on-device or in browsers.

The company said the models deliver extremely high performance for their size and are its smallest models yet.

The Granite 4.0 Nano family includes four instruct models and their base model counterparts, ranging from 350 million to 1.5 billion parameters. Parameters are the internal values that a large language model learns during training to understand context from user text queries and generate answers.

Larger LLMs need increased computing power and energy, leading to increased operational costs. They also require specialized hardware, such as powerful graphics processing units and substantial machine memory. Tiny LLMs require far less compute and memory, meaning that they can run on consumer hardware, such as laptops, PCs and mobile devices.
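
To make the memory argument concrete, here is a minimal back-of-the-envelope sketch. It assumes that weight storage is roughly the parameter count multiplied by the bytes used per parameter, and it ignores activations, caches and runtime overhead. The function name and the precision table are illustrative, not part of any IBM tooling.

```python
# Rough weight-memory estimate: parameters x bytes per parameter.
# Assumption: only the weights are counted; activations, caches and
# runtime overhead are ignored, so real usage will be higher.

BYTES_PER_PARAM = {"fp32": 4.0, "fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gb(num_params: float, precision: str) -> float:
    """Return approximate weight storage in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[precision] / 1e9


if __name__ == "__main__":
    # Parameter counts taken from the article: 1.5 billion and 350 million.
    for name, params in [("1.5B model", 1.5e9), ("350M model", 350e6)]:
        for precision in ("fp16", "int4"):
            gb = weight_memory_gb(params, precision)
            print(f"{name} @ {precision}: {gb:.2f} GB")
```

Under these assumptions, a 1.5 billion-parameter model needs about 3 GB for its weights at 16-bit precision, which is within reach of an ordinary laptop, while a model with tens of billions of parameters would not be.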

The tradeoff is reduced accuracy and contextual knowledge, because some of what a larger model learns is trimmed away to shrink it. But with advanced compression techniques, a lot of knowledge and capability can be packed into a smaller size.

Very small LLMs enhance privacy and security, provide offline access to reasoning and allow complete control and customization. Because they run locally, they avoid sending sensitive data to cloud servers, and they can also be cost-effective because they incur no cloud expenses.

The family includes Granite 4.0 H 1B and Granite 4.0 H 350M, models of about 1.5 billion and 350 million parameters that feature the model family’s hybrid architecture, along with two alternative traditional transformer-based versions designed for compatibility in environments where hybrid workloads may not have optimized support.

Granite 4 models have a specialized architecture developed by IBM that combines an additional algorithm with the transformer design that powers most LLMs. Transformers use an attention algorithm to understand and generate text by focusing on the most important parts of an input. IBM hybridized the transformer with processing components based on the Mamba neural network architecture, which is more hardware-efficient than traditional transformers.
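
The attention step described above can be illustrated with a minimal sketch of textbook scaled dot-product attention for a single query. This is a generic illustration in plain Python, not IBM's implementation, and it does not cover the Mamba-based components. The function names and toy vectors are assumptions made for the example.

```python
import math


def softmax(scores):
    """Turn raw scores into weights that are positive and sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention(query, keys, values):
    """Single-query scaled dot-product attention.

    Each key is scored against the query, the scores become weights,
    and the output is the weighted average of the value vectors.
    """
    d = len(query)
    scores = [
        sum(q * k for q, k in zip(query, key)) / math.sqrt(d)
        for key in keys
    ]
    weights = softmax(scores)
    dim = len(values[0])
    output = [
        sum(w * value[i] for w, value in zip(weights, values))
        for i in range(dim)
    ]
    return weights, output


if __name__ == "__main__":
    query = [1.0, 0.0]
    keys = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
    values = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
    weights, output = attention(query, keys, values)
    print("weights:", [round(w, 3) for w in weights])
    print("output:", [round(x, 3) for x in output])
```

The key that points in the same direction as the query receives the largest weight, which is the sense in which attention "focuses" on the most relevant parts of the input.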

Competition is intense among models in the sub-billion to roughly 1 billion-parameter range, where developers focus on performance and capability. Rivals include the Qwen models from Alibaba Group Ltd., the Liquid Foundation Models from Liquid AI Inc. and the Gemma models designed by Google LLC.

IBM said the Granite Nano models outperform several similarly sized models across benchmarks covering general knowledge, math, coding and safety. The Nano models also outperformed competitors on capabilities central to agentic workflows, namely instruction following and tool calling, as measured by IFEval (Instruction-Following Evaluation) and Berkeley’s Function Calling Leaderboard v3.

Granite 4.0 H 1B posted the top accuracy score on IFEval at 78.5, compared with 73.1 for Qwen3 1.7B and 59.3 for Gemma 3 1B. In tool calling, the same model scored 54.8 on Berkeley’s leaderboard, compared with 52.2 for Qwen3 and 16.3 for Gemma 3.
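
For readers who want the gaps rather than the raw scores, the short sketch below computes Granite 4.0 H 1B's lead over each rival using only the figures quoted above. The dictionary layout and the helper name are illustrative assumptions, and the benchmark label "BFCLv3" is shorthand for Berkeley's Function Calling Leaderboard v3.

```python
# Scores as reported in the article; nothing here is measured independently.
SCORES = {
    "IFEval": {"Granite 4.0 H 1B": 78.5, "Qwen3 1.7B": 73.1, "Gemma 3 1B": 59.3},
    "BFCLv3": {"Granite 4.0 H 1B": 54.8, "Qwen3": 52.2, "Gemma 3": 16.3},
}


def margins(scores, leader):
    """Return the leader's lead over every other model, in points."""
    return {
        name: round(scores[leader] - score, 1)
        for name, score in scores.items()
        if name != leader
    }


if __name__ == "__main__":
    for benchmark, scores in SCORES.items():
        for rival, lead in margins(scores, "Granite 4.0 H 1B").items():
            print(f"{benchmark}: +{lead} points over {rival}")
```

On these numbers the lead over Qwen3 is a few points on both benchmarks, while the gap to Gemma 3 is much wider, particularly in tool calling.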

IBM released all the Granite 4 Nano models under the open-source Apache 2.0 license, which is highly permissive. The license allows broad use, including commercial and research applications.

Images: Microsoft Designer/ News, IBM
