By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
World of SoftwareWorld of SoftwareWorld of Software
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Search
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
Reading: You Can Run IBM’s AI Chatbot Locally In Your Web Browser – Here’s How – BGR
Share
Sign In
Notification Show More
Font ResizerAa
World of SoftwareWorld of Software
Font ResizerAa
  • Software
  • Mobile
  • Computing
  • Gadget
  • Gaming
  • Videos
Search
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Have an existing account? Sign In
Follow US
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
World of Software > News > You Can Run IBM’s AI Chatbot Locally In Your Web Browser – Here’s How – BGR
News

You Can Run IBM’s AI Chatbot Locally In Your Web Browser – Here’s How – BGR

News Room
Last updated: 2025/11/22 at 3:46 PM
News Room Published 22 November 2025
Share
You Can Run IBM’s AI Chatbot Locally In Your Web Browser – Here’s How – BGR
SHARE






GamePixel/Shutterstock

IBM recently launched its Granite 4.0 Nano AI models that, like AI chatbots on iPhones, you can run locally in your web browser. The four new models, which range from 350 million to 1.5 billion parameters, are small enough to load directly into your web browser without the need for a server, subscription fees, or an internet connection. Since these chatbots run locally and offline, they keep every conversation private, and the data stays on your device. 

Popular AI chatbots such as ChatGPT, Gemini, and Claude, as well as other alternatives, require heavy cloud infrastructure, servers, and internet connectivity. Running IBM’s compressed AI models locally in your web browser is simple. For Granite 4.0 Nano AI models, all you need is a laptop or desktop with at least 8GB of RAM and a WebGPU-enabled browser like Chrome or Edge. IBM has launched Granite 4.0 Nano models in different sizes and architectures, including Granite-4.0-H-1B (1.5 billion parameters), Granite-4.0-H-350M (350 million parameters), Granite-4.0-1B, and Granite-4.0-350M. All models feature a hybrid Mamba/transformer architecture that IBM states “reduces memory requirements without sacrificing performance.” 

For better reasoning and responses, you can use the larger model with 1.5 billion parameters, but it would require a dedicated GPU with at least 6-8GB of additional VRAM. You’ll need an internet connection to download a model, but after setup, the AI model runs offline. To use IBM’s chatbots on your browser, check if your browser is updated. Once done, visit HuggingFace. Here, you can select a model and download it. Once loaded, you can start using it for tasks such as writing code, summarizing documents, and drafting emails.

The trade‑offs of using local AI


An image of an AI chatbot
Vertigo3d/Getty Images

IBM’s Nano models are small, but according to the company, they punch above their weight. Cloud-based AI chatbots, such as ChatGPT and Claude, use large language models (LLMs) that contain billions of parameters, which demand a lot of computing power to process and generate responses. These parameters define how a model processes information and generates a response. 

In general, a higher parameter count means an LLM is better at reasoning. However, response quality also depends on architecture, training data, and how a model is optimized. Running an AI chatbot locally has several upsides. Your data is not stored in any server, and it’s a free tool versus the $20 per month users pay for services like ChatGPT Plus or Gemini Pro. Moreover, response lag is minimal with local AI models because they don’t require an internet connection or a server to process requests.

There are some trade-offs as well. IBM’s Granite Nano is competitive with other AI models in similar parameter ranges, and can handle straightforward tasks, but it can’t replace or compete with LLMs, such as GPT-4 or Claude. The responses from these smaller models will usually be shorter and may not offer deep reasoning like larger models do. Smaller models also struggle with long inputs and can’t search the web or access information beyond their training data. IBM’s compressed AI models are useful if you want a customized tool for specific tasks. These AI models can be used for many tasks, such as writing emails or summarizing documents, but for better reasoning, you’ll have to consider regular LLMs.



Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.
By signing up, you agree to our Terms of Use and acknowledge the data practices in our Privacy Policy. You may unsubscribe at any time.
Share This Article
Facebook Twitter Email Print
Share
What do you think?
Love0
Sad0
Happy0
Sleepy0
Angry0
Dead0
Wink0
Previous Article Luckin Coffee records first quarterly loss in two years, negative operating margin · TechNode Luckin Coffee records first quarterly loss in two years, negative operating margin · TechNode
Next Article Indie App Spotlight: ‘Mint’ is an all-in-one collection tracker for Pokémon enthusiasts – 9to5Mac Indie App Spotlight: ‘Mint’ is an all-in-one collection tracker for Pokémon enthusiasts – 9to5Mac
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Stay Connected

248.1k Like
69.1k Follow
134k Pin
54.3k Follow

Latest News

Nearly M Raised Before V1 Protocol Launch, This New Crypto May Be The Safest 2026 Investment | HackerNoon
Nearly $19M Raised Before V1 Protocol Launch, This New Crypto May Be The Safest 2026 Investment | HackerNoon
Computing
Resetting GPU depreciation: Why AI factories bend, but don’t break, useful life assumptions –  News
Resetting GPU depreciation: Why AI factories bend, but don’t break, useful life assumptions – News
News
New DNA Sequencing Technology Can Completely Change Care For Newborns – BGR
New DNA Sequencing Technology Can Completely Change Care For Newborns – BGR
News
Best Free Cloud Storage for Photos in 2025
Best Free Cloud Storage for Photos in 2025
News

You Might also Like

Resetting GPU depreciation: Why AI factories bend, but don’t break, useful life assumptions –  News
News

Resetting GPU depreciation: Why AI factories bend, but don’t break, useful life assumptions – News

13 Min Read
New DNA Sequencing Technology Can Completely Change Care For Newborns – BGR
News

New DNA Sequencing Technology Can Completely Change Care For Newborns – BGR

5 Min Read
Best Free Cloud Storage for Photos in 2025
News

Best Free Cloud Storage for Photos in 2025

22 Min Read
Google Promotes Apple TV Series Pluribus With Secret Search Message
News

Google Promotes Apple TV Series Pluribus With Secret Search Message

6 Min Read
//

World of Software is your one-stop website for the latest tech news and updates, follow us now to get the news that matters to you.

Quick Link

  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact

Topics

  • Computing
  • Software
  • Press Release
  • Trending

Sign Up for Our Newsletter

Subscribe to our newsletter to get our newest articles instantly!

World of SoftwareWorld of Software
Follow US
Copyright © All Rights Reserved. World of Software.
Welcome Back!

Sign in to your account

Lost your password?