By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
World of SoftwareWorld of SoftwareWorld of Software
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Search
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
Reading: Intel Releases llm-scaler-vllm 0.14.0-b8, Talks Up 1.49x Performance With BMG-G31
Share
Sign In
Notification Show More
Font ResizerAa
World of SoftwareWorld of Software
Font ResizerAa
  • Software
  • Mobile
  • Computing
  • Gadget
  • Gaming
  • Videos
Search
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Have an existing account? Sign In
Follow US
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
World of Software > Computing > Intel Releases llm-scaler-vllm 0.14.0-b8, Talks Up 1.49x Performance With BMG-G31
Computing

Intel Releases llm-scaler-vllm 0.14.0-b8, Talks Up 1.49x Performance With BMG-G31

News Room
Last updated: 2026/03/02 at 5:58 AM
News Room Published 2 March 2026
Share
Intel Releases llm-scaler-vllm 0.14.0-b8, Talks Up 1.49x Performance With BMG-G31
SHARE

Intel kicked off the new month by releasing the latest version of LLM Scaler vLLM (llm-scaler-vllm) as their Docker-based solution for running vLLM on Intel Battlemage GPUs for AI inferencing.

Intel llm-scaler-vllm v0.14.0-b8 is out today as the newest version of this solution for vLLM on Intel graphics hardware. This new version is rebased against vLLM 0.14 upstream while also upgrading PyTorch to 2.10 and pulling in the latest oneAPI components. Thanks to Intel oneDNN optimizations the INT4 performance is seeing up to a 25% throughput improvement compared to the prior release.

There is also new LLM coverage with this llm-scaler-vllm update, with now officially supporting Qwen3-VL-Reranker-2B/8B, Qwen3-VL-Embedding-2B/8B, GLM-4.7-Flash, Ministral models, DeepSeek-OCR-2, and Qwen3-Coder-Next.

There is also validated support now for the BMG-G31 “Big Battlemage” GPU. The Intel BMG-G31 has remained elusive with no official announcement yet, rumors of its cancellation, etc, but the open-source software enablement around it continues. With this llm-scaler-vllm update seeming to confirm it’s still coming as there is now validated support. The announcement even mentions some word on its performance uplift:

“G31 validation has been added in this release and all models are functional. The key models’ performance is measured on a non-golden setup B70 system (limited perf for allreduce with small message size), compare with G21: 1.49x geomean under SLA constraints and 1.13x geomean at fixed batch size. The throughput should be better on system with golden BKC setup.”

Seemingly confirming as well that the talked about Arc Pro B70 is indeed BMG-G31. But whether BMG-G31 will appear in any consumer Intel Arc Graphics card remains to be seen. But 1.49x geo mean performance with SLA constraints is quite exciting.

BMG-G31 validation text

See the GitHub release announcement for more details on the Intel llm-scaler-vllm update.

Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.
By signing up, you agree to our Terms of Use and acknowledge the data practices in our Privacy Policy. You may unsubscribe at any time.
Share This Article
Facebook Twitter Email Print
Share
What do you think?
Love0
Sad0
Happy0
Sleepy0
Angry0
Dead0
Wink0
Previous Article You vibe-coded an app with AI, now what? You vibe-coded an app with AI, now what?
Next Article Samsung Display Takes the Bezel Almost Down to Zero With This Concept Samsung Display Takes the Bezel Almost Down to Zero With This Concept
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Stay Connected

248.1k Like
69.1k Follow
134k Pin
54.3k Follow

Latest News

Android XR is getting a Pixel 10 feature and we tried it at MWC 2026
Android XR is getting a Pixel 10 feature and we tried it at MWC 2026
News
Apple announces the iPhone 17E
Apple announces the iPhone 17E
News
Lenovo’s Latest Wacky Concepts Include a Laptop With a Built-In Portable Monitor
Lenovo’s Latest Wacky Concepts Include a Laptop With a Built-In Portable Monitor
Gadget
Mark Essien’s new startup fixes what travel booking missed
Mark Essien’s new startup fixes what travel booking missed
Computing

You Might also Like

Mark Essien’s new startup fixes what travel booking missed
Computing

Mark Essien’s new startup fixes what travel booking missed

7 Min Read
The 2026 FBA Ads Playbook: How to Beat Fee Hikes with Dynamic Bidding | HackerNoon
Computing

The 2026 FBA Ads Playbook: How to Beat Fee Hikes with Dynamic Bidding | HackerNoon

10 Min Read
⚡ Weekly Recap: SD-WAN 0-Day, Critical CVEs, Telegram Probe, Smart TV Proxy SDK and More
Computing

⚡ Weekly Recap: SD-WAN 0-Day, Critical CVEs, Telegram Probe, Smart TV Proxy SDK and More

28 Min Read
Docker Scout vs Traditional Container Scanners: Why Context Beats CVE Noise | HackerNoon
Computing

Docker Scout vs Traditional Container Scanners: Why Context Beats CVE Noise | HackerNoon

14 Min Read
//

World of Software is your one-stop website for the latest tech news and updates, follow us now to get the news that matters to you.

Quick Link

  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact

Topics

  • Computing
  • Software
  • Press Release
  • Trending

Sign Up for Our Newsletter

Subscribe to our newsletter to get our newest articles instantly!

World of SoftwareWorld of Software
Follow US
Copyright © All Rights Reserved. World of Software.
Welcome Back!

Sign in to your account

Lost your password?