World of Software

Intel Releases llm-scaler-vllm 0.14.0-b8, Talks Up 1.49x Performance With BMG-G31

News Room
Last updated: 2026/03/02 at 5:58 AM
News Room Published 2 March 2026

Intel kicked off the new month by releasing the latest version of LLM Scaler vLLM (llm-scaler-vllm), its Docker-based solution for running vLLM on Intel Battlemage GPUs for AI inferencing.

Intel llm-scaler-vllm v0.14.0-b8 is out today as the newest version of this solution for vLLM on Intel graphics hardware. This new version is rebased against upstream vLLM 0.14 while also upgrading PyTorch to 2.10 and pulling in the latest oneAPI components. Thanks to Intel oneDNN optimizations, INT4 throughput improves by up to 25% compared to the prior release.

There is also new LLM coverage with this llm-scaler-vllm update, which now officially supports Qwen3-VL-Reranker-2B/8B, Qwen3-VL-Embedding-2B/8B, GLM-4.7-Flash, Ministral models, DeepSeek-OCR-2, and Qwen3-Coder-Next.

There is also now validated support for the BMG-G31 “Big Battlemage” GPU. The Intel BMG-G31 has remained elusive, with no official announcement yet and rumors of its cancellation, but the open-source software enablement around it continues. With validated support now in place, this llm-scaler-vllm update seems to confirm it’s still coming. The announcement even includes some word on its performance uplift:

“G31 validation has been added in this release and all models are functional. The key models’ performance is measured on a non-golden setup B70 system (limited perf for allreduce with small message size), compare with G21: 1.49x geomean under SLA constraints and 1.13x geomean at fixed batch size. The throughput should be better on system with golden BKC setup.”

This also seemingly confirms that the talked-about Arc Pro B70 is indeed BMG-G31. Whether BMG-G31 will appear in any consumer Intel Arc Graphics card remains to be seen, but a 1.49x geomean performance uplift under SLA constraints is quite exciting.
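For those less familiar with the "geomean" figure quoted above, here is a minimal Python sketch of how a geometric-mean speedup is typically computed from per-model throughput ratios. The function name `geomean` and the example ratios are purely illustrative assumptions; they are not Intel's measurements, and the announcement does not list the per-model numbers behind the 1.49x and 1.13x figures.

```python
import math


def geomean(ratios):
    """Return the geometric mean of a list of positive speedup ratios."""
    if not ratios:
        raise ValueError("need at least one ratio")
    if any(r <= 0 for r in ratios):
        raise ValueError("ratios must be positive")
    # Average the logarithms, then exponentiate: the n-th root of the product.
    return math.exp(sum(math.log(r) for r in ratios) / len(ratios))


# Hypothetical per-model ratios (newer GPU throughput / older GPU throughput).
# Made-up placeholder values for illustration only, NOT data from the release.
example_ratios = [1.2, 1.5, 1.8]
print(f"geomean speedup: {geomean(example_ratios):.2f}x")
```

The geometric mean is the usual choice for summarizing ratios because a 2x gain on one model and a 0.5x regression on another average out to 1.0x, rather than the misleading 1.25x an arithmetic mean would report.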

BMG-G31 validation text

See the GitHub release announcement for more details on the Intel llm-scaler-vllm update.
