Microsoft launches new high-speed voice and image models

News Room
Last updated: 2026/04/02 at 6:20 PM
Published 2 April 2026

Microsoft Corp. today introduced a trio of artificial intelligence models optimized to process images and audio.

The algorithms are available through Microsoft Foundry, an Azure service that developers can use to build AI applications. The tech giant has also started rolling out the models to a number of other products.

The first new algorithm, MAI-Image-2, can generate images with a resolution of up to 1024 by 1024 pixels based on user instructions. Each prompt may contain up to 32,000 tokens of text. Under the hood, MAI-Image-2 turns instructions into images using 10 billion to 50 billion non-embedding parameters. Non-embedding parameters are the model components that generate content, as opposed to those that handle preliminary data preparation tasks.

Microsoft says that MAI-Image-2 is at least twice as fast as its previous-generation image generator. The second new model that debuted today, MAI-Transcribe-1, also brings significant speed improvements. It can transcribe speech 2.5 times faster than Microsoft’s earlier models.

MAI-Transcribe-1’s other selling point is its accuracy. Microsoft measured the model’s mean word error rate, a measure of transcript quality in which lower is better, across 25 languages. MAI-Transcribe-1 logged an error rate of 3.9%, which put it ahead of Google LLC’s Gemini 3.1 Flash and OpenAI Group PBC’s GPT-Transcribe. One contributor to the model’s accuracy is that it includes features for filtering environmental noise.
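For readers unfamiliar with the metric, word error rate is conventionally the word-level edit distance between a reference transcript and the model's output, divided by the number of words in the reference. The Python sketch below illustrates that standard definition. It is not Microsoft's benchmark code: the function names are hypothetical, and averaging the per-language rates with equal weight is an assumption, since the article does not say how the mean across 25 languages was computed.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words (classic Levenshtein, one row at a time).
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


def mean_word_error_rate(pairs_by_language: dict[str, tuple[str, str]]) -> float:
    """Assumed aggregation: unweighted average of per-language error rates."""
    if not pairs_by_language:
        raise ValueError("at least one language is required")
    rates = [word_error_rate(ref, hyp) for ref, hyp in pairs_by_language.values()]
    return sum(rates) / len(rates)
```

Under this definition, a 3.9% error rate corresponds to roughly four word-level mistakes (substitutions, deletions or insertions) per 100 reference words.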

At launch, MAI-Transcribe-1 supports only batch transcription. That means the model can process prerecorded files such as audiobooks but not live audio. According to Microsoft, a future update will add the ability to transcribe real-time audio streams. The company is also working on a so-called diarization feature that can split the text of a transcript into speaker-specific segments.

The third model that Microsoft introduced today is called MAI-Voice-1. As the name suggests, it’s optimized to generate synthetic speech based on user-provided scripts. Customers can choose from a set of built-in AI voices or use their own voice.

Microsoft says all three models are competitively priced. MAI-Image-2 costs $5 per 1 million input tokens and $33 per 1 million output tokens. MAI-Transcribe-1 costs $0.36 per hour of transcribed speech, while MAI-Voice-1 starts at $22 per 1 million characters.

The models are available not only through Microsoft Foundry but also through several other services. Microsoft is rolling out MAI-Image-2 to Bing and PowerPoint, while MAI-Voice-1 is accessible in an audio creation tool called Copilot Audio Expressions.

The tech giant has developed a line of custom AI chips called Maia to power its AI workloads. The newest addition to the series, the inference-optimized Maia 200, made its debut in late January. Microsoft says that the three-nanometer chip outperforms competing cloud providers’ custom AI chips across several benchmarks.

Photo: Microsoft
