Copyright © All Rights Reserved. World of Software.
Deepgram’s Aura-2 is a high-performance text-to-speech engine built for business interactions

News Room
Last updated: 2025/04/15 at 8:39 AM
News Room Published 15 April 2025

The speech recognition-focused startup Deepgram Inc. today launched a new text-to-speech model called Aura-2, saying it will be a game-changer for real-time voice applications.

According to the startup, Aura-2 is built for clarity, consistency and low-latency performance, enabling much smoother, more fluid and natural conversations between humans and artificial intelligence-powered chatbots. It can be used in almost any kind of voice application, including customer support agents, AI-powered assistants and more.

Deepgram is known for its speech recognition engine, which enables humanlike conversations between humans and machines. Its software has been widely praised for its responsiveness and realism: It waits for the most appropriate moment to break into the conversation, and it is “interruptible,” meaning humans can interject and it will immediately pause whatever it’s saying and reconfigure its response.

Deepgram argues that most existing text-to-speech engines are focused on entertainment, optimized for character voices, storytelling and emotionally expressive delivery, which means they don’t always meet the demands of enterprise-grade voice systems. The customer service industry in particular needs natural-sounding voices that support domain-specific pronunciation, with a professional tone and consistent contextual handling, while being cost-effective and secure.

According to Deepgram, that’s exactly what Aura-2 provides, delivering quality, context-aware speech and conversational experiences for any enterprise environment.

Designed for business scenarios

In a blog post, Deepgram points to a number of capabilities that set Aura-2 apart from other models, the main one being domain-specific pronunciation. The model is designed to handle highly specific terminology across industries, such as financial jargon or product names in the chemical industry. Deepgram says this eliminates the need for customers to fine-tune LLMs with extensive pronunciation dictionaries in order to communicate clearly about niche topics.

Aura-2 supports more than 40 distinct voices in English, including numerous regional U.S. accents and those from other English-speaking countries. Moreover, each voice employs “business-appropriate speech” that purposely avoids the overly theatrical tones common in entertainment-focused text-to-speech engines. Customers can choose from various voice personas, ranging from empathetic and charismatic to calm and professional, to ensure their voice apps align with their brand identity.

Deepgram says Aura-2 also excels in terms of its interruption handling, context awareness and “end-of-thought detection,” enabling more fluid conversations, even when the human speaker interrupts the AI. The startup says it can intelligently adjust different aspects of its voice, such as pacing, pauses, tone and expression, based on the context of whatever it’s discussing, resulting in smoother, more coherent speech overall.

Deepgram Chief Executive Scott Stephenson said the AI chatbot industry has evolved to the point where enterprises don’t just require voices that sound real, but voices that can reliably communicate with humanlike precision in professional contexts.

“Aura-2 delivers the perfect balance of natural speech and enterprise-grade accuracy, enabling organizations to create voice experiences that truly enhance customer engagement while maintaining operational efficiency,” he promised.

Strong performance

The company published a graphic showing that humans strongly prefer interacting with Aura-2 over various other text-to-speech models, including the most advanced models from OpenAI and Microsoft Corp.

Aura-2’s responsiveness is another major strength. The company boasts of sub-200-millisecond response times, as well as impressive scalability, handling thousands of concurrent requests to support high-volume deployments in call centers and virtual assistant scenarios.

Customers have the option to deploy Aura-2 on-premises or in virtual private cloud environments to ensure full control over their data, which can also further reduce latency.

Deepgram highlighted Aura-2’s competitiveness in terms of cost, too. It said it’s priced at just three cents per 1,000 characters, making it significantly cheaper than other business-focused models such as ElevenLabs Turbo and Cartesia Sonic, which cost five cents and 3.8 cents per 1,000 characters, respectively. It charges the same rate for all 40-plus voices offered, with a tiered pricing structure for higher-volume deployments.
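To put the quoted per-character rates in perspective, the short Python sketch below estimates the bill for the same workload under each of the three prices. It is only an illustration: the one-million-character workload and the names `PRICE_PER_1K_CHARS` and `synthesis_cost` are assumptions made for this example, and the calculation ignores the tiered volume pricing mentioned above.

```python
# Per-1,000-character prices quoted in the article, in US dollars.
# Tiered volume discounts are not modeled here.
PRICE_PER_1K_CHARS = {
    "Deepgram Aura-2": 0.030,
    "Cartesia Sonic": 0.038,
    "ElevenLabs Turbo": 0.050,
}


def synthesis_cost(characters: int, price_per_1k: float) -> float:
    """Return the cost in dollars of synthesizing `characters` characters."""
    return characters / 1000 * price_per_1k


if __name__ == "__main__":
    # Hypothetical workload: one million characters of synthesized speech.
    workload_characters = 1_000_000
    for model, price in PRICE_PER_1K_CHARS.items():
        cost = synthesis_cost(workload_characters, price)
        print(f"{model}: ${cost:,.2f}")
```

At that hypothetical volume, the quoted rates work out to about $30 for Aura-2, $38 for Cartesia Sonic and $50 for ElevenLabs Turbo.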

Lastly, Deepgram explained that Aura-2 is powered by a customized infrastructure layer called Deepgram Enterprise Runtime. This supports additional features such as automated model adaptation, enabling it to learn on the job and improve its performance over time, and model “hot-swapping,” where customers can instantly switch among the underlying large language models that power their chat applications, without downtime.

Deepgram is inviting customers to get started with Aura-2 now via its interactive playground. New signups will get a generous $200 worth of free credits, enough to generate around 220 hours of speech, providing ample opportunity to experiment and see how it performs in various voice application scenarios.
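As a rough sanity check on those figures, the Python sketch below converts the credit into a character budget. It assumes, and the article does not confirm this, that the free credits are billed at the three-cent list price per 1,000 characters; the function names are made up for this example.

```python
# Figures quoted in the article.
FREE_CREDIT_USD = 200.0
QUOTED_HOURS_OF_SPEECH = 220

# Assumption: credits are consumed at the quoted list price of
# three cents per 1,000 characters.
PRICE_PER_1K_CHARS_USD = 0.03


def characters_for_credit(credit_usd: float, price_per_1k: float) -> float:
    """Return how many characters a dollar credit buys at a per-1,000-character price."""
    return credit_usd / price_per_1k * 1000


def implied_chars_per_hour(characters: float, hours: float) -> float:
    """Return the characters-per-hour rate implied by a character budget and a duration."""
    return characters / hours


if __name__ == "__main__":
    budget = characters_for_credit(FREE_CREDIT_USD, PRICE_PER_1K_CHARS_USD)
    per_hour = implied_chars_per_hour(budget, QUOTED_HOURS_OF_SPEECH)
    print(f"Character budget: {budget:,.0f}")
    print(f"Implied characters per hour of speech: {per_hour:,.0f}")
```

Under that assumption, $200 buys roughly 6.7 million characters, and spreading them over 220 hours implies about 30,000 characters per hour of generated speech.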

Featured image: News/Dreamina
