Copyright © All Rights Reserved. World of Software.
Google BigQuery Adds SQL-Native Managed Inference for Hugging Face Models

Last updated: 2026/01/28 at 6:15 AM
News Room Published 28 January 2026

Google recently launched third-party generative AI inference for open models in BigQuery, allowing data teams to deploy and run models from Hugging Face or Vertex AI Model Garden using plain SQL. The capability, currently in preview, removes the need for separate ML infrastructure: BigQuery automatically spins up compute resources, manages endpoints, and cleans up afterwards, all through its SQL interface.

The new capability tackles a problem data teams have dealt with for some time. Running open-source models previously meant managing Kubernetes clusters, configuring endpoints, and juggling multiple tools. Virinchi T, writing in a Medium article about the launch, put it this way:

This process requires multiple tools, different skill sets, and significant operational overhead. For many data teams, this friction means AI capabilities remain out of reach—even when the models themselves are freely available.

With BigQuery’s SQL interface, the entire workflow boils down to two SQL statements. First, users create a model with a single CREATE MODEL statement that specifies a Hugging Face model ID (such as sentence-transformers/all-MiniLM-L6-v2) or a Vertex AI Model Garden model name. BigQuery then provisions compute resources with default configurations, and deployment typically completes in 3-10 minutes, depending on model size.
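As a rough illustration of the model-creation step, the Python sketch below assembles a CREATE MODEL statement as a string. The `REMOTE WITH CONNECTION DEFAULT` clause and the `hugging_face_model_id` option name are assumptions modeled on BigQuery's remote-model syntax, not details given in this article, and the model path is a hypothetical placeholder. Check the BigQuery documentation before running the generated SQL.

```python
def build_create_model_sql(model_path: str, hf_model_id: str) -> str:
    """Return a CREATE MODEL statement for a Hugging Face model.

    Assumption: the connection clause and the option name below are
    illustrative; the article does not spell out the exact syntax.
    """
    # Reject quotes, backticks and backslashes so the values cannot
    # break out of the quoted positions in the generated SQL.
    for value in (model_path, hf_model_id):
        if any(ch in value for ch in "'`\\"):
            raise ValueError(f"unsafe character in {value!r}")
    return (
        f"CREATE OR REPLACE MODEL `{model_path}`\n"
        "REMOTE WITH CONNECTION DEFAULT\n"
        f"OPTIONS (hugging_face_model_id = '{hf_model_id}');"
    )


if __name__ == "__main__":
    # "my_dataset.minilm_embedder" is a hypothetical model path.
    print(build_create_model_sql(
        "my_dataset.minilm_embedder",
        "sentence-transformers/all-MiniLM-L6-v2",
    ))
```

Generating the statement from a helper keeps the model ID in one place, which may be convenient when the same pipeline is pointed at several models.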

Next, users run inference with AI.GENERATE_TEXT for language models or AI.GENERATE_EMBEDDING for embeddings, querying data directly from BigQuery tables. The platform manages the resource lifecycle through the endpoint_idle_ttl option, which shuts down idle endpoints to avoid unnecessary charges. Users can also undeploy endpoints manually with ALTER MODEL statements when batch jobs finish.
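The inference and undeploy steps might look roughly like the following sketch, which again only builds SQL strings in Python. The call shape of `AI.GENERATE_EMBEDDING` (a model reference followed by a subquery), the `content` column alias, and the `deploy_model` option name are assumptions for illustration; the table, column, and model names are hypothetical.

```python
def build_embedding_query(model_path: str, table_path: str, text_column: str) -> str:
    """Return a query that embeds one text column of a BigQuery table.

    Assumption: the function takes a MODEL reference and a subquery whose
    text column is aliased to `content`.
    """
    return (
        "SELECT *\n"
        "FROM AI.GENERATE_EMBEDDING(\n"
        f"  MODEL `{model_path}`,\n"
        f"  (SELECT {text_column} AS content FROM `{table_path}`)\n"
        ");"
    )


def build_undeploy_sql(model_path: str) -> str:
    """Return an ALTER MODEL statement that undeploys the endpoint.

    Assumption: `deploy_model` is an illustrative option name.
    """
    return f"ALTER MODEL `{model_path}` SET OPTIONS (deploy_model = FALSE);"


if __name__ == "__main__":
    # Hypothetical names: run the batch embedding job, then undeploy.
    print(build_embedding_query(
        "my_dataset.minilm_embedder", "my_dataset.reviews", "review_text"
    ))
    print(build_undeploy_sql("my_dataset.minilm_embedder"))
```

Pairing the two statements in one batch job is one way to avoid relying solely on the idle timeout to stop charges.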

The feature supports customization for production use cases. Users can set machine types, replica counts, and endpoint idle times directly in the CREATE MODEL statement, and Compute Engine reservations can lock in GPU instances for steady performance. When a model is no longer needed, a DROP MODEL statement automatically removes all associated Vertex AI resources.
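To make the customization options concrete, here is a minimal Python sketch that renders an OPTIONS clause from a dictionary and builds the matching DROP MODEL statement. Only `endpoint_idle_ttl` is named in the article; the other option names (`machine_type`, `min_replica_count`, `max_replica_count`), the machine type value, and the interval literal are assumptions, and `RawSql` and the model path are hypothetical helpers introduced for this example.

```python
from dataclasses import dataclass
from typing import Mapping, Union


@dataclass(frozen=True)
class RawSql:
    """Wraps a SQL expression that should be emitted verbatim."""
    text: str


OptionValue = Union[str, int, bool, RawSql]


def render_options(options: Mapping[str, OptionValue]) -> str:
    """Render a mapping as a SQL OPTIONS (...) clause."""
    parts = []
    for name, value in options.items():
        if isinstance(value, RawSql):
            literal = value.text
        elif isinstance(value, bool):  # bool must be checked before int
            literal = "TRUE" if value else "FALSE"
        elif isinstance(value, int):
            literal = str(value)
        else:
            # Quote strings and escape embedded single quotes.
            literal = "'" + value.replace("'", "\\'") + "'"
        parts.append(f"  {name} = {literal}")
    return "OPTIONS (\n" + ",\n".join(parts) + "\n)"


def build_drop_model_sql(model_path: str) -> str:
    """Return the DROP MODEL statement for a model."""
    return f"DROP MODEL `{model_path}`;"


if __name__ == "__main__":
    # All option names except endpoint_idle_ttl are assumed, not sourced.
    print(render_options({
        "machine_type": "g2-standard-8",
        "min_replica_count": 1,
        "max_replica_count": 2,
        "endpoint_idle_ttl": RawSql("INTERVAL 8 HOUR"),
    }))
    print(build_drop_model_sql("my_dataset.minilm_embedder"))
```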

Google’s blog post describes the system as providing “granular resource control” and “automated resource management,” letting teams find the right balance between performance and cost without leaving SQL. An earlier blog post from September 2025 showed processing 38 million rows for roughly $2-3 using similar patterns with open-source embedding models.
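For a sense of scale, the quoted figures can be converted into a per-million-row cost. The short Python calculation below takes the $2-3 and 38 million row numbers at face value; it assumes nothing about how the cost was actually incurred.

```python
def cost_per_million_rows(total_cost_usd: float, rows: int) -> float:
    """Return the cost in USD of processing one million rows."""
    if rows <= 0:
        raise ValueError("rows must be positive")
    return total_cost_usd / (rows / 1_000_000)


ROWS = 38_000_000  # row count quoted from the September 2025 blog post

low = cost_per_million_rows(2.0, ROWS)
high = cost_per_million_rows(3.0, ROWS)

if __name__ == "__main__":
    print(f"${low:.3f} to ${high:.3f} per million rows")
    # prints "$0.053 to $0.079 per million rows"
```

In other words, the reported run works out to roughly five to eight cents per million rows.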

The feature works with over 13,000 Hugging Face text embedding models and 170,000+ text generation models, covering Meta’s Llama series and Google’s Gemma family. Models need to comply with Vertex AI Model Garden deployment requirements, including regional availability and quota limits.

Virinchi T highlighted what this means for different roles:

For Data Analysts: You can now experiment with ML models without leaving your SQL environment or waiting for engineering resources. For Data Engineers: Building ML-powered data pipelines becomes dramatically simpler—no separate ML infrastructure to maintain.

The launch puts BigQuery up against Snowflake’s Cortex AI and Databricks’ Model Serving, both of which offer SQL-accessible ML inference. BigQuery’s edge might be its direct integration with Hugging Face’s massive model catalog in the data warehouse, which could appeal to users already running on Google Cloud.

Documentation and tutorials are available for text generation with Gemma models and embedding generation.
