Hugging Face Launches An Open Source Tool For Affordable AI Deployment

Hugging Face launches an open source tool for affordable AI deployment

Last updated: 2024/12/14 at 3:45 AM

News Room Published 14 December 2024

Hugging Face has introduced its latest offering, Hugging Face Generative AI Services (HUGS), aimed at simplifying the deployment and scaling of generative AI applications using open-source models.

Built on Hugging Face technologies such as Transformers and Text Generation Inference (TGI), HUGS promises optimized performance across various hardware accelerators.

For developers using AWS or Google Cloud, the service is available at $1 per hour per container, with a five-day free trial on AWS to help users get started.

Streamlining AI with zero-configuration inference

HUGS offers developers a solution to run AI models on their own infrastructure without the need for manual configuration. One of the primary challenges when deploying large language models (LLMs) is optimizing them for specific hardware environments. Each accelerator, whether it is an NVIDIA GPU or an AMD GPU, requires fine-tuning to extract maximum performance.

With HUGS, these optimizations are managed automatically, delivering high throughput out of the box. In addition to NVIDIA and AMD GPUs, the company promises that its support will soon extend to AWS Inferentia and Google TPUs.

Hugging Face aims to ease the transition from black-box APIs to open, self-hosted solutions with support for a wide array of models, including well-known LLMs like Llama and Gemma, with plans to introduce multimodal models such as Idefics and Llava soon. In the future, the company says it will include embedding models like BGE and Jina, giving developers even more options to customize their AI applications.

This service uses standardized APIs compatible with OpenAI’s model interfaces, therefore, developers can migrate their own code.

For startups in particular, HUGS provides an opportunity to build AI applications without incurring the high costs associated with proprietary platforms. The availability of one-click deployments on DigitalOcean makes it even easier for small teams to experiment with generative AI technologies.

Meanwhile, larger enterprises can leverage HUGS to scale their applications without being locked into a single cloud provider or proprietary API. On DigitalOcean, HUGS is included at no extra charge beyond the standard cost of GPU Droplets. Hugging Face also offers custom deployment solutions for enterprises through its Enterprise Hub.

Hugging Face launches an open source tool for affordable AI deployment

Streamlining AI with zero-configuration inference

You might also like

Leave a Reply Cancel reply

Stay Connected

Latest News

AI Gets Funny: 35 Hilarious ChatGPT Memes to Enjoy

Save Space on Your Phone by Offloading and Archiving Apps

Gmail Hack Attacks—What You Need To Know, What You Need To Do

As Soon as You Install iOS 18.2, Change These 8 iPhone Settings

World of Software is your one-stop website for the latest tech news and updates, follow us now to get the news that matters to you.

Quick Link

Topics

Sign Up for Our Newsletter

Streamlining AI with zero-configuration inference

You might also like

Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.

Leave a Reply Cancel reply

Stay Connected

Latest News