AMD has released a new Stable Diffusion 3 Medium artificial intelligence (AI) model optimised for XDNA 2 neural processing units (NPUs). The chipmaker claimed that it is the world’s first AI model that processes outputs in the BF16 format. The model is supported on newer Ryzen AI laptops with at least 24GB of RAM, once users download TensorStack’s Amuse 3.1 beta software. Stable Diffusion 3 Medium is an on-device image generation model that does not require internet connectivity.
AMD’s Image Generation Model Can Generate Print-Ready Images
In a press release, the Santa Clara-based tech giant detailed the new image generation model. The AI model is based on Stable Diffusion 3 Medium and is optimised for the company’s XDNA 2 NPUs, which are built into Ryzen AI laptops released in 2024 and later.
The company claims the model can be used to generate stock-quality images from text prompts. The model generates images at 1024 × 1024 resolution, which are then upscaled to a 2048 × 2048 print resolution using the NPU’s capabilities.
The new AI model is part of AMD and TensorStack’s new Amuse 3.1 desktop app, which is free to download and install. Since the image generation model runs entirely locally, it works even when the device is not connected to the internet. All data processing occurs on-device, powered by the XDNA 2 NPU.
AMD said it has reduced the memory requirements of the AI model, which now needs 24GB of RAM instead of the 32GB that was required for the Stable Diffusion XL Turbo model. Additionally, the new image model consumes only 9GB of RAM while active. The company achieved this by using the memory-efficient block floating point 16, or Block FP16 (BF16), format.
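To illustrate the general idea behind a block floating point format, here is a minimal Python sketch in which a group of values shares a single exponent while each value keeps only a small integer mantissa. The function names, the block size of 8, and the 8-bit mantissa width are assumptions chosen for illustration; they are not taken from AMD’s press release and this is not AMD’s implementation.

```python
import math


def bfp_quantize(values, block_size=8, mantissa_bits=8):
    """Quantize floats into blocks that share one exponent (illustrative only).

    block_size and mantissa_bits are assumed values, not AMD's parameters.
    Returns a list of (shared_exponent, [integer mantissas]) tuples.
    """
    limit = 2 ** (mantissa_bits - 1) - 1  # largest signed mantissa magnitude
    blocks = []
    for start in range(0, len(values), block_size):
        block = values[start:start + block_size]
        max_abs = max(abs(v) for v in block)
        # frexp gives e such that max_abs < 2**e, so every value fits the range.
        exponent = math.frexp(max_abs)[1] if max_abs > 0 else 0
        scale = 2.0 ** (exponent - (mantissa_bits - 1))
        # Round to the nearest step and clamp to the signed mantissa range.
        mantissas = [max(-limit, min(limit, round(v / scale))) for v in block]
        blocks.append((exponent, mantissas))
    return blocks


def bfp_dequantize(blocks, mantissa_bits=8):
    """Rebuild approximate floats from the shared-exponent blocks."""
    out = []
    for exponent, mantissas in blocks:
        scale = 2.0 ** (exponent - (mantissa_bits - 1))
        out.extend(m * scale for m in mantissas)
    return out


if __name__ == "__main__":
    weights = [0.1, 0.2, -0.3, 0.7]
    packed = bfp_quantize(weights, block_size=4)
    print(packed)
    print(bfp_dequantize(packed))
```

Under these assumed parameters, each value costs 8 mantissa bits plus its share of one exponent per block, which is where the memory saving over a full 16-bit value per weight would come from.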
The tech giant highlighted that the Stable Diffusion 3 Medium AI model strictly adheres to the prompt, including its structure and order. AMD said users trying out the model should first describe the type of image, then the structural components, and finally the details and other context. Negative prompts can be used to remove elements from the image, and the placement of full stops can change how the model understands the context.
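The prompt ordering AMD recommends can be sketched as a small helper that assembles the pieces in the suggested sequence. This is a hypothetical illustration: `build_prompt` and its parameters are invented here and are not part of the Amuse app or any AMD API, and the choice to separate the three sections with full stops is an assumption about one reasonable way to apply the advice.

```python
def build_prompt(image_type, structure, details, negative=()):
    """Assemble a prompt as: image type, then structure, then details.

    Hypothetical helper for illustration; not an Amuse or AMD API.
    Items within a section are joined by commas, and sections are
    separated by full stops, since punctuation placement can matter.
    """
    sections = [
        image_type.strip(),
        ", ".join(s.strip() for s in structure if s.strip()),
        ", ".join(d.strip() for d in details if d.strip()),
    ]
    # Drop empty sections and avoid doubled full stops.
    prompt = ". ".join(s.rstrip(".") for s in sections if s) + "."
    negative_prompt = ", ".join(n.strip() for n in negative if n.strip())
    return {"prompt": prompt, "negative_prompt": negative_prompt}


if __name__ == "__main__":
    result = build_prompt(
        "Photograph",
        ["a lighthouse on a cliff", "stormy sea below"],
        ["golden hour", "35mm"],
        negative=["people", "text"],
    )
    print(result["prompt"])
    print(result["negative_prompt"])
```

Keeping the negative prompt as a separate field mirrors how it is described above, as a way to remove elements rather than as part of the main description.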