AMD today announced “AMD-135M” as the company's first publicly-released small language model (SLM). The training code, dataset, and model weights are all open-source to help in the development of other SLMs and LLMs.
AMD-135M is based on the LLaMA2 model architecture and features speculative decoding. The model was trained from scratch on 670 billion tokens using AMD Instinct MI250 accelerators; training on four MI250 nodes took six days. There is also an AMD-Llama-135M-code variant trained on an additional 20 billion tokens of code data.
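For those curious how the speculative decoding angle works in practice, below is a minimal sketch using the Hugging Face transformers assisted-generation API, with the small model serving as a fast "draft" for a larger target model. The CodeLlama-7b target and the exact repo IDs are assumptions for illustration, not confirmed details from AMD's announcement.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

device = "cuda" if torch.cuda.is_available() else "cpu"

# Target model: the large model whose output quality we want.
# Using CodeLlama-7b here is an assumption for illustration.
target = AutoModelForCausalLM.from_pretrained(
    "codellama/CodeLlama-7b-hf",
    torch_dtype=torch.float16 if device == "cuda" else torch.float32,
).to(device)

# Draft model: the small, fast model that proposes candidate tokens.
# Speculative decoding requires the draft and target to use compatible
# tokenizers/vocabularies (both are LLaMA2-family here); the repo ID
# "amd/AMD-Llama-135M-code" is an assumption based on the model naming.
draft = AutoModelForCausalLM.from_pretrained("amd/AMD-Llama-135M-code").to(device)

tokenizer = AutoTokenizer.from_pretrained("codellama/CodeLlama-7b-hf")
inputs = tokenizer("def quicksort(arr):", return_tensors="pt").to(device)

# Passing assistant_model enables assisted generation: the draft proposes a
# run of tokens and the target verifies them in a single forward pass,
# cutting the number of expensive target-model decode steps.
output = target.generate(**inputs, assistant_model=draft, max_new_tokens=64)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```

The speedup comes from the target model accepting or rejecting the draft's proposals in bulk rather than generating every token itself, with no change to the output distribution.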
AMD is making all of the AMD-135M model assets open-source in hopes of furthering other AI development, and, for AMD's part, in hopes that the training and inferencing happens on AMD hardware.
More details on the AMD-135M SLM via the AMD blog. AMD-135M is available via HuggingFace and GitHub.
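For a quick taste of the base model, here is a minimal sketch of running inference via transformers; the repo ID "amd/AMD-Llama-135M" is an assumption based on the model's naming.

```python
# Minimal sketch: load the base SLM from HuggingFace and generate text.
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("amd/AMD-Llama-135M")
model = AutoModelForCausalLM.from_pretrained("amd/AMD-Llama-135M")

inputs = tokenizer("Small language models are", return_tensors="pt")
output = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```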