World of Software

Prime Intellect Releases INTELLECT-2: A 32B Parameter Model Trained via Decentralized Reinforcement Learning

News Room
Published 21 May 2025
Last updated: 2025/05/21 at 2:33 PM

Prime Intellect has released INTELLECT-2, a 32-billion-parameter language model trained with fully asynchronous reinforcement learning across a decentralized network of compute contributors. Unlike traditional centralized training, INTELLECT-2 was trained on permissionless infrastructure in which rollout generation, policy updates, and weight broadcasting are distributed and loosely coupled.

The system is built around PRIME-RL, a new training framework designed for asynchronous RL in untrusted environments. It separates three tasks: generating rollouts, updating the model, and broadcasting weights. New policy weights are distributed by SHARDCAST, a component that relays them through a tree-based HTTP network. Inference rollouts submitted by workers are verified with TOPLOC, a locality-sensitive hashing mechanism that detects tampering or numerical discrepancies before the results are allowed to influence training.



Source: https://arxiv.org/html/2505.07291v1
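
The tree-based weight distribution described above can be illustrated with a small sketch. This is not SHARDCAST's code or its actual topology; the function names, the fixed fan-out, and the node labels are all hypothetical, chosen only to show why a tree keeps the load on the origin server constant as the number of relays grows.

```python
from typing import Dict, List, Optional


def build_relay_tree(nodes: List[str], fanout: int) -> Dict[str, Optional[str]]:
    """Map each node to the parent it downloads weights from.

    nodes[0] is treated as the origin that holds the fresh checkpoint.
    Hypothetical layout: a complete k-ary tree in list order.
    """
    if fanout < 1:
        raise ValueError("fanout must be at least 1")
    parents: Dict[str, Optional[str]] = {}
    for i, node in enumerate(nodes):
        parents[node] = None if i == 0 else nodes[(i - 1) // fanout]
    return parents


def hops_from_origin(node: str, parents: Dict[str, Optional[str]]) -> int:
    """Count how many HTTP transfers separate a node from the origin."""
    hops = 0
    while parents[node] is not None:
        node = parents[node]
        hops += 1
    return hops


def max_children(parents: Dict[str, Optional[str]]) -> int:
    """Largest number of downloaders any single node has to serve."""
    counts: Dict[str, int] = {}
    for parent in parents.values():
        if parent is not None:
            counts[parent] = counts.get(parent, 0) + 1
    return max(counts.values(), default=0)


if __name__ == "__main__":
    nodes = ["origin"] + [f"relay-{i}" for i in range(1, 7)]
    tree = build_relay_tree(nodes, fanout=2)
    for name in nodes:
        print(name, "<-", tree[name], "hops:", hops_from_origin(name, tree))
    print("max downloads served by one node:", max_children(tree))
```

With a fan-out of 2, no node serves more than two downloads regardless of how many relays join; the cost is extra hops for nodes deeper in the tree.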

INTELLECT-2 was trained on 285,000 math and coding tasks sourced from datasets such as NuminaMath-1.5 and SYNTHETIC-1. The reward signal combines binary task success with token-length penalties or bonuses, allowing fine-grained control over inference-time compute budgets. Training stability was supported by techniques such as two-sided GRPO clipping, gradient norm management, and both offline and online filtering to select high-value tasks.
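
A minimal sketch of the two ideas in this paragraph may help. Both functions are illustrative assumptions rather than the INTELLECT-2 implementation: the linear length penalty, the parameter names (`alpha`, `eps`, `delta`), and the default values are hypothetical, and the clipping form shown is one common way to bound the probability ratio on both sides.

```python
def length_aware_reward(correct: bool, actual_len: int, target_len: int,
                        alpha: float) -> float:
    """Binary task reward minus a penalty for missing the token budget.

    Assumed form: reward = success - alpha * |target_len - actual_len|.
    """
    task_reward = 1.0 if correct else 0.0
    return task_reward - alpha * abs(target_len - actual_len)


def two_sided_clip_objective(ratio: float, advantage: float,
                             eps: float = 0.2, delta: float = 4.0) -> float:
    """Per-token clipped surrogate with an extra upper bound on the ratio.

    Standard clipping keeps the ratio in [1 - eps, 1 + eps] only when that
    lowers the objective. For negative advantages a very large ratio is
    otherwise unbounded, so the ratio is additionally capped at `delta`.
    """
    clipped_ratio = max(1.0 - eps, min(ratio, 1.0 + eps))
    return min(min(ratio, delta) * advantage, clipped_ratio * advantage)


if __name__ == "__main__":
    # A correct answer that overshoots a 1000-token budget by 200 tokens.
    print(length_aware_reward(True, 1200, 1000, alpha=0.0003))
    # A large ratio with a negative advantage is bounded by delta.
    print(two_sided_clip_objective(ratio=10.0, advantage=-1.0))
    # A large ratio with a positive advantage is bounded by 1 + eps.
    print(two_sided_clip_objective(ratio=10.0, advantage=1.0))
```

Raising `alpha` makes the model adhere more tightly to the requested length, at the cost of weakening the task signal; without the `delta` cap, the second call would contribute -10 instead of -4 to the objective.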

The asynchronous training process overlaps inference, communication, and model updates, avoiding typical bottlenecks found in centralized RL systems. A Rust-based orchestrator running on a testnet coordinates the global pool of contributors, handling hardware checks, heartbeats, task assignments, and contribution tracking, much as peer-to-peer or blockchain-based systems do.
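
The heartbeat and task-assignment duties mentioned above can be sketched roughly as follows. The real orchestrator is written in Rust and its interfaces are not described here, so the class, its method names, the timeout value, and the round-robin assignment are all hypothetical; the sketch only shows the general pattern of assigning work to nodes that have reported in recently.

```python
from typing import Dict, List


class HeartbeatRegistry:
    """Tracks the last heartbeat per node (hypothetical, for illustration)."""

    def __init__(self, timeout_s: float) -> None:
        self.timeout_s = timeout_s
        self.last_seen: Dict[str, float] = {}

    def heartbeat(self, node_id: str, now: float) -> None:
        """Record that a node reported in at time `now` (seconds)."""
        self.last_seen[node_id] = now

    def live_nodes(self, now: float) -> List[str]:
        """Nodes whose last heartbeat is within the timeout window."""
        return sorted(
            node for node, seen in self.last_seen.items()
            if now - seen <= self.timeout_s
        )

    def assign(self, tasks: List[str], now: float) -> Dict[str, str]:
        """Round-robin tasks over live nodes; empty if none are live."""
        live = self.live_nodes(now)
        if not live:
            return {}
        return {task: live[i % len(live)] for i, task in enumerate(tasks)}


if __name__ == "__main__":
    registry = HeartbeatRegistry(timeout_s=30.0)
    registry.heartbeat("node-a", now=0.0)
    registry.heartbeat("node-b", now=20.0)
    # At t=40, node-a has been silent for 40 s and is treated as offline.
    print(registry.live_nodes(now=40.0))
    print(registry.assign(["task-1", "task-2"], now=40.0))
```

Passing the clock in as `now` rather than reading it inside the class keeps the sketch deterministic and easy to test.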

Performance evaluations showed improvements on targeted math and programming tasks, particularly over QwQ-32B, a previous RL-trained model. Broader benchmark improvements were more modest, suggesting gains were mostly confined to training data domains. Prime Intellect noted that improvements might be more significant using stronger base models, such as Qwen3, or by integrating more complex environments and reasoning tools.

One Reddit user remarked on the broader implications:

Distributed training and distributed inference seem like the way to go. Maybe something similar to P2P or blockchain with some kind of rewards for computational contributions/transactions. Not necessarily yet another cryptocurrency, but maybe credits that can be used for free computing on the network.

Future work includes increasing the inference-to-training compute ratio, enabling multi-turn reasoning with integrated tools like web search or Python, crowdsourcing RL tasks, and experimenting with decentralized model merging methods such as DiLoCo.

The model, code, training framework, and documentation are publicly available on the Prime Intellect website, along with a Hugging Face release and a chat demo.
