xAI introduced grok-code-fast-1, a model developed specifically for agentic coding workflows. The model was built from scratch, with a pre-training corpus composed of programming-related data and a post-training set drawn from real pull requests and practical coding tasks.
The model is optimized for common tool calls such as grep, terminal commands, and file edits, and is designed to integrate smoothly with coding environments and IDEs. It also relies on serving techniques and prompt caching to improve responsiveness, with cache hit rates reportedly above 90 percent in partner workflows.
The model supports several programming languages, including TypeScript, Python, Java, Rust, C++, and Go. It is positioned to handle a range of everyday developer tasks, from project scaffolding and answering questions about a codebase to precise bug fixes, with minimal supervision.
Performance was measured on the SWE-Bench-Verified benchmark, where the model scored 70.8 percent using xAI’s internal evaluation suite. Beyond benchmarks, xAI also incorporated human evaluations and automated assessments to guide development, focusing on real-world usability.
To support rapid interaction, the model uses a 256k-token context window, enabling it to process larger codebases in context. Internally, it uses a mixture-of-experts architecture with an estimated 314 billion parameters, designed to balance speed with coding capability. In practical usage, throughput is approximately 92 tokens per second, enabling interactive pacing during development sessions.
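Some back-of-the-envelope arithmetic puts these numbers in perspective. The 4-characters-per-token ratio used below is a common rough heuristic for source code, not an xAI-published figure:

```python
CONTEXT_TOKENS = 256_000   # advertised context window
THROUGHPUT_TPS = 92        # reported generation speed, tokens/second
CHARS_PER_TOKEN = 4        # assumed average for code (rough heuristic)

def generation_time_s(tokens: int, tps: float = THROUGHPUT_TPS) -> float:
    """Seconds to stream `tokens` of output at the given throughput."""
    return tokens / tps

def context_capacity_chars(tokens: int = CONTEXT_TOKENS,
                           chars_per_token: float = CHARS_PER_TOKEN) -> int:
    """Approximate characters of source code that fit in the context window."""
    return int(tokens * chars_per_token)

# A 2,000-token reply streams in roughly 22 seconds at 92 tok/s,
# and the window holds on the order of a million characters of code.
print(round(generation_time_s(2_000), 1))
print(context_capacity_chars())
```

At that pacing, even multi-thousand-token patches arrive within an interactive editing session rather than a batch-style wait.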
In comparison with other coding-focused large language models, grok-code-fast-1 emphasizes speed and tool integration rather than maximum benchmark accuracy. OpenAI's o1-mini and Anthropic's Claude 3.5 Sonnet, for instance, report higher raw reasoning or coding accuracy on some tasks but do not match the same level of prompt caching optimization or throughput. The model's mixture-of-experts design is closer to Google DeepMind's Gemini 1.5 Pro in architectural terms, though adapted specifically for software development workflows.
Community responses highlighted aspects of Grok Code Fast 1’s execution speed. Software developer Eric Jiang shared:
I’ve been using the model the past few weeks as my daily driver, and the speed has made a massive difference in my productivity. It’s a delight to use!
Other commenters turned to use cases and accessibility, discussing how Grok Code Fast 1 could fit into day-to-day development. Questions centered on integration with coding editors and command-line tools.
Software developer Jonathan Parra noted:
Nice, been wanting something like this for a while, needs a CLI tho to compete with Claude Code tho.
Access to grok-code-fast-1 is available for a limited time at no cost through select launch partners, including GitHub Copilot, Cursor, Cline, Roo Code, Kilo Code, opencode, and Windsurf. xAI says it will update the model frequently and notes that a new variant featuring multimodal input, parallel tool usage, and extended context length is already in training.