AI coding agents have become one of the fastest-growing categories in enterprise software. In the span of just a few years, these development tools have evolved from simple autocomplete assistants into autonomous systems capable of taking over the complete software development cycle, all via natural language prompts.
As vibe-coding takes off, tools from startups like Cursor and Anthropic’s Claude Code have quickly reached multibillion‑dollar revenue run rates. Cursor reportedly crossed $1 billion in annual recurring revenue (ARR) in 2025 and has since approached $2 billion in Q1 of 2026. Anthropic’s Claude Code has scaled even faster, reaching an estimated $2.5 billion annualized run rate within its first year, making it one of the fastest‑growing products in the category that accounts for a large share of Anthropic’s $14 billion ARR.
Yet inside large enterprises, writing code is rarely the hardest part of the job. Data scientists, engineers, and analysts spend much of their time maintaining and augmenting pipelines rather than building new ones. The real bottleneck in enterprise AI, therefore, is not software development itself, but operating complex data systems in production.
Databricks CEO and co-founder Ali Ghodsi believes that the gap represents the next frontier for AI automation. In his view, the next generation of AI agents won’t just write software, but operate the data systems that modern businesses depend on.
That strategic bet is behind Genie Code, a system of autonomous AI agents unveiled today, designed for data engineering, data science, and analytics operations. The system extends the company’s existing Genie platform ecosystem, which allows knowledge workers to ask questions about enterprise data in natural language. (More than 20,000 organizations already used Databricks’s data management and analytics tools; the company’s ARR surpassed $5.4 billion annual revenue in February.)
“Instead of functioning merely as a coding assistant or helping generate code faster, these agents actually understand the structure of the data and existing data problems,” Ali Ghodsi says. “It can automatically set up pipelines, analyze why something is failing, and understand issues like when a dataset schema changes or when permissions are modified.”
For instance, Genie Code can help determine how a dataset should be prepared for modeling—randomizing the data, separating part of it into a test set, or training a model on the remaining portion. After training, the system can aid in evaluating the results using metrics such as F1 scores or the area under the curve, and then analyzing them to determine whether the model is performing well or requires improvement.
“It can suggest trying different approaches—maybe retraining the model or generating plots and graphs to visualize performance, and uncover reasoning about what changes might improve the results,” Ghodsi explains. “It’s not about just generating random code snippets, but understanding the entire structure of the data problem and working through the modeling workflow the same way a data scientist or engineer would.”
