By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
World of SoftwareWorld of SoftwareWorld of Software
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Search
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
Reading: Pinterest’s Moka: How Kubernetes Is Rewriting the Rules of Big Data Processing
Share
Sign In
Notification Show More
Font ResizerAa
World of SoftwareWorld of Software
Font ResizerAa
  • Software
  • Mobile
  • Computing
  • Gadget
  • Gaming
  • Videos
Search
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Have an existing account? Sign In
Follow US
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
World of Software > News > Pinterest’s Moka: How Kubernetes Is Rewriting the Rules of Big Data Processing
News

Pinterest’s Moka: How Kubernetes Is Rewriting the Rules of Big Data Processing

News Room
Last updated: 2026/01/19 at 4:04 AM
News Room Published 19 January 2026
Share
Pinterest’s Moka: How Kubernetes Is Rewriting the Rules of Big Data Processing
SHARE

Digital pinboard provider Pinterest has published an article explaining its blueprint for the future of large-scale data processing with its new platform Moka. The company is moving core workloads from ageing Hadoop infrastructure to a Kubernetes-based system on Amazon EKS, with Apache Spark as the main engine and support for other frameworks on the way.

In a two-part blog series, Soam Acharya, Rainie Li, William Tom and Ang Zhang describe how the Pinterest Big Data Platform team considered alternatives for their next-generation massive-scale data processing platform as the limits of the existing Hadoop-based system, known internally as Monarch, became clear. They present Moka as the outcome of that search, and as their EKS based cloud native data processing platform, which now runs production workloads at Pinterest scale. Part one of the series focused on the overall design and the application layer. In contrast, part two turns to what the authors call “the infrastructure-focused aspects of Moka with learnings and future direction.”

The post frames the move to Kubernetes in practical terms. It demonstrates an industry-wide shift in which big technology companies now treat Kubernetes as a control plane for data, rather than only as a stateless service platform. Encouraged by growing popularity and increasing adoption in the Big Data community, the team explored Kubernetes-based systems as the most likely replacement for Hadoop 2.x. Any candidate platform had to meet precise criteria around scalability, security, cost and the ability to host multiple processing engines. Moka is an example of how to modernise a Hadoop-era data platform without abandoning existing Spark investments.

A central theme of the second article is how to operate Spark at a very large scale on Kubernetes. The authors explain how they added logging, metrics and job history services around Moka so that engineers could debug and tune jobs without needing to understand the underlying cluster topology. They worked on standardising log collection with Fluent Bit and publishing uniform metrics using OpenTelemetry and Prometheus-compatible endpoints. This gave both infrastructure and application teams a consistent view of system health. 

Pinterest has also invested in making the platform reproducible through infrastructure-as-code. In the article, the team outlines how they use Terraform and Helm to create EKS clusters, configure networking and security, and deploy supporting components such as the Spark History Server.

Pinterest’s engineers also discuss dealing with different hardware architectures. They describe how they built multi-architecture images so that their data workloads run well on both Intel and ARM-based instances, including AWS Graviton, and they link this to cost and efficiency goals at fleet scale. A LinkedIn summary of the project from InfoQ editor Eran Stiller notes that Moka “delivers container-level isolation, ARM support, YuniKorn scheduling, and significant cost savings by consolidating workloads and auto scaling across instance types”. These details place the work within the broader trend of cloud users seeking to cut infrastructure costs without sacrificing performance.

The broader industry conversation on processing engines adds nuance to Pinterest’s story. In a separate LinkedIn post, Acharya writes that “while Spark is our primary workhorse, the success of Moka has meant other use cases at Pinterest are following suit: Flink Batch is in production with Apache Ray close behind and Flink Streaming slated for later this year”. Technical deep dives on Spark and Flink underline why this matters, stressing that Spark remains well suited to large batch and interactive analytics workloads, while Flink is “purpose built for real time, stateful stream processing” with strict event by event handling. The team is presenting Moka as a flexible base to add different engines to, depending on the needs of the specific workload, rather than as a Spark-only platform.

External observers have drawn lessons from the Pinterest case. The ML Engineer newsletter describes the Moka article as an example of “deploying EKS clusters, Fluent Bit logging, OTEL metrics pipelines, image management, and a custom Moka UI for Spark on Kubernetes”, placing it alongside other case studies in modern data infrastructure. These reactions suggest that Moka is seen somewhat as a reference architecture for a class of cloud native data systems.

The team do however present their migration work as an ongoing journey, rather than a finished project. In the blog and a further LinkedIn post, the Pinterest authors discuss “learnings and future direction” and describe how early proof-of-concepts led to a phased move away from Hadoop as confidence in the new stack grew. Acharya notes that “the best problems show up at scale” and that building the platform involved “working out the kinks” as the team shifted real workloads. For other organisations, this experience may be the most important lesson. Copying the technical choices around Kubernetes, EKS and Spark is relatively simple, but the process of uncoupling from legacy systems and investing in observability, automation and multi-engine support is likely to be the real work that lies ahead.

Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.
By signing up, you agree to our Terms of Use and acknowledge the data practices in our Privacy Policy. You may unsubscribe at any time.
Share This Article
Facebook Twitter Email Print
Share
What do you think?
Love0
Sad0
Happy0
Sleepy0
Angry0
Dead0
Wink0
Previous Article Next Wave: The perimeter and the drain Next Wave: The perimeter and the drain
Next Article TSMC reportedly declines outsourcing deal for Samsung’s Exynos chips · TechNode TSMC reportedly declines outsourcing deal for Samsung’s Exynos chips · TechNode
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Stay Connected

248.1k Like
69.1k Follow
134k Pin
54.3k Follow

Latest News

Douyin international registration claim debunked, over 10,000 fake foreign user accounts removed · TechNode
Douyin international registration claim debunked, over 10,000 fake foreign user accounts removed · TechNode
Computing
Best DJI deal: Save 20% on the DJI Mini 4K drone
Best DJI deal: Save 20% on the DJI Mini 4K drone
News
New Patches Provide HDMI VRR & Auto Low Latency Mode Gaming Features For AMD Linux GPU Driver
New Patches Provide HDMI VRR & Auto Low Latency Mode Gaming Features For AMD Linux GPU Driver
Computing
Trump administration wants tech companies to buy B of power plants they may not use |  News
Trump administration wants tech companies to buy $15B of power plants they may not use | News
News

You Might also Like

Best DJI deal: Save 20% on the DJI Mini 4K drone
News

Best DJI deal: Save 20% on the DJI Mini 4K drone

4 Min Read
Trump administration wants tech companies to buy B of power plants they may not use |  News
News

Trump administration wants tech companies to buy $15B of power plants they may not use | News

4 Min Read
5 Costco Electronics Rules Every Shopper Should Know – BGR
News

5 Costco Electronics Rules Every Shopper Should Know – BGR

8 Min Read
I restarted using Apple Watch after two years and can’t believe I ever quit
News

I restarted using Apple Watch after two years and can’t believe I ever quit

9 Min Read
//

World of Software is your one-stop website for the latest tech news and updates, follow us now to get the news that matters to you.

Quick Link

  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact

Topics

  • Computing
  • Software
  • Press Release
  • Trending

Sign Up for Our Newsletter

Subscribe to our newsletter to get our newest articles instantly!

World of SoftwareWorld of Software
Follow US
Copyright © All Rights Reserved. World of Software.
Welcome Back!

Sign in to your account

Lost your password?