What we are announcing:
Effortless Scaling: Focus on your innovation, not infrastructure — spin up and scale Ray clusters on YellowDog in seconds, with no manual setup or management required.
Unmatched Flexibility: Run Ray workloads seamlessly across clouds and regions for true elastic, global performance.
Next-Level Scale: Go beyond open-source limits with support for clusters of up to 10,000 nodes on a single deployment.
Take It to the Limit: Put RayDog to the test — scale bigger, move faster using $500 free cloud credits.
The demand for large-scale distributed computing has never been greater. Across financial services, AI research, quantitative engineering, and advanced analytics, teams are turning to Ray to build scalable applications that power training, simulation, and inference workloads.
Yet as Ray’s adoption accelerates, so too does the complexity of managing clusters at scale. Standing up, orchestrating, and maintaining infrastructure that keeps pace with high-performance workloads can become a bottleneck — especially for organisations that want to innovate fast without getting bogged down in DevOps overhead.
That’s where RayDog comes in.
RayDog: The Easiest Way to Scale Ray Workloads Without Infrastructure Limits
RayDog is the new integration between Ray and the YellowDog platform, purpose-built to make scaling Ray clusters effortless, elastic, and enterprise-ready.
By combining Ray’s powerful distributed computing framework with YellowDog’s intelligent workload management and compute orchestration platform, RayDog enables users to launch and manage Ray clusters with unprecedented ease — all while unlocking the flexibility and performance needed for truly large-scale workloads.
Whether you’re training large AI models, running quant simulations, or coordinating distributed data pipelines, RayDog removes the friction between your code and the compute it needs to thrive.

Breaking Barriers: From Complexity to Seamless Orchestration
With RayDog, what once required extensive manual setup, cluster tuning, and ongoing monitoring now becomes seamless. The integration takes advantage of YellowDog’s smart workload scheduler, policy-based governance, and elastic compute provisioning, so Ray clusters can grow or shrink automatically based on workload demand.
RayDog allows teams to:
- Scale instantly and intelligently multi-region, multi-cloud or hybrid.
- Optimise compute usage and cost through automated workload placement.
- Eliminate infrastructure management overhead, freeing developers to focus entirely on workload logic and innovation.
- Gain enterprise-grade visibility and control through the YellowDog platform.
In short, RayDog makes distributed computing as simple as it was meant to be.
Unmatched Scale and Multi-Region Capability
What truly sets RayDog apart is its unmatched scalability and flexibility.
RayDog enables organisations to scale Ray clusters far beyond the limits of the open-source framework — up to 10,000 nodes in a single cluster. This capability removes one of the biggest barriers to scaling advanced distributed workloads, giving teams the freedom to experiment, iterate, and deploy at levels that were previously out of reach.
And because the integration is powered by YellowDog, RayDog clusters can span multiple regions seamlessly. This means teams with distributed operations — from global research institutions to regulated financial organisations — can run a single Ray cluster across regions, optimising for performance, locality, and compliance.
For many, this isn’t just scale — it’s strategic flexibility.
Built for the Teams Driving the Future of Compute
RayDog was designed for the innovators who demand both power and simplicity:
- Financial Services and Quant Teams: Accelerate large-scale backtesting, simulation, and model training workloads without infrastructure bottlenecks.
- AI Innovators and Research Labs: Scale distributed training and inference across thousands of nodes — without needing a dedicated DevOps function.
- High-Performance Data and Simulation Workloads: Access compute resources dynamically and operate clusters across multiple regions and providers with unified control.
These teams share one goal — faster time to value — and RayDog is built precisely to deliver it.
From Idea to Infinite Scale, Instantly
With RayDog, scaling Ray no longer means managing infrastructure. It means focusing on what matters most: your algorithms, your insights, and your outcomes.
By fusing Ray’s simplicity with YellowDog’s orchestration intelligence, RayDog transforms how teams build and deploy high-scale distributed applications — unlocking new frontiers of performance, speed, and efficiency.
If you’re ready to experience what truly flexible scaling feels like, RayDog is here.
Get Started with $500 in Free Cloud Credits
To celebrate the launch of RayDog, we’re giving new users $500 in cloud credits to experience the full power of seamless Ray scaling with YellowDog.
Unlock next-level performance, simplify orchestration, and see how effortlessly you can scale Ray workloads — from prototype to production — without infrastructure limits.
