Save a process, container or VM using Cedana and migrate the workload onto another instance with real-time performance, zero interruption and no code modifications.
Works with Kueue (for HPC-style workloads), KServe (for inference), Kubeflow (for large-scale training), SLURM and more.
Scale workloads and clusters up and down with higher performance, higher utilization and faster response times than previously available. Preempt and save workloads quickly to downscale resources without losing progress or performance.
Deliver best-in-class performance. We continuously optimize performance at the kernel, container, filesystem, network and interconnect layers. We use internal testing and simulation to measure correctness, reliability and performance thoroughly.
We’ve deployed a test cluster where you can interact and experiment with the system.
Learn more about how Cedana is transforming compute orchestration and how we can help your organization.
From deploying on your cluster, to market, to GPU Checkpointing, learn our system and get started quickly.