  cedana <3 compute

Unbreakable distributed training.

Resume training runs from the exact step where they stopped, even after mid-epoch failures on large multi-node clusters.