Capacity Planning

Hardware Resource Ratio

Recommended balanced ratio per worker node for optimal Celeborn performance:

ResourceRatio
vCPUs2
Memory (GB)5
Network (Gbps)2
Disk I/O (KIOps)1

Worker Sizing Formula

Bash
Copy

Memory Recommendations by Workload

Workload ProfileWorker HeapWorker Off-Heap
Light (< 100 GB shuffle)2 GB4 GB
Medium (100 GB – 1 TB)4 GB8 GB
Heavy (> 1 TB)8 GB16 GB
HDFS / S3 backend8 GB32 GB

Storage Backend Recommendations

Storage TypePerformanceNotes
NVMe SSDBestLowest latency — recommended for hot shuffle data
SAS SSDGoodGood balance of cost and performance
HDDAcceptableUse multiple disks per worker for parallelism
HDFSModerateHigher latency but infinite capacity — ideal for cold/large shuffles
S3 / Object StoreLowerHighest latency, lowest cost — use for overflow or archival

###

Type to search, ESC to discard
Type to search, ESC to discard
Type to search, ESC to discard
  Last updated