bytedance/byteps

A high performance and generic framework for distributed DNN training

Archived
48
/ 100
Emerging

Supports TensorFlow, PyTorch, Keras, and MXNet with both TCP and RDMA networking, using a cloud-optimized architecture that replaces MPI with custom inter-machine communication alongside intra-machine NCCL. Incorporates hierarchical strategies, pipelining, tensor partitioning, and priority-based scheduling to achieve ~90% scaling efficiency on 256 GPUs—significantly outperforming Horovod+NCCL, particularly on bandwidth-constrained networks. Horovod-compatible API enables minimal code changes for switching frameworks.

3,718 stars. No commits in the last 6 months.

Archived Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 10 / 25
Maturity 16 / 25
Community 22 / 25

How are scores calculated?

Stars

3,718

Forks

495

Language

Python

License

Last pushed

Oct 03, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/bytedance/byteps"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.