yahoo/TensorFlowOnSpark

TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.

Score: 69 / 100 (Established)

Enables distributed TensorFlow training and inference on Spark/Hadoop clusters by launching TensorFlow workers on Spark executors. Input data can be read directly from HDFS by TensorFlow or fed from Spark RDDs through `TFNode.DataFeed`. Supports synchronous and asynchronous training, model and data parallelism, and direct server-to-server communication. Existing TensorFlow programs need only minimal code changes, so they can run inside existing Spark data pipelines.
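The RDD-feeding path through `TFNode.DataFeed` usually comes down to a small polling loop on each worker: ask the feed whether it should stop, then pull the next batch. The sketch below illustrates that loop only. `iter_batches` and `FakeFeed` are hypothetical names introduced here, and `FakeFeed` is an in-memory stand-in so the example runs without Spark or TensorFlow. In a real job the feed object would be a `TFNode.DataFeed` created inside the function handed to `TFCluster.run`, and the sketch assumes it exposes `should_stop()` and `next_batch(batch_size)`.

```python
from typing import Any, Iterator, List


def iter_batches(feed: Any, batch_size: int) -> Iterator[List[Any]]:
    """Yield non-empty batches until the feed reports that it should stop.

    `feed` is assumed to offer should_stop() and next_batch(batch_size),
    which is the interface this sketch expects from TFNode.DataFeed.
    """
    while not feed.should_stop():
        batch = feed.next_batch(batch_size)
        if len(batch) == 0:
            # Nothing left to consume: end the generator.
            break
        yield batch


class FakeFeed:
    """Hypothetical in-memory stand-in for a data feed (local testing only)."""

    def __init__(self, records):
        self._records = list(records)
        self._pos = 0

    def should_stop(self) -> bool:
        # Stop once every record has been handed out.
        return self._pos >= len(self._records)

    def next_batch(self, batch_size: int) -> List[Any]:
        # Return up to batch_size records and advance the cursor.
        batch = self._records[self._pos:self._pos + batch_size]
        self._pos += len(batch)
        return batch


# Five records in batches of two: the last batch is short.
batches = list(iter_batches(FakeFeed(range(5)), batch_size=2))
```

Wrapping the loop in a generator keeps the feed-specific logic in one place, so the training code only sees an ordinary Python iterator of batches.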

3,859 stars and 11,238 monthly downloads. No commits in the last 6 months. Available on PyPI.

Stale 6m, No Dependents
Maintenance 0 / 25
Adoption 19 / 25
Maturity 25 / 25
Community 25 / 25


Stars: 3,859
Forks: 941
Language: Python
License: Apache-2.0
Last pushed: Jul 10, 2023
Monthly downloads: 11,238
Commits (30d): 0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/yahoo/TensorFlowOnSpark"

Open to everyone: 100 requests/day with no key needed. Get a free key for 1,000 requests/day.
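The same request can be made from Python with only the standard library. This is a minimal sketch: the base URL and path layout are copied from the curl command above, while `build_quality_url` and `fetch_quality` are hypothetical helper names. The response format is not documented here, so the body is returned as raw text rather than parsed.

```python
from urllib.parse import quote
from urllib.request import urlopen

# Base URL taken from the curl example on this page.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, repo: str) -> str:
    """Build the endpoint URL for a category and an 'owner/name' repository."""
    # quote() leaves '/' unescaped by default, so 'owner/name' stays a path.
    return f"{BASE_URL}/{quote(category)}/{quote(repo)}"


def fetch_quality(category: str, repo: str, timeout: float = 10.0) -> str:
    """Fetch the raw response body; no response schema is assumed."""
    with urlopen(build_quality_url(category, repo), timeout=timeout) as resp:
        return resp.read().decode("utf-8")


url = build_quality_url("ml-frameworks", "yahoo/TensorFlowOnSpark")
```

Calling `fetch_quality("ml-frameworks", "yahoo/TensorFlowOnSpark")` performs the actual network request and counts against the daily request limit.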