TensorFlowOnSpark and tensorflow_scala
These are complements: TensorFlowOnSpark provides distributed training infrastructure on Spark clusters while tensorflow_scala provides the language bindings to write TensorFlow code in Scala, and they can be used together to build Scala-based distributed ML pipelines.
About TensorFlowOnSpark
yahoo/TensorFlowOnSpark
TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.
Enables distributed TensorFlow training and inference across Spark/Hadoop clusters by launching TensorFlow workers on executors with support for both HDFS-native data reads and Spark RDD pushdown via `TFNode.DataFeed`. Supports synchronous/asynchronous training, model/data parallelism, and server-to-server direct communication, requiring minimal code changes to existing TensorFlow programs while integrating seamlessly into Spark data pipelines.
About tensorflow_scala
eaplatanios/tensorflow_scala
TensorFlow API for the Scala Programming Language
Related comparisons
Scores updated daily from GitHub, PyPI, and npm data. How scores work