aengusmartindonaire/pyspark-ml-pipeline
PySpark ML classification pipelines for NLP, clinical prediction, and census income deployed on a 3-node Spark/HDFS cluster.
Stars
—
Forks
—
Language
Jupyter Notebook
License
—
Category
Last pushed
Mar 20, 2026
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/aengusmartindonaire/pyspark-ml-pipeline"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
lensacom/sparkit-learn
PySpark + Scikit-learn = Sparkit-learn
Angel-ML/angel
A Flexible and Powerful Parameter Server for large-scale machine learning
MingChen0919/learning-apache-spark
Notes on Apache Spark (pyspark)
flink-extended/dl-on-flink
Deep Learning on Flink aims to integrate Flink and deep learning frameworks (e.g. TensorFlow,...
alibaba/Alink
Alink is the Machine Learning algorithm platform based on Flink, developed by the PAI team of...