uber/petastorm
The Petastorm library enables single-machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as TensorFlow, PyTorch, and PySpark, and can be used from pure Python code.
1,880 stars and 327,598 monthly downloads. Available on PyPI.
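As a rough illustration of the description above, the following sketch reads a few column batches from a Parquet store using Petastorm's `make_batch_reader`. The helper name `read_parquet_batches`, its `max_batches` parameter, and the example path are assumptions made for illustration; they are not part of this listing or of Petastorm itself. It assumes `petastorm` is installed from PyPI.

```python
def read_parquet_batches(dataset_url, max_batches=1):
    """Return up to `max_batches` column batches from a Parquet store.

    `dataset_url` is a URL such as "file:///path/to/parquet_dir".
    This helper is a hypothetical wrapper written for illustration.
    """
    # Imported lazily so this module still loads where petastorm is absent.
    from petastorm import make_batch_reader

    batches = []
    # The reader is a context manager; num_epochs=1 makes a single pass.
    with make_batch_reader(dataset_url, num_epochs=1) as reader:
        for batch in reader:
            # Each batch is a namedtuple whose fields hold column values.
            batches.append(batch._asdict())
            if len(batches) >= max_batches:
                break
    return batches


# Example call (hypothetical path; point it at a real Parquet directory):
# for batch in read_parquet_batches("file:///tmp/example_parquet"):
#     for column, values in batch.items():
#         print(column, len(values))
```

`make_batch_reader` is used here because it reads ordinary Parquet stores; datasets written with Petastorm's own schema metadata are usually read with `make_reader` instead.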
Stars: 1,880
Forks: 286
Language: Python
License: Apache-2.0
Category:
Last pushed: Jan 02, 2026
Monthly downloads: 327,598
Commits (30d): 0
Dependencies: 13
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/uber/petastorm"
Open to everyone: 100 requests/day with no key needed. A free key raises the limit to 1,000 requests/day.
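The same request can be sketched in Python with only the standard library. Two things here are assumptions rather than facts from this page: that the path follows a `quality/<category>/<owner>/<repo>` pattern (inferred from the single URL shown), and that the response body is JSON. The function names are hypothetical, and no API key is sent because the page does not say how a key is passed.

```python
import json
import urllib.parse
import urllib.request

BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category, owner, repo):
    """Build the endpoint URL; the path pattern is an assumption."""
    parts = [urllib.parse.quote(p, safe="") for p in (category, owner, repo)]
    return BASE_URL + "/" + "/".join(parts)


def fetch_quality(category, owner, repo, timeout=10):
    """Fetch and decode the response, assuming it is JSON."""
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Example call (performs a network request and counts against the daily limit):
# data = fetch_quality("ml-frameworks", "uber", "petastorm")
# print(json.dumps(data, indent=2))
```

Keeping URL construction separate from the network call makes the request easy to check without spending any of the daily quota.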
Related frameworks
runpod/runpod-python
🐍 | Python library for RunPod API and serverless worker SDK.
treeverse/dvc
🦉 Data Versioning and ML Experiments
carsdotcom/skelebot
Machine Learning Project Development Tool
microsoft/vscode-jupyter
VS Code Jupyter extension
operatorai/modelstore
🏬 modelstore is a Python library that allows you to version, export, and save a machine learning...