uber/petastorm

Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.

74
/ 100
Verified

1,880 stars and 327,598 monthly downloads. Available on PyPI.

Maintenance 6 / 25
Adoption 20 / 25
Maturity 25 / 25
Community 23 / 25

How are scores calculated?

Stars

1,880

Forks

286

Language

Python

License

Apache-2.0

Last pushed

Jan 02, 2026

Monthly downloads

327,598

Commits (30d)

0

Dependencies

13

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/uber/petastorm"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.