open-edge-platform/datumaro
Dataset Management Framework, a Python library and a CLI tool to build, analyze and manage Computer Vision datasets.
The repository has 661 stars and 24,334 monthly downloads. It is actively maintained, with 29 commits in the last 30 days, and is available on PyPI.
Stars: 661
Forks: 158
Language: Python
License: MIT
Category:
Last pushed: Mar 13, 2026
Monthly downloads: 24,334
Commits (30d): 29
Dependencies: 17
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/open-edge-platform/datumaro"
Open to everyone: 100 requests/day, no key needed. A free key raises the limit to 1,000 requests/day.
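The same request can be made from Python. The following is a minimal sketch using only the standard library. It assumes the endpoint returns a JSON body, which this page does not state, and it makes no assumption about which fields that body contains. The helper names `build_quality_url` and `fetch_quality`, and the parameter name `category` for the `ml-frameworks` path segment, are illustrative and not part of the service.

```python
import json
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/quality"


def build_quality_url(category: str, owner: str, repo: str) -> str:
    """Build the endpoint URL for one repository (hypothetical helper)."""
    # Percent-encode each path segment so unusual characters stay inside their segment.
    segments = [quote(part, safe="") for part in (category, owner, repo)]
    return "/".join([BASE_URL, *segments])


def fetch_quality(category: str, owner: str, repo: str, timeout: float = 10.0):
    """Fetch and decode the response (hypothetical helper).

    Assumption: the response body is JSON. The schema is not documented
    on this page, so the decoded value is returned unchanged.
    """
    url = build_quality_url(category, owner, repo)
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `fetch_quality("ml-frameworks", "open-edge-platform", "datumaro")` issues the same GET request as the curl command. Unauthenticated calls count against the 100 requests/day limit, so caching the result locally is a reasonable choice for repeated lookups.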
Related frameworks
webdataset/webdataset
A high-performance Python-based I/O system for large (and small) deep learning problems, with...
explosion/ml-datasets
🌊 Machine learning dataset loaders for testing and example scripts
alan-turing-institute/CleverCSV
CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement...
tensorflow/datasets
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
JovianHQ/opendatasets
A Python library for downloading datasets from Kaggle, Google Drive, and other online sources.