treeverse/dvc
🦉 Data Versioning and ML Experiments
Builds reproducible ML pipelines as directed acyclic graphs (DAGs) that track code, data, and hyperparameters in Git while storing large artifacts in cloud/remote storage with content-addressable caching. Integrates with S3, Azure, GCS, SSH and other remotes; experiment runs are compared locally without external servers, enabling full Git-based collaboration and lineage tracking.
15,443 stars and 2,111,672 monthly downloads. Used by 5 other packages. Actively maintained with 5 commits in the last 30 days. Available on PyPI.
Stars
15,443
Forks
1,282
Language
Python
License
Apache-2.0
Category
Last pushed
Mar 09, 2026
Monthly downloads
2,111,672
Commits (30d)
5
Dependencies
42
Reverse dependents
5
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/treeverse/dvc"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Compare
Related frameworks
runpod/runpod-python
🐍 | Python library for RunPod API and serverless worker SDK.
uber/petastorm
Petastorm library enables single machine or distributed training and evaluation of deep learning...
carsdotcom/skelebot
Machine Learning Project Development Tool
microsoft/vscode-jupyter
VS Code Jupyter extension
operatorai/modelstore
🏬 modelstore is a Python library that allows you to version, export, and save a machine learning...