treeverse/dvc

🦉 Data Versioning and ML Experiments

85
/ 100
Verified

Builds reproducible ML pipelines as directed acyclic graphs (DAGs) that track code, data, and hyperparameters in Git while storing large artifacts in cloud/remote storage with content-addressable caching. Integrates with S3, Azure, GCS, SSH and other remotes; experiment runs are compared locally without external servers, enabling full Git-based collaboration and lineage tracking.

15,443 stars and 2,111,672 monthly downloads. Used by 5 other packages. Actively maintained with 5 commits in the last 30 days. Available on PyPI.

Maintenance 16 / 25
Adoption 25 / 25
Maturity 25 / 25
Community 19 / 25

How are scores calculated?

Stars

15,443

Forks

1,282

Language

Python

License

Apache-2.0

Last pushed

Mar 09, 2026

Monthly downloads

2,111,672

Commits (30d)

5

Dependencies

42

Reverse dependents

5

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/treeverse/dvc"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.

Compare