cookiecutter-data-science and cookiecutter-docker-science

These are complementary tools: cookiecutter-data-science provides the project structure and organization, while cookiecutter-docker-science adds containerization capabilities that can wrap around that structure for reproducible deployment.

cookiecutter-data-science

Verified

cookiecutter-docker-science

Emerging

Maintenance 10/25

Adoption 18/25

Maturity 25/25

Community 25/25

Maintenance 0/25

Adoption 10/25

Maturity 16/25

Community 23/25

Stars: 9,723

Forks: 2,628

Downloads: 3,161

Commits (30d): 0

Language: Python

License: MIT

Stars: 360

Forks: 81

Downloads: —

Commits (30d): 0

Language: Makefile

License: Apache-2.0

No risk flags

Stale 6m No Package No Dependents

About cookiecutter-data-science

drivendataorg/cookiecutter-data-science

A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.

Built on the [cookiecutter](https://cookiecutter.readthedocs.io/) templating framework, it provides a Python package (`ccds` CLI) that scaffolds projects with standardized directories for data pipelines, notebooks, models, and documentation, along with `pyproject.toml` and Makefile conventions for reproducibility. The template separates raw/processed data, intermediate artifacts, and source code into a modular Python package structure with dedicated modules for dataset handling, feature engineering, and model training/inference.

About cookiecutter-docker-science

docker-science/cookiecutter-docker-science

Cookiecutter template for data scientists working with Docker containers

Provides scaffolding with separate development and production Dockerfiles, abstracted data source ingestion (S3, NFS, URL), and Make targets for container lifecycle management and Jupyter integration. Supports live code editing in host editors via volume mounts while maintaining reproducible environments, with development images using bind mounts and release images packaging complete projects. Streamlines common ML workflows including dependency management, notebook execution, linting, and testing within containerized workflows.

Related comparisons

cookiecutter-data-science and cookiecutter-ml-research cookiecutter-data-science and e2eml-cookiecutter cookiecutter-data-science and template-starter cookiecutter-data-science and biscuit-cutter cookiecutter-data-science and cookie-ml cookiecutter-data-science and biscuit-cutter

Scores updated daily from GitHub, PyPI, and npm data. How scores work