CodeCutTech/Data-science
Collection of useful data science topics along with articles, videos, and code
ArchivedCovers MLOps fundamentals (dependency management, CI/CD, data drift detection), data pipeline tools (dbt, DVC), and Python ecosystem practices (testing with pytest, dataframe optimization with Polars). Each article pairs hands-on code repositories and video tutorials with explanations of modern tools like Hydra for configuration, pre-commit hooks for automation, and GitHub Actions for ML deployment. Content spans infrastructure, testing, visualization, and LLM integration across distributed compute frameworks (Pandas, Spark, Dask).
4,180 stars.
Stars
4,180
Forks
1,058
Language
Jupyter Notebook
License
—
Category
Last pushed
Dec 02, 2025
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/CodeCutTech/Data-science"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
bodo-ai/Bodo
High Performance Data Processing in Python
yogeshhk/TeachingDataScience
Course notes for Data Science related topics, prepared in LaTeX
sreeharierk/datascience
This repository is a compilation of free resources for learning Data Science.
virgili0/Virgilio
Your new Mentor for Data Science E-Learning.
PacktWorkshops/The-Data-Science-Workshop
A New, Interactive Approach to Learning Data Science