Statistics for Data Science ML Frameworks

Educational resources, textbooks, and comprehensive courses on probability, statistics, and statistical methods specifically for data science applications. Includes lecture notes, tutorials, and problem sets. Does NOT include general machine learning algorithms, deep learning frameworks, or discipline-specific statistics (e.g., biostatistics, econometrics).

There are 24 statistics-for-data-science frameworks tracked. The highest-rated is D2RS-2026spring/data-driven-reproducible-study at 48/100 with 18 stars. One of the top 10 is actively maintained.

Get all 24 projects as JSON

curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=statistics-for-data-science&limit=20"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
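The same request can be made from Python. The following is a minimal sketch using only the standard library. The helper names `build_url` and `fetch_projects` are hypothetical, and the default `limit=20` simply mirrors the curl command above; whether the API accepts a higher limit is not stated here. The response schema is also not documented on this page, so the sketch returns the decoded JSON without assuming any field names.

```python
import json
import urllib.parse
import urllib.request

# Endpoint taken from the curl command above.
BASE_URL = "https://pt-edge.onrender.com/api/v1/datasets/quality"


def build_url(domain="ml-frameworks",
              subcategory="statistics-for-data-science",
              limit=20):
    """Build the request URL with the same query parameters as the curl command."""
    query = urllib.parse.urlencode(
        {"domain": domain, "subcategory": subcategory, "limit": limit}
    )
    return f"{BASE_URL}?{query}"


def fetch_projects(url, timeout=10):
    """Fetch the URL and return the decoded JSON payload unchanged.

    Nothing is assumed about the payload's keys (assumption: the
    endpoint returns valid JSON); the caller inspects the result.
    """
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_projects(build_url())` performs the request and counts against the daily quota, so it may be worth printing the returned structure once to learn its shape before writing any further processing.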

| # | Framework | Description | Score | Tier |
|---|-----------|-------------|-------|------|
| 1 | D2RS-2026spring/data-driven-reproducible-study | Lecture notes for the course "Data-Driven Reproducible Research" | 48 | Emerging |
| 2 | unpingco/Python-for-Probability-Statistics-and-Machine-Learning | Jupyter Notebooks for Springer book "Python for Probability, Statistics, and... | 44 | Emerging |
| 3 | wangyingsm/Python-Data-Science-Handbook | A Chinese translation of Jake Vanderplas' "Python Data Science Handbook".... | 44 | Emerging |
| 4 | cfgranda/ps4ds | Probability and Statistics for Data Science: A self-contained introduction... | 43 | Emerging |
| 5 | aeturrell/python4DS | Python for Data Science. This repository hosts the code behind the online... | 42 | Emerging |
| 6 | APMonitor/pds | Machine Learning for Engineers in Python | 42 | Emerging |
| 7 | matteocourthoud/Machine-Learning-for-Economic-Analysis | Material for the exercise sessions of master course Machine Learning for... | 39 | Emerging |
| 8 | verri/dsp-book | Data Science Project: An Inductive Learning Approach | 36 | Emerging |
| 9 | muandet-lab/ipml-course | A course on imprecise probabilistic machine learning | 35 | Emerging |
| 10 | UWNETLAB/dcss_supplementary | Supplementary materials for McLevey 2021 Doing Computational Social Science... | 32 | Emerging |
| 11 | alioh/ds-100-ar | Arabic Translation of Data 100 Textbook at UC Berkeley http://www.textbook.ds100.org/ | 32 | Emerging |
| 12 | rickiepark/python4daml | Code repository for the book "코딩 뇌를 깨우는 파이썬" ("Python That Awakens Your Coding Brain", Hanbit Media, 2023) | 32 | Emerging |
| 13 | tomasonjo/graphs-network-science | Accompanying repository for my book about Graph Data Science | 30 | Emerging |
| 14 | apachecn/ds100-textbook-zh | [Translation] UCB DS100: Principles and Techniques of Data Science | 29 | Experimental |
| 15 | harrywang/misy331 | Course Website for MISY331 Machine Learning for Business | 28 | Experimental |
| 16 | Chandrakant817/Statistics-for-Data-Science | Statistics for Data Science and Machine Learning Handwritten Notes | 27 | Experimental |
| 17 | jdestefani/StatisticalFoundationsML_INFOF422 | Repository for the Statistical Foundation of Machine Learning class (INFO-F-422). | 26 | Experimental |
| 18 | DiogoRibeiro7/academic-presentations | Professional-grade presentations on advanced statistics, MCMC methods, and... | 15 | Experimental |
| 19 | luqigroup/cap-4611 | Course website for CAP 4611 | 15 | Experimental |
| 20 | AIML-research/ML4DS-Lecture | Machine Learning for Data Science lecture at Freie University Berlin during WiSe21/22 | 15 | Experimental |
| 21 | arushig02/Statistics-ML | Statistics for Machine Learning, Week 1 | 14 | Experimental |
| 22 | Vinod123456183/DSMP-1.0 | ML | 14 | Experimental |
| 23 | rugvedmhatre/machine-learning-summer | Course Website for ML Summer Course | 13 | Experimental |
| 24 | JosephMehdiyev/Statistics-and-Probability-with-Code-Applications | An open-source book written by Joseph Mehdiyev for educational and... | 10 | Experimental |