Missing Data Imputation ML Frameworks
Tools and frameworks for handling, imputing, and analyzing missing values in datasets across various modalities and domains. Does NOT include general data cleaning, time-series forecasting without imputation focus, or synthetic data generation unrelated to missingness mechanisms.
There are 27 missing data imputation frameworks tracked. 1 score above 70 (verified tier). The highest-rated is sktime/skpro at 84/100 with 314 stars and 41,366 monthly downloads.
Get all 27 projects as JSON
curl "https://pt-edge.onrender.com/api/v1/datasets/quality?domain=ml-frameworks&subcategory=missing-data-imputation&limit=20"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
| # | Framework | Score | Tier |
|---|---|---|---|
| 1 |
sktime/skpro
A unified framework for tabular probabilistic regression, time-to-event... |
|
Verified |
| 2 |
WenjieDu/PyGrinder
PyGrinder: a Python toolkit for grinding data beans into the incomplete for... |
|
Established |
| 3 |
WenjieDu/Awesome_Imputation
Awesome Deep Learning for Time-Series Imputation, including an unmissable... |
|
Established |
| 4 |
ocbe-uio/imml
A Python package for integrating, processing, and analyzing incomplete... |
|
Emerging |
| 5 |
DoubleML/doubleml-for-r
DoubleML - Double Machine Learning in R |
|
Emerging |
| 6 |
MIDASverse/rMIDAS
R package for missing-data imputation with deep learning |
|
Emerging |
| 7 |
vanderschaarlab/hyperimpute
A framework for prototyping and benchmarking imputation methods |
|
Emerging |
| 8 |
aangelopoulos/ltt
Learn then Test: Calibrating Predictive Algorithms to Achieve Risk Control |
|
Emerging |
| 9 |
imputr/imputr
Python library for easy and fast ML-based & conventional imputation techniques. |
|
Emerging |
| 10 |
feruzoripov/tsgap
Time-series missingness simulation separating mechanisms (MCAR/MAR/MNAR)... |
|
Emerging |
| 11 |
SAP/knn-sampler
Machine learning imputation method to recover the distribution of missing... |
|
Emerging |
| 12 |
haghish/mlim
mlim: single and multiple imputation with automated machine learning |
|
Emerging |
| 13 |
TyMill/SynthPred
A Julia package for synthetic data analysis, advanced imputation (ARIMA,... |
|
Experimental |
| 14 |
thibaultcordier/risk-control
A toolkit to calibrate predictive algorithms to achieve risk control. |
|
Experimental |
| 15 |
DoubleML/DoubleMLReplicationCode
Replication of Simulations in Bach et al. (2024) - DoubleML - An... |
|
Experimental |
| 16 |
blind-contours/CVtreeMLE
:deciduous_tree: :dart: Cross Validated Decision Trees with Targeted Maximum... |
|
Experimental |
| 17 |
Akchaykumar2004/Missing-Data-Doctor
🩺 Diagnose and treat missing values in machine learning datasets with tools... |
|
Experimental |
| 18 |
AmirhosseinHonardoust/Missing-Data-Doctor
Missing Data Doctor is a diagnostic and treatment toolkit for missing values... |
|
Experimental |
| 19 |
miriamspsantos/heterogeneous-distance-functions
A collection of heterogeneous distance functions handling missing values. |
|
Experimental |
| 20 |
missValTeam/Iscores
Scoring rules for missing values imputations (Michel et al., 2021) |
|
Experimental |
| 21 |
liangyuanhu/Variable-selection-w-missing-data
A general variable selection approach in the presence of missing data in... |
|
Experimental |
| 22 |
jannebor/dd_forecast
Code for predicting probabilities of threat for Data Deficient species of... |
|
Experimental |
| 23 |
fchamroukhi/FLaMingos
Functional Latent datA Models for clusterING heterogeneOus curveS |
|
Experimental |
| 24 |
miriamspsantos/synthetic-missing-data
A library for synthetic missing data generation. |
|
Experimental |
| 25 |
kennethleungty/DataWig-Missing-Data-Imputation
Imputation of Missing Data in Tables |
|
Experimental |
| 26 |
michelelagreca/Classification-On-Imputed-Data
Project of the 'Data and Information Quality' Course, aiming on describing... |
|
Experimental |
| 27 |
marcvidalbadia/functional-whitening
Online Material for Vidal and Aguilera (2022). Novel whitening approaches in... |
|
Experimental |