scikit-learn-contrib/imbalanced-learn

A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning

/ 100

Verified

Provides over-sampling, under-sampling, and hybrid re-sampling algorithms (SMOTE, ADASYN, tomek links) that integrate directly with scikit-learn's pipeline API for seamless preprocessing. Implements statistical and distance-based techniques to generate synthetic minority samples or remove noisy majority instances while maintaining data integrity. Supports TensorFlow and Keras models alongside traditional scikit-learn estimators for end-to-end imbalanced data workflows.

7,090 stars. Used by 23 other packages. Available on PyPI.

Maintenance 10 / 25

Adoption 15 / 25

Maturity 25 / 25

Community 24 / 25

How are scores calculated?

Stars

7,090

Forks

1,328

Language

Python

License

MIT

Compare

imbalanced-learn and imbalanced-ensemble imbalanced-learn and machine-learning-imbalanced-data

Related frameworks

ZhiningLiu1998/imbalanced-ensemble

🛠️ Class-imbalanced Ensemble Learning Toolbox. | 类别不平衡/长尾机器学习库 [NeurIPS'25]

solegalli/machine-learning-imbalanced-data

Code repository for the online course Machine Learning with Imbalanced Data

ZhiningLiu1998/awesome-imbalanced-learning

😎 Everything about class-imbalanced/long-tail learning: papers, codes, frameworks, and libraries...

artefactory/mgs-grf

MGS-GRF for imbalanced-mixed-tabular data (AISTATS 2026 and ECML-PKDD 2025)

getspams/spams-python

Python interface for SPAMS (SPArse Modeling Software)

Explore ML Frameworks

All categories Trending ML Framework directory Insights