Yimeng-Zhang/feature-engineering-and-feature-selection
A Guide for Feature Engineering and Feature Selection, with implementations and examples in Python.
Covers the complete feature engineering pipeline from data exploration through selection, including univariate/bivariate analysis, missing value imputation, outlier detection, encoding, and dimensionality reduction techniques. Implementations leverage scikit-learn and pandas with Jupyter notebook demos that explain the rationale and trade-offs for each method. The guide emphasizes practical decision-making—when and why to apply specific techniques—rather than just algorithmic implementations.
1,639 stars. No commits in the last 6 months.
Stars
1,639
Forks
424
Language
Jupyter Notebook
License
—
Category
Last pushed
Sep 24, 2022
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/Yimeng-Zhang/feature-engineering-and-feature-selection"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
feature-engine/feature_engine
Feature engineering and selection open-source Python library compatible with sklearn.
alteryx/featuretools
An open source python library for automated feature engineering
cod3licious/autofeat
Linear Prediction Model with Automated Feature Engineering and Selection Capabilities
abess-team/abess
Fast Best-Subset Selection Library
rodrigo-arenas/Sklearn-genetic-opt
ML hyperparameters tuning and features selection, using evolutionary algorithms.