shantanu1109/IBM-HR-Analytics-Employee-Attrition-and-Performance-Prediction

In this project, we enlisted the numerical and categorical attributes present in the publicly available dataset. Missing values were dropped to give better insights in data analysis. ANOVA and Chi-Square tests were carried out during statistical analysis. Machine Learning algo's were applied to understand, manage, and mitigate employee attrition.

36
/ 100
Emerging

Implements an ensemble approach comparing seven classifiers (Logistic Regression, Random Forest, SVM, XGBoost, LightGBM, CatBoost, AdaBoost) with ROC curve visualization using hvPlot, leveraging scikit-learn, XGBoost, LightGBM, and CatBoost libraries. The pipeline performs one-hot encoding of categorical features via pandas `get_dummies()`, trains/test splits via scikit-learn, and evaluates models using accuracy scores and confusion matrices. Targets the IBM HR Analytics dataset (1,470 records, 35 attributes) to identify attrition risk factors through statistical testing and predictive modeling.

No commits in the last 6 months.

Stale 6m No Package No Dependents
Maintenance 0 / 25
Adoption 7 / 25
Maturity 16 / 25
Community 13 / 25

How are scores calculated?

Stars

30

Forks

5

Language

Jupyter Notebook

License

MIT

Last pushed

Oct 05, 2023

Commits (30d)

0

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/shantanu1109/IBM-HR-Analytics-Employee-Attrition-and-Performance-Prediction"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.