danielchristopher513/Brain_Stroke_Prediction_Using_Machine_Learning
Stroke is a disease that affects the arteries leading to and within the brain. A stroke occurs when a blood vessel that carries oxygen and nutrients to the brain is either blocked by a clot or ruptures. According to the WHO, stroke is the 2nd leading cause of death worldwide. Globally, 3% of the population are affected by subarachnoid hemorrhage, 10% with intracerebral hemorrhage, and the majority of 87% with ischemic stroke. 80% of the time these strokes can be prevented, so putting in place proper education on the signs of stroke is very important. The existing research is limited in predicting risk factors pertained to various types of strokes. Early detection of stroke is a crucial step for efficient treatment and ML can be of great value in this process. To be able to do that, Machine Learning (ML) is an ultimate technology which can help health professionals make clinical decisions and predictions. During the past few decades, several studies were conducted on the improvement of stroke diagnosis using ML in terms of accuracy and speed. The existing research is limited in predicting whether a stroke will occur or not. Machine Learning techniques including Random Forest, KNN , XGBoost , Catboost and Naive Bayes have been used for prediction.Our work also determines the importance of the characteristics available and determined by the dataset.Our contribution can help predict early signs and prevention of this deadly disease
Implements a complete ML pipeline using the Kaggle stroke dataset with preprocessing steps including median imputation for missing BMI values, dummy encoding for categorical features, and random oversampling to address class imbalance in the highly skewed target variable. Evaluates five models (Decision Tree, KNN, XGBoost, Random Forest, Logistic Regression) using confusion matrices, ROC-AUC curves, and 20-fold cross-validation, selecting Random Forest as the final classifier achieving 99.48% accuracy. Built with NumPy, Pandas, Scikit-learn, and Imbalanced-learn for data manipulation and visualization using Matplotlib and Seaborn.
No commits in the last 6 months.
Stars
29
Forks
9
Language
Jupyter Notebook
License
—
Category
Last pushed
May 11, 2023
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/danielchristopher513/Brain_Stroke_Prediction_Using_Machine_Learning"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
BenyoubOhtmane/BrainStroke
A Machine Learning System that work on prediction of Brain Stroke to take Immediate or...
aravind-selvam/ml-pipeline-using-stroke-data
This project demonstrates the implementation of a ML pipeline and CI/CD using data on heart...
asarkar2210/CarotidNet
Predicts stroke risk by fusing visual ultrasound anomalies with tabular clinical data using a...
PRIYAtechky/Genetic_Forecasting_of_Huntington_Disease
A Deep Neural Network model to predict Huntington’s Disease from genetic sequences. Focused on...
marinhodomingosm/post-stroke-aphasia-risk-analysis
🩺 Analyze post-stroke aphasia risks by investigating medication patterns and mental health...