KevinLiao159/MyDataSciencePortfolio
Applying Data Science and Machine Learning to Solve Real World Business Problems
Covers multiple ML domains including customer churn prediction, topic modeling with both scikit-learn and Apache Spark, and collaborative filtering recommender systems using KNN, ALS, and neural networks in Keras. Demonstrates distributed computing approaches for large-scale analysis, contrasting single-machine workflows with Spark's fault-tolerant cluster computing for processing at scale. Includes hands-on NLP implementations across NLTK, spaCy, and Gensim libraries, with notebooks designed for portability across Jupyter, Databricks, and Google Colab environments.
405 stars. No commits in the last 6 months.
Stars
405
Forks
225
Language
Jupyter Notebook
License
MIT
Category
Last pushed
Jul 05, 2020
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/KevinLiao159/MyDataSciencePortfolio"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Higher-rated alternatives
codeperfectplus/codeperfectplus
README PROFILE
sajal2692/data-science-portfolio
Portfolio of data science projects completed by me for academic, self learning, and hobby purposes.
archd3sai/Portfolio
This Portfolio is a compilation of all the Data Science and Data Analysis projects I have done...
MelihGulum/Comprehensive-Data-Science-AI-Project-Portfolio
A curated collection of AI, data engineering, and DevOps projects featuring real-world...
EgorTatarnikov/DS_ML_Learning_Portfolio
Репозиторий содержит проекты, выполненные в ходе изучения DS и ML. Также здесь представлены мои...