ChenglongChen/kaggle-HomeDepot
3rd Place Solution for HomeDepot Product Search Results Relevance Competition on Kaggle.
Employs a multi-stage ensemble combining XGBoost, Keras neural networks, and RGF regressors with extensive feature engineering—including Word2Vec/GloVe embeddings, text similarity metrics, and domain-specific dictionaries for spelling correction and color data. Uses hyperopt for automated hyperparameter tuning across learners, stacking predictions across cross-validation folds for 2nd/3rd level ensemble aggregation. Feature selection combines regex-based manual curation with correlation filtering, leveraging thousands of candidate features derived from product titles, search queries, and descriptions through NLP preprocessing.
466 stars. No commits in the last 6 months.
Stars
466
Forks
204
Language
Python
License
MIT
Category
Last pushed
Dec 31, 2018
Commits (30d)
0
Get this data via API
curl "https://pt-edge.onrender.com/api/v1/quality/nlp/ChenglongChen/kaggle-HomeDepot"
Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.
Related tools
ChenglongChen/kaggle-CrowdFlower
1st Place Solution for CrowdFlower Product Search Results Relevance Competition on Kaggle.
ai-forever/digital_peter_aij2020
Materials of the AI Journey 2020 competition dedicated to the recognition of Peter the Great's...
minerva-ml/open-solution-avito-demand-prediction
Open solution to the Avito Demand Prediction Challenge
thomasthaddeus/DSComp
This repository is for a sharing work from a competition on kaggle were teamed up on.
DaoyuanLi2816/Kaggle-Eedi-Mining-Misconceptions-in-Mathematics-Silver-Medal
Silver Medal Solution for the Kaggle Competition: Eedi - Mining Misconceptions in Mathematics