xgboost and LightGBM

These are competitors offering alternative implementations of gradient boosting algorithms, where XGBoost is more mature and widely adopted (evidenced by substantially higher stars and downloads) while LightGBM emphasizes speed and memory efficiency through leaf-wise tree growth, forcing practitioners to choose based on performance characteristics and specific use case requirements.

xgboost
98
Verified
LightGBM
71
Verified
Maintenance 23/25
Adoption 25/25
Maturity 25/25
Community 25/25
Maintenance 20/25
Adoption 10/25
Maturity 16/25
Community 25/25
Stars: 28,121
Forks: 8,847
Downloads: 41,912,233
Commits (30d): 45
Language: C++
License: Apache-2.0
Stars: 18,157
Forks: 3,988
Downloads:
Commits (30d): 15
Language: C++
License: MIT
No risk flags
No Package No Dependents

About xgboost

dmlc/xgboost

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

Implements parallel tree boosting with built-in support for categorical features, missing value handling, and monotonic constraints without preprocessing. Uses a novel column-block structure for cache-aware tree construction and supports GPU acceleration via CUDA for faster training on large datasets. Integrates with ML platforms including scikit-learn, MLflow, and Optuna for hyperparameter optimization, with native support for feature importance analysis and SHAP explainability.

About LightGBM

lightgbm-org/LightGBM

A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.

Implements leaf-wise tree growth with histogram-based learning to reduce memory footprint and accelerate training on CPU and GPU hardware. Provides native bindings for Python, R, and C++, with ecosystem integrations including FLAML for AutoML, Optuna for hyperparameter tuning, and model compilers like Treelite and Hummingbird for production deployment.

Scores updated daily from GitHub, PyPI, and npm data. How scores work