dmlc/xgboost

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

98
/ 100
Verified

Implements parallel tree boosting with built-in support for categorical features, missing value handling, and monotonic constraints without preprocessing. Uses a novel column-block structure for cache-aware tree construction and supports GPU acceleration via CUDA for faster training on large datasets. Integrates with ML platforms including scikit-learn, MLflow, and Optuna for hyperparameter optimization, with native support for feature importance analysis and SHAP explainability.

28,121 stars and 41,912,233 monthly downloads. Used by 120 other packages. Actively maintained with 45 commits in the last 30 days. Available on PyPI.

Maintenance 23 / 25
Adoption 25 / 25
Maturity 25 / 25
Community 25 / 25

How are scores calculated?

Stars

28,121

Forks

8,847

Language

C++

License

Apache-2.0

Last pushed

Mar 13, 2026

Monthly downloads

41,912,233

Commits (30d)

45

Dependencies

3

Reverse dependents

120

Get this data via API

curl "https://pt-edge.onrender.com/api/v1/quality/ml-frameworks/dmlc/xgboost"

Open to everyone — 100 requests/day, no key needed. Get a free key for 1,000/day.