XGBoost vs. LightGBM vs. CatBoost vs. H2O vs. MLflow

I happened to come across the blog post XGBoost vs LightGBM: How Are They Different. Let's look a bit wider and deeper at the following five open-source machine learning packages.

1. Some Readings

Cited from XGBoost vs LightGBM: How Are They Different

Two of the most popular algorithms that are based on Gradient Boosted Machines are XGBoost and LightGBM. 

2. Concepts

3. Amazon.com - Employee Access Challenge

In this post, I'm going to use a very old Kaggle dataset, Amazon.com - Employee Access Challenge.
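As a rough sketch of how this dataset could be prepared for any of the packages below, the snippet separates the label from the features using only the standard library. It assumes the training file is a CSV whose label column is named ACTION, as in the Kaggle download. The helper name split_features_and_label and the tiny in-memory sample are hypothetical and only illustrate the shape of the data.

```python
import csv
import io


def split_features_and_label(csv_file, label_column="ACTION"):
    """Read CSV rows and separate the label column from the feature columns."""
    reader = csv.DictReader(csv_file)
    features, labels = [], []
    for row in reader:
        # Assumption: every column, including the label, holds an integer ID.
        labels.append(int(row.pop(label_column)))
        features.append({name: int(value) for name, value in row.items()})
    return features, labels


# Made-up two-row sample (not real data), standing in for open("train.csv").
sample = io.StringIO("ACTION,RESOURCE,MGR_ID\n1,100,200\n0,101,201\n")
X, y = split_features_and_label(sample)
```

With the real file, the same function could be called as split_features_and_label(open("train.csv", newline="")), assuming the file sits in the working directory.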

4. XGBoost

Please make sure to:
- either rebuild your own XGBoost
- or download my built wheel from https://longervision.us/

➜  ~ pip show xgboost
Name: xgboost
Version: 1.6.0.dev0
Summary: XGBoost Python Package
Home-page: https://github.com/dmlc/xgboost
Author:
Author-email:
License: Apache-2.0
Location: /home/lvision/.local/lib/python3.8/site-packages
Requires: numpy, scipy
Required-by: autoviz

5. LightGBM

Please make sure to:
- either rebuild your own LightGBM
- or download my built wheel from https://longervision.us/

➜  ~ pip show lightgbm
Name: lightgbm
Version: 3.3.2.99
Summary: LightGBM Python Package
Home-page: https://github.com/microsoft/LightGBM
Author:
Author-email:
License: The MIT License (Microsoft)
Location: /home/lvision/.local/lib/python3.8/site-packages
Requires: numpy, scikit-learn, scipy, wheel
Required-by:

Pay attention: to build XGBoost or LightGBM successfully, you may have to:
- either rebuild your own CuPy
- or download my built wheel from https://longervision.us/

6. CatBoost

➜  ~ pip show catboost
Name: catboost
Version: 1.0.4
Summary: Catboost Python Package
Home-page: https://catboost.ai
Author: CatBoost Developers
Author-email:
License: Apache License, Version 2.0
Location: /home/lvision/.local/lib/python3.8/site-packages
Requires: graphviz, matplotlib, numpy, pandas, plotly, scipy, six
Required-by:

7. H2O

➜  ~ pip show h2o
Name: h2o
Version: 3.37.0.6
Summary: H2O, Fast Scalable Machine Learning, for python
Home-page: https://github.com/h2oai/h2o-3.git
Author: H2O.ai
Author-email: support@h2o.ai
License: Apache v2
Location: /home/lvision/.local/lib/python3.8/site-packages
Requires: future, requests, tabulate
Required-by:

8. MLflow

➜  pip show mlflow
Name: mlflow
Version: 1.24.1.dev0
Summary: MLflow: A Platform for ML Development and Productionization
Home-page: https://mlflow.org/
Author: Databricks
Author-email:
License: Apache License 2.0
Location: /home/lvision/.local/lib/python3.8/site-packages
Requires: alembic, click, cloudpickle, databricks-cli, docker, entrypoints, Flask, gitpython, gunicorn, importlib-metadata, numpy, packaging, pandas, prometheus-flask-exporter, protobuf, pytz, pyyaml, querystring-parser, requests, scipy, sqlalchemy, sqlparse
Required-by:

Please make sure to:
- either rebuild your own MLflow
- or download my built wheel from https://longervision.us/
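The pip show checks above can also be done in one pass from Python. This is a minimal sketch using the standard-library importlib.metadata module (available from Python 3.8). The helper name installed_versions is hypothetical, and the package list simply mirrors the five packages covered in this post.

```python
from importlib.metadata import PackageNotFoundError, version

# Distribution names as they appear in the pip show output above.
PACKAGES = ["xgboost", "lightgbm", "catboost", "h2o", "mlflow"]


def installed_versions(names):
    """Map each distribution name to its installed version, or None if missing."""
    found = {}
    for name in names:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            # The package is not installed in the current environment.
            found[name] = None
    return found


if __name__ == "__main__":
    for name, ver in installed_versions(PACKAGES).items():
        print(f"{name}: {ver or 'not installed'}")
```

A locally built wheel should report its development version here (for example 1.6.0.dev0 for the XGBoost build shown earlier), which is a quick way to confirm that the intended build is the one being imported.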