limexp / LightGBM

A fast, distributed, high performance gradient boosting (GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks. It is under the umbrella of the DMTK(http://github.com/microsoft/dmtk) project of Microsoft.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

LightGBM, Light Gradient Boosting Machine

Join the chat at https://gitter.im/Microsoft/LightGBM Build Status GitHub Issues Windows Build status Documentation Status PyPI version

LightGBM is a gradient boosting framework that uses tree based learning algorithms. It is designed to be distributed and efficient with the following advantages:

  • Faster training speed and higher efficiency
  • Lower memory usage
  • Better accuracy
  • Parallel and GPU learning supported
  • Capable of handling large-scale data

For more details, please refer to Features.

Experiments on public datasets show that LightGBM can outperform existing boosting frameworks on both efficiency and accuracy, with significantly lower memory consumption. What's more, the experiments show that LightGBM can achieve a linear speed-up by using multiple machines for training in specific settings.

News

08/02/2017: Optimal split for Categorical Features. Now LightGBM can provide much better accuracy when using categorical features. Compared with one-hot coding, LightGBM's new solution shows a great improvement.

07/13/2017: Gitter is avaiable.

06/20/2017: Python-package is on PyPI now.

06/09/2017: LightGBM Slack team is available.

05/03/2017: LightGBM v2 stable release.

04/10/2017: LightGBM supports GPU-accelerated tree learning now. Please read our GPU Tutorial and Performance Comparison.

02/20/2017: Update to LightGBM v2.

02/12/2017: LightGBM v1 stable release.

01/08/2017: Release R-package beta version, welcome to have a try and provide feedback.

12/05/2016: Categorical Features as input directly(without one-hot coding). Experiment on Expo data shows about 8x speed-up with same accuracy compared with one-hot coding.

12/02/2016: Release python-package beta version, welcome to have a try and provide feedback.

External (unofficial) Repositories

Julia Package: https://github.com/Allardvm/LightGBM.jl

JPMML: https://github.com/jpmml/jpmml-lightgbm

Get Started And Documents

To get started, please follow the Installation Guide and Quick Start.

External Links

Useful if you are looking for details:

Support

You can ask questions and join the development discussion on:

You can also create bug reports and feature requests (not including questions) in Github issues.

How to Contribute

LightGBM has been developed and used by many active community members. Your help is very valuable to make it better for everyone.

  • Check out call for contributions to see what can be improved, or open an issue if you want something.
  • Contribute to the tests to make it more reliable.
  • Contribute to the documents to make it clearer for everyone.
  • Contribute to the examples to share your experience with other users.
  • Check out Development Guide.
  • Open issue if you met problems during development.

Microsoft Open Source Code of Conduct

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.

About

A fast, distributed, high performance gradient boosting (GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks. It is under the umbrella of the DMTK(http://github.com/microsoft/dmtk) project of Microsoft.

License:MIT License


Languages

Language:C++ 63.0%Language:Python 14.3%Language:R 13.9%Language:C 8.1%Language:Shell 0.4%Language:CMake 0.3%