Gradient Boosting Machine

Machine Learning

Jesus A. Gonzalez

August 10, 2019

GRadient Boosted Trees

Gradient Boosted Trees

Boosting with Trees

Boosting with Trees

Boosting

How Boosting Works?

The Boosting Abstract Process

Splitting the Groups

Adjusting the Output for the Next Iteration

Result of a set of Gradient Boosting Trees

Regression vs Classification

The Boosting Regression Process

  1. Predict an initial estimate of 0.0
  2. Use the true values to calculate the error in the initial prediction
  3. Split the data into groups using the features of the data
    • The goal is putting data with similar error in the same group
  4. For each group, find the average error
  5. For every data point in that group, add the average error to the current prediction
  6. Calculate the new error for each point for the new prediction
  7. Repeat from 3 to 6 as many times as required

Parameters

Parameters

How a GBM Focuses on Parts of the Problem?

Regression Boosting Example

Regression Boosting Example

Regression Boosting Example

Regression Boosting Example

Regression Boosting Example

Regression Boosting Example

Regression Boosting Example

Regression Boosting Example

Regression Boosting Example

Regression Boosting Example

Regression Boosting Example

Regression Boosting Example

Regression Boosting Example

Regression Boosting Example

Regression Boosting Example

Regression Boosting Example

Regression Boosting Example

GBM for Classification

GBM for Classification - Steps

  1. Assigns 0 and 1 to categories
  2. Set initial prediction equal to average value
  3. Compute current error
  4. Split into groups using decision tree (regression)
  5. Compute amount of change for each point
  6. Convert all points to the infinite range
  7. Add amount of change to each point
  8. Convert all points back to 0-1 range
  9. Iterate as many times as needed
  10. Finish fitting the model