Decision trees

Introduction

Decision trees do recursive partitioning of the data for classification and/or
regression tasks.

DT comes with the risk of overfitting.

  • Advantages
    • Output is easy to understand
    • Can combine numeric and categorical data
    • Robust (outliers)
    • Fast (after developing the rules)
  • Disadvantages
    • Prone to overfitting
    • Limited to the range of attributes in the training data
    • Unstable (small perturbation input -> larger perturbation output)
      • Categorical data

Examples

Outgoing relations

Incoming relations

Contributors

  • Sander Mooren