Q1. What is the approximate deph of a Decision Tree trained (without restrictions) on a training set with 1 million instances?
A1: The depth of a well-balanced binary tree containing m leaves is equal to , so when there is 1 million instances the approximate depth is
20.
Q2. Is a node's Gini impurity generally lower or greater than its parent's? Is it generally lower/greater, or always lower/greater?
A2: Generally lower, but not always lower. Consider A, B, A, A, A, Gini impurity = 1 - -