Is there any academic work on any Decision Tree that fits a regression at its final leaf node?
For instance, suppose I have 100 features (X), and use them build a tree with 3 depths such that I have 8 leaf nodes.
Standard tree approach gives the mean value of Y for the observations in the 8 leaf nodes.
Is there any decision tree algorithm in academic literature that instead does a regression of the Y’s on X for observations in each of the 8 leaf nodes ?