I am trying to understand this paper better Greedy function approximation: A gradient boosting machine, but I start having difficulty at around Equation (6) and (7).
What does a gradient w.s.t. to an equation mean ($\partial F(x)$), could anyone provide a concrete example showing how it could be calculated?