I am trying to understand how an optimal tree length is determined using cost-complexity pruning.
Visit this lecture note screenshot: https://www.dropbox.com/s/qgr742zg45ahqgr/Screenshot%202015-02-04%2018.17.03.png?dl=0
In the highlighted equation, I don't understand how C(T) or alpha is calculated. My thought process is that a large tree is grown and then a subtree is created and then they are compared but I don't understand how.
Thanks in advance.