Following up on my question "Heckman sample selection vs. OLS" about the meaning of the Mills ratio, I am wondering why some researchers estimate a generalization of Heckman's procedure instead of the original procedure itself (based on Das et al. 2003, http://www.jstor.org/stable/3648610, gated version).
Instead of using only the Mills ratio, the generalized version includes the predicted probabilities, the Mills ratio, their squares, and their interaction terms in the second step.
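To make the comparison concrete, here is how I would write the two second-step regressions. This is only a sketch in my own notation, not taken from the paper: I assume a first-step probit with index $z_i'\hat\gamma$, and write $\hat p_i = \Phi(z_i'\hat\gamma)$ for the predicted probability and $\hat\lambda_i = \phi(z_i'\hat\gamma)/\Phi(z_i'\hat\gamma)$ for the (inverse) Mills ratio. The coefficient names $\delta_1, \dots, \delta_5$ are hypothetical labels.

$$
\text{Basic Heckman:}\quad y_i = x_i'\beta + \delta_1 \hat\lambda_i + e_i
$$

$$
\text{Generalized:}\quad y_i = x_i'\beta + \delta_1 \hat\lambda_i + \delta_2 \hat p_i + \delta_3 \hat\lambda_i^2 + \delta_4 \hat p_i^2 + \delta_5 \hat\lambda_i \hat p_i + e_i
$$

Written this way, the basic second step is the special case $\delta_2 = \delta_3 = \delta_4 = \delta_5 = 0$ of the generalized one, if I have understood the specification correctly.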
I do not understand the advantages and disadvantages of this approach compared to the "basic" Heckman sample selection model. Which model should I prefer for testing sample selection?