I'm working on a project that involves the use of a Heckman selection model (more specifically a Roy or move-stay model, which is essentially a two-sided Heckman) of the following form:
$$ Y_{i1} = X_i\beta_1 + \varepsilon_{i1} \text{ if } Select_i=1 $$ $$ Y_{i0} = X_i\beta_0 + \varepsilon_{i0} \text{ if } Select_i=0 $$ $$ Select_i^* = X_i\gamma + Z_i\delta + \varepsilon_{i} $$ $$ Select_i = I(Select_i^*>0) $$
The model is identified only by the distributional assumptions on the error terms unless a variable is included in the first stage (the $Select_i^*$ equation) that can be plausibly excluded from the second stage (the $Y_{i1}$ and $Y_{i2}$ equations). Here this variable is $Z_i$.
Is there a rule of thumb for how "strong" the relationship between $Select_i$ and $Z_i$ needs to be for the model to be adequately identified? I'm thinking something like the rule of thumb in the instrumental variables literature that an F-test of the IV's should be 10 or above. I haven't been able to find anything similar for selection models. Is there something?