For my research I am looking into the relationship between an outcome (Y) and a predictor (X) as follows:
$$Y = X + e$$
where $e$ is the error term.
Because there might be reverse causality I am employing a 2 stage estimation strategy. My first stage being:
$$X = Z + u$$
where $u$ is an error term.
Now I was all concerned about the exclusion restriction and came up with a series of robustness checks. Instead the main critique I received concerned reverse causality in the first stage.
Of course I looked into this when I started this project and as far as I am aware there is two conditions that should be met for the instrument to be valid:
- relevance: Z should be correlated with X, or $corr(Z,X)= 0$;
- exogeneity: Z should not be correlated with unobserved factors influencing Y, or $corr(Z,e)\neq 0$.
Am I overlooking something? Or are there recent developments that I am not aware of maybe?
Any guidance much appreciated!