I'm having trouble making sense of the exclusion restriction in instrumental variables.
I understand that the unbiased treatment effect is $B = \frac{Cov(Y, Z)}{Cov(S, Z)}$, where $Y$ is the outcome, $S$ is the treatment, and $Z$ is the instrument. In other words, $B = \frac{ITT} {\text{Compliance Rate}}$.
However, if I think about this in a mediation framework, and apply the exclusion restriction, this makes less and less sense.
In a mediation framework, ITT = the total effect, or the $Cov(S,Z)\cdot Cov(Y,S) + Cov(Y,Z|S)$. So, the unbiased treatment effect is:
$\frac{(Cov(S,Z)\cdot Cov(Y,S) + Cov(Y,Z|S))}{Cov(S,Z)}$, which reduces to:
$Cov(Y,S) + \frac{Cov(Y, Z|S)}{Cov(S, Z)}$,
so the unbiased causal estimate is the effect of the biased treatment + the effect of the instrument ($\frac{controlling for the treatment} { compliance rate}$.
However, with the exclusion restriction, there is not supposed to be an effect of the instrument once we control for the treatment.
An example, from Gelman's Sesame Street example. First, obtaining the unbiased treatment effect via 2SLS:
fit.2s <- lm(regular ~ encour, data = df)
watched.hat <- fit.2s$fitted
fit.2b <- lm(postlet ~ watched.hat, data = df)
summary(fit.2b)
which gives the answer, 7.934.
And now, within an SEM framework:
library(foreign)
library(lavaan)
mod <-
'
regular ~ a*encour
postlet ~ b*regular + c*encour
ind := a*b
total := a*b + c
'
fit <- sem(mod, data = df)
summary(fit)
Regressions:
Estimate Std.Err Z-value P(>|z|)
regular ~
encour (a) 0.362 0.051 7.134 0.000
postlet ~
regular (b) 13.698 2.079 6.589 0.000
encour (c) -2.089 1.802 -1.160 0.246
Defined Parameters:
Estimate Std.Err Z-value P(>|z|)
ind 4.965 1.026 4.840 0.000
total 2.876 1.778 1.617 0.106
13.698 - 2.089/.362 = 7.92
So, the only reason that the unbiased treatment effect is not just the biased treatment effect is the there is still an effect of the instrument when controlling for the treatment, which, according to the exclusion restriction should not happen.
Am I missing something here?