I am wondering what the main methods for estimating the Average Treatment Effect in Observational Studies are outside of matching. In matching, there are weighting, stratification, propensity score methods, etc. However, are there techniques outside of this? Or are they all based on same way on matching?
-
1A generic alternative to propensity score modeling is to use outcome modeling. For confounders $X$ and a treatment $A$, you can write down the model $E\{Y(a) \mid X = x\} = g(X,a,\beta)$ for some parameters $\beta$, get an estimate $\widehat \beta$, and approximate $\widehat E\{Y(a)\} = N^{-1} \sum_i g(X_i, a, \widehat \beta)$. This is the simplest instance of what is called the $g$-computation approach. You don't need the propensity score at all to do something like this, although it is sometimes helpful to plug it in as a predictor in the outcome model. – guy Jun 01 '21 at 04:39
-
1Even regression can help. Read here: https://stats.stackexchange.com/questions/493211/under-which-assumptions-a-regression-can-be-interpreted-causally/493905#493905 – markowitz Jun 01 '21 at 05:01
1 Answers
There is a separate class of techniques that use observational data but that leverage some specific aspect of the setting to "almost" (in a very vague sense) have an experimental setting. These are often called "quasi-experimental" settings, with two common techniques being difference in differences or instrumental variable (IV) techniques.
More generally, in any observational study where you're interested in some average treatment effect, you have some data and you impose assumptions to get a treatment effect. In experimental settings, the assumptions come for free: random assignment of treatment often gets you all you need. In observational settings, you need to instead assume something similar, and all these techniques are various assumptions that get you to where you need to be. As a researcher, you need to determine what assumptions are reasonable to impose. All the methods you define basically assume that treatment is random conditional on covariates. But this need not be true in the presence of unobserved factors. See this question for a more detailed answer about this specific point.
Finally, see this question for some more information about techniques outside of matching, and in particular check out the link provided in the comments to that question about other "techniques", which I also am linking here.

- 4,193
- 1
- 10
- 29
-
1Nice answer, though I do want to note that neither DiD nor IV estimate the ATE; DiD estimates the ATT and IV estimates the LATE. – Noah Jun 01 '21 at 16:44
-
Yes, you're right. Didn't want to get into that distinction and just leave it as some average treatment effect (for some population), but yes. I guess you could just assume no treatment effect heterogeneity on unobs, which is anyway assumed with these reweighting on observables :). It's a good point to make regardless, thanks. – doubled Jun 01 '21 at 16:59