Questions tagged [difference-in-difference]

Often abbreviated DID or DD, this is a technique for inferring causality from observational data. It compares the change in an outcome from before to after a treatment in a group that received the treatment with the change over the same period in an otherwise comparable group that did not.

Difference-in-difference (DD) is a version of fixed effects estimation. It uses longitudinal data to estimate the effect of a treatment administered at a certain point in time. The idea is to follow units having a common trend in the outcome variable $Y$ in period $t_1$ before the treatment. Some units are then treated ($T$) and some remain untreated controls ($C$). All units are observed again at time $t_2$ after the treatment. The difference-in-difference treatment effect $\delta$ is the average change experienced by the treatment units compared to the average change experienced by the control units,

$$\delta = (E[Y_{T,t_{2}}] - E[Y_{T,t_{1}}]) - (E[Y_{C,t_{2}}] - E[Y_{C,t_{1}}]).$$

The common pre-treatment trend assumption on $Y$ matters because it supports the argument that a different rate of change in the treated group after the treatment, relative to the control group, is due to the treatment itself and not to unobserved factors. Differencing removes fixed (time-invariant) effects but not time-varying ones.
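As a rough illustration of the formula for $\delta$, the sketch below computes the estimate from four group-by-period sample means. The data, the `outcomes` dictionary, and the `did_estimate` helper are all hypothetical and made up for this example. It also shifts every treated outcome by a constant to show that a fixed group-level difference cancels out of the estimate.

```python
from statistics import fmean

# Hypothetical outcomes, invented for illustration only.
# Keys are (group, period): "T" = treated, "C" = control; 1 = before, 2 = after.
outcomes = {
    ("T", 1): [10.0, 12.0, 11.0],
    ("T", 2): [15.0, 17.0, 16.0],
    ("C", 1): [8.0, 9.0, 10.0],
    ("C", 2): [10.0, 11.0, 12.0],
}

def did_estimate(y):
    """Sample analogue of (E[Y_T,t2] - E[Y_T,t1]) - (E[Y_C,t2] - E[Y_C,t1])."""
    change_treated = fmean(y[("T", 2)]) - fmean(y[("T", 1)])
    change_control = fmean(y[("C", 2)]) - fmean(y[("C", 1)])
    return change_treated - change_control

delta = did_estimate(outcomes)

# Add a constant (time-invariant) offset of 100 to the treated group in both periods.
shifted = {
    key: [v + 100.0 if key[0] == "T" else v for v in values]
    for key, values in outcomes.items()
}
delta_shifted = did_estimate(shifted)

print(delta, delta_shifted)
```

In this toy data the treated group changes by 5 and the control group by 2, so the estimate is 3, and it is the same with or without the constant offset. An offset that differed between the two periods would not cancel, which is why the common trend assumption carries the identification.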

616 questions
51
votes
4 answers

What is difference-in-differences?

Difference in differences has long been popular as a non-experimental tool, especially in economics. Can somebody please provide a clear and non-technical answer to the following questions about difference-in-differences. What is a…
Graham Cookson
  • 7,543
  • 6
  • 41
  • 35
26
votes
2 answers

Specifying a difference in differences model with multiple time periods

When I estimate a difference in differences model with two time periods, the equivalent regression model would be a. $Y_{ist} = \alpha +\gamma_s*Treatment + \lambda d_t + \delta*(Treatment*d_t)+ \epsilon_{ist}$ where $Treatment$ is a dummy which…
21
votes
2 answers

Difference in Difference method: how to test for assumption of common trend between treatment and control group?

Following a comment from a previous thread, I want to know how one can test for the assumption of common trend between the treatment and control group in the Difference in Difference method? Can I test that assumption with data of two time points…
Khan
  • 431
  • 1
  • 5
  • 12
14
votes
3 answers

Difference-in-differences with individual level panel data

What is the correct way to specify a difference in difference model with individual level panel data? Here is the setup: Assume that I have individual-level panel data embedded in cities for multiple years and the treatment varies on the city-year…
14
votes
1 answer

difference-in-differences with fixed effects

I have two questions related to having fixed effects in the DD model. I have a treatment that occurs at different times (e.g., 2001, 2005, etc.). I want to fit a DD model, so I standardize the treatment years to year "0" as the treatment time.…
11
votes
1 answer

3 related questions about DDD (TD, triple-diff) estimators

In Jeff Wooldridge's Econometric Analysis (2nd edition), he derives the expression for the difference-in-difference-in-differences (DDD) estimator on page 151 for the two period case where state B implements a health care policy change aimed at the…
dimitriy
  • 31,081
  • 5
  • 63
  • 138
10
votes
1 answer

Difference-in-difference-in-differences estimator

Two towns, $X$ and $Y$. In each town: Pool cross-sections of male and female hourly wages, one from the year before a wage-discrimination policy took effect and one from the year after. Consider the following…
ajohnrobertson
  • 493
  • 1
  • 6
  • 14
10
votes
2 answers

Difference-in-Differences Estimator for Logistic Regressions

I have a pre-post intervention study with four groups: 1) Pre-Intervention Control, 2) Pre-Intervention Treatment, 3) Post-Intervention Control, and 4) Post-Intervention Treatment. The outcome is a binary variable. There are four other predictor…
9
votes
1 answer

Data Setup for Differences-in-Differences

Which setup is correct for a difference in difference regression model using $Y_{ist} = \alpha +\gamma_s*T + \lambda d_t + \delta*(T*d_t)+ \epsilon_{ist}$ where T is a dummy which is equal to 1 if the observation is from the treatment group and…
B_Miner
  • 7,560
  • 20
  • 81
  • 144
9
votes
1 answer

ATT vs ATE in propensity score matching when using DiD estimates

According to Lee and Little 2017, when using propensity score (PS) methods, weighting on odds will generate the Average Treatment Effect on the Treated (ATT), while using subclassification and weighting by the inverse probability of treatment (IPTW)…
9
votes
1 answer

What is the best way to visualize difference-in-differences (multi-period) regression?

What's the best way to visualize difference-in-differences for both binary and continuous treatment? Do I regress the outcome variable on the set of controls but exclude the treatment variable and plot the residuals in each group (binary case)? Is…
9
votes
1 answer

How do I interpret a "difference-in-differences" model with continuous treatment?

How do I interpret the ATE coefficient (i.e., the post-treatment indicator interacted with the continuous variable)? Does it make sense? Should I break it down into subgroups and just run a fixed effects model instead (interact an indicator for each…
8
votes
0 answers

Time trends in Difference-in-difference estimation

I would really appreciate it if someone could explain to me (in econometric or intuitive terms) the importance of interacting control variables with linear time trends. I was running a simple diff-in-diff model, i.e.: outcome= B0*constant +…
7
votes
1 answer

Difference-in-difference in panel data

Under which conditions should we expect the difference-in-difference estimate to be equal to the equivalent panel data model? Strictly speaking, whenever we have an experiment that offers well-defined treated and control groups in two periods of…
7
votes
2 answers

How do I test if regression slopes are statistically different?

I have a set of data that is composed of a measured parameter (dependent variable) as a function of time (independent variable), before and after an intervention event. I have calculated slope and intercept for this dataset prior to, and subsequent…