Time trends in Difference-in-difference estimation

Question

I would really appreciate if some one could explain it to me (through econometric or intuitive sense)the importance of interacting control variables with linear time trends?

I was running a simple diff-in-diff model i.e:

outcome= B0*constant + B1*treat + B2time + B3* treat*time + B4*controls + error.

Where treat= 1 if the group receives the treatment, 0 otherwise time= 1 if treated in time period 2, 0 otherwise and the interaction term between them both gives us the treatment effect (through double differencing) and controls are all HH level characteristics.Since the treatment was given in time period 2 ( there are two time periods within the data set)

I was told to reform this model and to "control for the differences in time trends" by interacting each of the control variables with the time dichotomous variable. So now my model looks like this:

outcome= B0*constant + B1*treat + B2time + B3* treat*time + B4*control 1 + B5*control 2 + B5*control 3 + B6*control 4 + B7*controls 1*time + B8*controls 2*time + B9*control 3*time + B10*control 4*time + error term.

Basically, interact each control with the time variable- to quote un quote "control for the difference in time trends within these variables."

I understand the econometric sense behind interacting the treatment variable with the time variable (First this takes the difference across time within group- eliminating any group specific unobserved but time fixed effects. Then it takes the difference of the differences to rid of any time trends in the results. Thereby giving us the impact of the program onto the outcome by way of double differencing). And I would like to derive similar understanding/intuition for the interaction terms of my control variables as well.

But I fail to understand the econometric rational behind interacting the control variables with the time trend. For the following reasons:

Firstly: By including in the time dummy variable, are we not automatically controlling for the difference in the time trends within the specification? (as in the first model, where time was just introduced as a dummy and not interacted wit the controls).

Secondly: How does the interaction term (control x time) control for the differences in time trends between the two time periods? Intuitively speaking?

Lastly: How are the interaction terms with time (control x time; treatment x time) excludable? i.e: they only are able to affect our LHS through the intervention itself, and not by any other means. I kind of understand how the treatment x time interaction term does that (through the double difference mechanism described above). But how would interacting, let's say household wealth score with time period, make the entire term excludable onto child schooling outcome for time period 2?

Would really like some insight in this matter. Or any paper, citations, sources that have followed a similar empirical strategy to the second model shown above.

Thanks!

You control variables are at the household level or at the group (treatment or control) level? — Andy, Mar 07 '15 at 20:54
At the household level. They include things such as location (rural/urban), mother's education,wealth score etc.My LHS variable is the dichotomous child enrollment status. And the treatment is an unconditional cash transfer (UCT); but can not be used directly in the equation because of endogeneity. So instead I am using a proxy variable, which is an interaction term between time and a set of indicators(used to determine HH eligibility to receive the UCT). I was told that this interaction term would make the overall proxy variable excludable to the LHS child schooling outcome? @Andy — troubled student, Mar 08 '15 at 09:07
Basically @Andy I am confused about how by interacting the variables with time we are making the overall interaction term a) excludable to our LHS variable- when time is interacted with the proxy variable. and b) also control for differences in time trends- when time is interacted with the controls. — troubled student, Mar 08 '15 at 09:22
Ok, just one more clarification question: what do you mean by "proxy" variable? Usually those are used when you have an omitted variable (e.g. ability in an earnings regression) and you get a variable that's highly correlated with it (e.g. IQ scores). — Andy, Mar 08 '15 at 10:39
In this case HH's that are eligible to receive the UCT may already be suffering from poor child outcomes;conversely this UCT may improve child educational outcomes-hence OVB (unobserved socio-economic status). The proxy is made up of predicted probabilities of the set of indicators (land ownership, livestock etc) used to determine HH eligibility for UCT,that don't affect child enrollment. It then interacts these predicted probabilities with time to make the overall term excludable with LHS variable. (i.e: The proxy is the interaction term b/w predicted probabilities of the indicators & time). — troubled student, Mar 08 '15 at 13:01

Time trends in Difference-in-difference estimation

0 Answers0

Linked