Questions tagged [regression-discontinuity]

A regression discontinuity design (RDD) is an example of a quasi-experimental design in which the probability of receiving a treatment is a discontinuous functions of one or more underlying variables. Comparing observations near the discontinuity allows for the estimation of a local causal effect of the treatment on an outcome.

In a regression discontinuity design, treatment D get assigned on the basis of some continuous covariate or covariates crossing an explicit threshold. This often happens in circumstances where a treatment is triggered by an administrative or organizational rule. One example is the effect of scholarship receipt on college enrollment. If students become eligible when their test score or grade point average exceed some value, one can compare the difference in enrollment rates for students who are just below the cutoff to students who are just over it. The variable defining the discontinuity must not be easily changed by the agent (or anyone else) to obtain or avoid the treatment.

The basic formula is a ratio of two differences in means: $$\Delta_{RD} = \frac{\bar Y_+-\bar Y_-}{\bar D_+-\bar D_-},$$

where $Y$ is the outcome of interest, $D$ is the binary treatment indicator, and $+$ and $-$ subscripts indicate the position relative to the threshold.

There are two types of RDDs:

  • Sharp RDD: probability of treatment moves from one to zero at the threshold value, so the denominator above is 1.
  • Fuzzy RDD: probability of treatment is discontinuous at the threshold value, so the denominator is between one and zero.

Fuzzy RDD can be thought as a special case of instrumental variables and the Wald estimator. Sharp RDD can be considered a special case of matching.

67 questions
11
votes
2 answers

Graphs in regression discontinuity design in "Stata" or "R"

Lee and Lemieux (p. 31, 2009) suggest the researcher to present the graphs while doing Regression discontinuity design analysis (RDD). They suggest the following procedure: "...for some bandwidth $h$, and for some number of bins $K_0$ and $K_1$…
Metrics
  • 2,526
  • 2
  • 19
  • 31
7
votes
1 answer

Example of discontinous effect of x on y dataset (for paper)

For a paper, I need an example of a dataset $(x_i,y_i)$ where the residuals are $iid$ (the $x$ do not represent time) with a discontinuity on the effect of $x$ on $y$. I have already found a dataset in Berger and Pope 2010 but the jump there is…
user603
  • 21,225
  • 3
  • 71
  • 135
7
votes
3 answers

Fuzzy regression discontinuity design and exclusion restriction

In a fuzzy regression discontinuity design, what does the exclusion restriction look like in terms of a conditional expectation between the instrument in the first stage and the error term in the structural equation? In IV, we normally say that…
7
votes
3 answers

Simple regression models for data with a breakpoint

I am currently working on a segmented regression model with two variables $(x_i,y_i)$ for $i = 1..N$. The model should take the form: $y_i = \beta_0 + \beta_1 x_i \quad$ for $x_i < x_{crit}$ $y_i = \alpha_0 + \beta_0 + \beta_1 x_i \quad$ for $x_i…
Berk U.
  • 4,265
  • 5
  • 21
  • 42
6
votes
1 answer

Regression discontinuity design parametric versus non-parametric different result

I am using the parametric approach and non-parametric (local linear regression) approaches of regression discontinuity design (RDD) to compute the treatment effect using Stata. To get the user-written rd and the 102nd Congress data, I do this: net…
user227710
  • 605
  • 7
  • 21
6
votes
2 answers

Does it make sense to "cluster" when you use a regression discontinuity?

One of the breakthroughs of econometrics over the past two decades has been to employ "clustering" to take into account the correlation of error terms across observations. For instance, if you're evaluating the effect of an educational intervention…
d_a_c321
  • 1,129
  • 1
  • 10
  • 19
5
votes
2 answers

Quantile Regression with Regression Discontinuity

Sort of a methodological question: If one has an exogenous binary treatment and a continuous outcome variable Y and wants to estimate quantile treatment effects by exploiting a (sharp) discontinuity in the treatment, what is the advantage of using…
4
votes
2 answers

Graphical and Statistical Tests for Robustness of Sharp RD

I'm doing sharp regression discontinuity design with my treatment variable $$ D_i = \begin{cases} 1 \enspace \quad \text{if $x_i \geq \overline{x}$} \\ 0 \quad \text{otherwise} \end{cases} $$ where $\overline{x}$ is the threshold value of my forcing…
4
votes
1 answer

Fuzzy regression discontinuity design in Stata

I am currently running computations through a "Fuzzy" Regression discontinuity Design. Suppose my data are in the following form: $Z$: assignment variable; if $Z > Z_0$ then the person is assigned to the treatment with a certain probability $p_D$…
4
votes
0 answers

What is difference between interrupted time series and regression discontinuity design

Say that one has data over time, t, on an outcome, y. There is an event that happens at t==0. One is interested in testing for evidence that the event is related to (I am being cautious on a causal interpretation) a change in the outcome.…
bill999
  • 267
  • 3
  • 15
4
votes
4 answers

Regression discontinuity versus matching with spatial discontinuity

I am interested in using a spatial research design. Imagine a line, like a time zone line. For example, in the United States, the line that makes between Eastern Standard Time and Central Standard Time runs North to South through the U.S. (and…
user1690130
  • 755
  • 2
  • 11
  • 23
4
votes
0 answers

Difference-in-Discontinuities Design

Recently I came across the paper written by Grembi et al. (2016) and thought the methodology employed by them, Difference-in-Discontinuities Design (a combination of Regression Discontinuity and Difference-in-Difference), is very interesting. In…
4
votes
1 answer

Fixed effects in regression discontinuity design

I want to do a non parametric RDD type analysis to know the impact of an intervention (a single dummy variable) on an outcome variable. I have several 'boundaries' (which are actually different geographical locations) around which I will be picking…
4
votes
1 answer

Fuzzy RDD issue

I am fairly new to econometrics and maybe this is a very basic question to some. I am running a Fuzzy Regression Discontinuity (RD) design in Stata and I am having doubts about whether I am specifying my regressions correctly. My running variable is…
Luisa
  • 41
  • 1
  • 1
  • 2
3
votes
1 answer

Why and when would you use Regression Discontuinity (RD) instead of a linear model with a dummy?

More specifically, why can't you just run an OLS with a dummy for the treatment group, instead of a RD. What complications might arise when you do so? And as an extension of that, what criteria's do you have to look at when deciding whether to do…
1
2 3 4 5