Questions tagged [tobit-regression]

Tobit regression is used to estimate a linear regression model when the dependent variable is censored, i.e. when it is only observed over an interval of its support.

Tobit regression deals with censored dependent variables for which standard regression models would yield inconsistent estimates. Examples for such cases are expenditure on automobiles as many individuals will have 0 expenditure for a given year (left censoring) or confidentiality issues as in the Current Population Survey where high incomes are top-coded (right censoring). Left and right censoring may occur at the same time.

For $d=1$ non-censored and $d=0$ censored observations with the censoring point $\gamma$, Tobit regressions estimate $$\begin{equation} f(y_i) = \left[ \frac{1}{\sqrt{2\pi \sigma^2}} exp \left(-\frac{1}{2\sigma^2}(y_i-x'_i\beta)^2\right) \right]^{d_i} \left[\Phi \left(\frac{\gamma - x'_i\beta}{\sigma}\right) \right]^{1-d_i} \end{equation}$$ via maximum likelihood (which requires that the errors are $\epsilon_i \sim N(0,\sigma^2)$). Under less strict assumptions it is also possible to use a two-step estimator in which the first step uses a Probit model to predict the censored outcomes (see Cameron and Trivedi (2009) for details).

Tobit regression can deal with unknown constant censoring points (see Carson and Sun (2007)) but becomes problematic with unknown changing censoring points.

130 questions
33
votes
2 answers

How to model non-negative zero-inflated continuous data?

I'm currently trying to apply a linear model (family = gaussian) to an indicator of biodiversity that cannot take values lower than zero, is zero-inflated and is continuous. Values range from 0 to a little over 0.25. As a consequence, there is quite…
14
votes
2 answers

Tobit model explanation

We have 100 participants in two groups, $n=50$ in each group. We used an assessment of ability of basic functioning at 4 time-points. The assessment comprises 6 questions, each scored 0 – 5. We do not have individual scores for each question, just…
Adam
  • 331
  • 1
  • 3
  • 6
12
votes
1 answer

Exact difference between two-part models (e.g., Cragg) and Tobit type 2 models (e.g., Heckman)

I want to run a regression where the dependant variable (DV) is the amount of funding (in USD) obtained by startups. Naturally the DV contains a lot of zero's (~55%) and has a continuous distribution for y>0. In general my understanding is that the…
JNWHH
  • 360
  • 2
  • 10
11
votes
2 answers

How should I model a continuous dependent variable in the $[0, \infty]$ range?

I have a dependent variable that can range from 0 to infinity, with 0s actually being correct observations. I understand censoring and Tobit models only apply when the actual value of $Y$ is partially unknown or missing, in which case data is said…
Robert Kubrick
  • 4,078
  • 8
  • 38
  • 55
10
votes
2 answers

Tobit model with R

Does anybody know where to find good application and examples (besides the manual and the book applied econometrics with R) using the tobit model with the packages AER? Edit I'm searching for a command to compute the marginal effects for y (not for…
MarkDollar
  • 5,575
  • 14
  • 44
  • 60
9
votes
1 answer

How to train a model when instead of a target we have a range where it is?

Often in machine learning we have a situation when target is numeric (real or integer). Each target comes with an associated input vector. The goal is to learn the mapping from the input vectors to the target. For example: (1.2, 'A', 3) -> 4.0 (3.2,…
7
votes
1 answer

Instrumental variable Tobit in R

I have a data generating process of the form: res1 <- rnorm(N); res2 <- res1*0.5 + rnorm(N) x <- z[,1]*2 + res1; ys <- x*b + res2; d <- (ys>0); #dummy variable y <- d*ys; (blatantly stolen from an old RHelp thread, where the question…
RoyalTS
  • 233
  • 2
  • 15
7
votes
1 answer

How should I handle a left censored predictor variable in multiple regression?

I have a dataset (N=350) for which I would like to regress a neuropsychological test score (continuous) on age, education, symptom severity (continuous), and diagnosis (binary). Symptom severity is censored: symptom severity score was only generated…
Jordan Harp
  • 71
  • 1
  • 2
7
votes
1 answer

Censored regression in R

I have a Tobit model of the form: \begin{align} Y^*_i &= X_i\beta + \epsilon_i \\ Y_i &= \max(Y^*_i,0). \end{align} The regressors are one continuous variable and 30 factors (modelling seasonal effects during the day). In my sample there are…
Richi W
  • 3,216
  • 3
  • 30
  • 53
6
votes
1 answer

What is really the Scale or Sigma parameter of a tobit regression?

In this most upvoted CV answer on that topic the "scale" parameter (aka "sigma" in Stata) thrown in a tobit regression output is explained to be "the estimated standard deviation of the residuals". (Since the question is over three years old, I…
jay.sf
  • 556
  • 3
  • 19
5
votes
1 answer

Tobit versus OLS

There is a dependent variable which is measured in £ and can take the form of £0-£100,000. It is effectively the value of the payment made. If it takes the form of £0 it means a payment was not made (because it wasn't authorised). If it takes the…
NickB2014
  • 743
  • 1
  • 9
  • 16
4
votes
2 answers

Tobit Model with a Variable Lower Censoring Bound?

I am working with NBA contract data and about 20 percent of my free agent sample do not receive contracts. Additionally, the distribution for those who do receive contracts is exponential. For each individual who did not receive a contract, I know…
user17558
  • 41
  • 1
4
votes
1 answer

Heckman selection model with difference-in-differences specification

Following my question on Tobit with DiD specification I am wondering if it is possible to estimate a heckman sample selection model with a Difference in Differences specification? For example in STATA, using the heckman command how do I implement a…
4
votes
2 answers

Tobit with difference-in-differences specification

Is it possible to estimate a tobit model (e.g. a nonlinear model) with a DiD (difference-in-differences) specification? If so, how does such specification look like? If it is possible is this implemented in R or Stata? Comment: @Dimitriy V.…
Steven
  • 141
  • 2
  • 9
4
votes
0 answers

Sign and size of OLS bias for Tobit models

I have a question related to the sign and size of the OLS bias in the case of a Tobit model. Consider the following model (1) Sample of observations $\{X_i,Y_i\}_{i=1}^n$, i.i.d., $X_i$ is a vector $k\times 1$ (2) $ Y_i^\star=X_i'\beta +U_i $ (3)…
TEX
  • 266
  • 3
  • 19
1
2 3
8 9