Questions tagged [identifiability]

A model is identifiable if a single set of parameters can be found that will yield the best fit.

A model is identifiable if a single set of parameters can be found that will yield the best fit.

For example, consider the classic ANOVA model:
$$ y_{ij} = \mu + \alpha_j + \varepsilon_i $$ where $y_{ij}$ represents observed scores decomposed into a population mean, $\mu$, a mean shift, $\alpha_j$, associated with condition $j$, and each unit's individual divergence from their condition's mean, $\varepsilon_i$. When there are $J$ conditions, there are $J+1$ parameters to fit in this model. Without additional constraints, this model is unidentifiable; for instance if three conditions had means $3$, $4$, and $5$, they could be fit equally well with:
\begin{array}[llll] \mu \mu = 1 \qquad &\alpha_1 = \; \; \ 2 \qquad &\alpha_2 = \; \; \ 3 \qquad &\alpha_3 = \; \; \ 4 \\ \mu = 4 \qquad &\alpha_1 = -1 \qquad &\alpha_2 = \; \; \ 0 \qquad &\alpha_3 = \; \; \ 1 \\ \mu = 9 \qquad &\alpha_1 = -6 \qquad &\alpha_2 = -5 \qquad &\alpha_3 = -4 \\ ... \end{array} (Therefore, in practice the ANOVA is given additional constraints such as $\frac{1}{N}\sum_j n_j\alpha_j = 0$.)

Although issues with identifiability are trivial in the above example, they can be more subtle in other contexts. Identifiability concerns can arise in fitting structural equations models (), for example.

207 questions
42
votes
2 answers

What is model identifiability?

I know that with a model that is not identifiable the data can be said to be generated by multiple different assignments to the model parameters. I know that sometimes it's possible to constrain parameters so that all are identifiable, as in the…
Jack Tanner
  • 4,552
  • 3
  • 27
  • 39
16
votes
2 answers

r glmer warnings: model fails to converge & model is nearly unidentifiable

I have seen questions about this on this forum, and I have also asked it myself in a previous post but I still haven't been able to solve my problem. Therefore I am trying again, formulating the question as clearly as I can this time, with as much…
Brechje van Osch
  • 161
  • 1
  • 1
  • 5
14
votes
2 answers

Dirichlet Processes for clustering: how to deal with labels?

Q: What is the standard way to cluster data using a Dirichlet Process? When using Gibbs sampling clusters appear and dissapear during the sampling. Besides, we have a identifiability problem since the posterior distribution is invariant to cluster…
12
votes
2 answers

Identifiability of neural network models

It's quite intuitive that most neural network topologies/architectures are not identifiable. But what are some well-known results in the field? Are there simple conditions which allow/prevent identifiability? For example, all networks with…
11
votes
2 answers

Identifiability of a state space model (Dynamic Linear Model)

Take a general linear Gaussian state space model (SSM)(aka Dynamic Linear Model DLM): \begin{align} X_{t+1} &= FX_t + V_t \\ Y &= HX_t+W_t \\[10pt] V_t &\sim N(0,Q) \\ W_t &\sim N(0,R) \\ \end{align} I am interested in the…
Baz
  • 1,583
  • 3
  • 13
  • 26
9
votes
2 answers

Bayesian inference and degrees of freedom

While learning frequentist linear regressions, one thing the professors always talked about was about the number of degrees of freedom, I never saw this expression in a bayesian book though. Perhaps because bayesian methods don't need this number to…
9
votes
1 answer

Moments of $Y=X_1 + X_2 X_3 + X_4 X_5 X_6 +\cdots$

The $X_i$'s are i.i.d. and $X$ denotes any of these random variables. We assume here that $|E(X)|<1$ to guarantee convergence. I am interested in particular in the third moment $E(Y^3)$. For the first two moments, we have (see here): $$E(Y) =…
9
votes
1 answer

Identification of parameters problem

I always struggle to get the true essence of identification in econometrics. I know that we state that a parameter (say $\hat{\theta}$) can be identified if by simply looking at its (joint) distribution we can infer the value of the parameter. In a…
CharlesM
  • 573
  • 2
  • 12
9
votes
1 answer

How can I tell if a statistical model is "identified"?

My econometrics professor used the term "identified" in class. We are considering data generating processes of the form $$Y = \beta_0 + \beta_1 X + U$$ where $X$ is a random variable and $U$ is a random error term. Our regression lines take the…
Stan Shunpike
  • 3,623
  • 2
  • 27
  • 36
8
votes
1 answer

What's the problem with model identifiability?

I understand that in a decision perspective, identifiability of a model is needed to ensure the convergence (with increasing number of observations) of the parameters to estimate through a single value. But, if the non-identifiability of a given…
peuhp
  • 4,622
  • 20
  • 38
8
votes
1 answer

Definition of softmax function

This question follows up on stats.stackexchange.com/q/233658 The logistic regression model for classes {0, 1} is $$ \mathbb{P} (y = 1 \;|\; x) = \frac{\exp(w^T x)}{1 + \exp(w^T x)} \\ \mathbb{P} (y = 0 \;|\; x) = \frac{1}{1 + \exp(w^T x)} $$ Clearly…
8
votes
1 answer

Identifiability in a nonlinear regression problem

Suppose I'm working with the following model $y_i = \alpha(1-\exp(-\beta t_i))+\gamma(1-\exp(-\delta t_i)) + \varepsilon_i$. The $\varepsilon_i$ are i.i.d. gaussian with zero mean and I'm trying to find best fit values of…
Will
  • 1,118
  • 8
  • 16
7
votes
2 answers

Can anyone help explain this basic example of posterior

I am having trouble understanding the authors reasoning here. It is from "The Bayesian Choice" I am confused about why the posterior is initially written without depending on the data, and why we integrate the numerator. It is, Consider one…
7
votes
1 answer

Why are over identified models preferred over just identified models in Structural Equation Modeling?

It's often stated that for analysis using an SEM technique, it is preferred to use an overidentified model compared to a just identified model. Why is that so ? My intuition says that for an over identified model will have more than one solution and…
7
votes
0 answers

Rule of thumb for excluded variable in Heckman selection model?

I'm working on a project that involves the use of a Heckman selection model (more specifically a Roy or move-stay model, which is essentially a two-sided Heckman) of the following form: $$ Y_{i1} = X_i\beta_1 + \varepsilon_{i1} \text{ if }…
1
2 3
13 14