Questions tagged [identifiability]

A model is identifiable if a single set of parameters can be found that will yield the best fit.

For example, consider the classic ANOVA model:
$$ y_{ij} = \mu + \alpha_j + \varepsilon_i $$ where $y_{ij}$ represents observed scores decomposed into a population mean, $\mu$, a mean shift, $\alpha_j$, associated with condition $j$, and each unit's individual divergence from their condition's mean, $\varepsilon_i$. When there are $J$ conditions, there are $J+1$ parameters to fit in this model. Without additional constraints, this model is unidentifiable; for instance if three conditions had means $3$, $4$, and $5$, they could be fit equally well with:
\begin{array}[llll] \mu \mu = 1 \qquad &\alpha_1 = \; \; \ 2 \qquad &\alpha_2 = \; \; \ 3 \qquad &\alpha_3 = \; \; \ 4 \\ \mu = 4 \qquad &\alpha_1 = -1 \qquad &\alpha_2 = \; \; \ 0 \qquad &\alpha_3 = \; \; \ 1 \\ \mu = 9 \qquad &\alpha_1 = -6 \qquad &\alpha_2 = -5 \qquad &\alpha_3 = -4 \\ ... \end{array} (Therefore, in practice the ANOVA is given additional constraints such as $\frac{1}{N}\sum_j n_j\alpha_j = 0$.)

Although issues with identifiability are trivial in the above example, they can be more subtle in other contexts. Identifiability concerns can arise in fitting structural equations models (sem), for example.

207 questions

votes

2 answers

What is model identifiability?

I know that with a model that is not identifiable the data can be said to be generated by multiple different assignments to the model parameters. I know that sometimes it's possible to constrain parameters so that all are identifiable, as in the…

identifiability

asked Jan 05 '12 at 02:59

Jack Tanner

4,552
3
27
39

votes

2 answers

r glmer warnings: model fails to converge & model is nearly unidentifiable

I have seen questions about this on this forum, and I have also asked it myself in a previous post but I still haven't been able to solve my problem. Therefore I am trying again, formulating the question as clearly as I can this time, with as much…

r lme4-nlme convergence glmm identifiability

asked Aug 03 '15 at 09:10

Brechje van Osch

votes

2 answers

Dirichlet Processes for clustering: how to deal with labels?

Q: What is the standard way to cluster data using a Dirichlet Process? When using Gibbs sampling clusters appear and dissapear during the sampling. Besides, we have a identifiability problem since the posterior distribution is invariant to cluster…

bayesian clustering markov-chain-montecarlo dirichlet-process identifiability

asked Jan 27 '15 at 17:39

alberto

2,646
16
36

votes

2 answers

Identifiability of neural network models

It's quite intuitive that most neural network topologies/architectures are not identifiable. But what are some well-known results in the field? Are there simple conditions which allow/prevent identifiability? For example, all networks with…

neural-networks conv-neural-network recurrent-neural-network identifiability

asked Nov 30 '17 at 18:59

DeltaIV

15,894
4
62
104

votes

2 answers

Identifiability of a state space model (Dynamic Linear Model)

Take a general linear Gaussian state space model (SSM)(aka Dynamic Linear Model DLM): \begin{align} X_{t+1} &= FX_t + V_t \\ Y &= HX_t+W_t \\[10pt] V_t &\sim N(0,Q) \\ W_t &\sim N(0,R) \\ \end{align} I am interested in the…

time-series bayesian kalman-filter state-space-models identifiability

asked Jul 09 '15 at 13:02

Baz

1,583
3
13
26

votes

2 answers

Bayesian inference and degrees of freedom

While learning frequentist linear regressions, one thing the professors always talked about was about the number of degrees of freedom, I never saw this expression in a bayesian book though. Perhaps because bayesian methods don't need this number to…

bayesian degrees-of-freedom hierarchical-bayesian identifiability

asked May 08 '13 at 16:04

random_user

votes

1 answer

Moments of $Y=X_1 + X_2 X_3 + X_4 X_5 X_6 +\cdots$

The $X_i$'s are i.i.d. and $X$ denotes any of these random variables. We assume here that $|E(X)|<1$ to guarantee convergence. I am interested in particular in the third moment $E(Y^3)$. For the first two moments, we have (see here): $$E(Y) =…

machine-learning distributions modeling moments identifiability

asked Nov 30 '19 at 20:55

Vincent Granville

votes

1 answer

Identification of parameters problem

I always struggle to get the true essence of identification in econometrics. I know that we state that a parameter (say $\hat{\theta}$) can be identified if by simply looking at its (joint) distribution we can infer the value of the parameter. In a…

estimation identifiability

asked Oct 03 '12 at 08:24

CharlesM

votes

1 answer

How can I tell if a statistical model is "identified"?

My econometrics professor used the term "identified" in class. We are considering data generating processes of the form $$Y = \beta_0 + \beta_1 X + U$$ where $X$ is a random variable and $U$ is a random error term. Our regression lines take the…

regression econometrics model identifiability

asked Apr 24 '16 at 05:39

Stan Shunpike

3,623
2
27
36

votes

1 answer

What's the problem with model identifiability?

I understand that in a decision perspective, identifiability of a model is needed to ensure the convergence (with increasing number of observations) of the parameters to estimate through a single value. But, if the non-identifiability of a given…

bayesian inference identifiability

asked May 30 '13 at 13:31

peuhp

4,622
20
38

votes

1 answer

Definition of softmax function

This question follows up on stats.stackexchange.com/q/233658 The logistic regression model for classes {0, 1} is $$ \mathbb{P} (y = 1 \;|\; x) = \frac{\exp(w^T x)}{1 + \exp(w^T x)} \\ \mathbb{P} (y = 0 \;|\; x) = \frac{1}{1 + \exp(w^T x)} $$ Clearly…

logistic neural-networks multinomial-logit softmax identifiability

asked Jun 01 '18 at 21:07

farmer

votes

1 answer

Identifiability in a nonlinear regression problem

Suppose I'm working with the following model $y_i = \alpha(1-\exp(-\beta t_i))+\gamma(1-\exp(-\delta t_i)) + \varepsilon_i$. The $\varepsilon_i$ are i.i.d. gaussian with zero mean and I'm trying to find best fit values of…

regression nonlinear-regression identifiability

asked Feb 05 '17 at 03:30

Will

1,118
8
16

votes

2 answers

Can anyone help explain this basic example of posterior

I am having trouble understanding the authors reasoning here. It is from "The Bayesian Choice" I am confused about why the posterior is initially written without depending on the data, and why we integrate the numerator. It is, Consider one…

bayesian normal-distribution independence posterior identifiability

asked Oct 31 '18 at 19:07

Learning

votes

1 answer

Why are over identified models preferred over just identified models in Structural Equation Modeling?

It's often stated that for analysis using an SEM technique, it is preferred to use an overidentified model compared to a just identified model. Why is that so ? My intuition says that for an over identified model will have more than one solution and…

confirmatory-factor identifiability path-model structural-equation-modeling

asked Nov 22 '15 at 18:49

Sahil Talwar

votes

0 answers

Rule of thumb for excluded variable in Heckman selection model?

I'm working on a project that involves the use of a Heckman selection model (more specifically a Roy or move-stay model, which is essentially a two-sided Heckman) of the following form: $$ Y_{i1} = X_i\beta_1 + \varepsilon_{i1} \text{ if }…

regression econometrics instrumental-variables identifiability heckman

asked Feb 06 '15 at 18:27

NickCHK

2 3

…

13 14 Next