Questions tagged [latent-variable]

Latent variables refer to variables that cannot be directly observed. These variable are defined in terms of observable variables. In narrow sense, "latent variable" is seen/modeled as what generates the observed variables in an implied data generation process. Also called hidden or lurking variables.

Common applications using latent variable models:

  • Hidden Markov models
  • Factor analysis
  • Principal component analysis
  • Partial least squares regression
  • Latent semantic analysis and Probabilistic latent semantic analysis
  • EM algorithms

Bayesian methods include:

  • Latent Dirichlet Allocation
  • The Chinese Restaurant Process is often used to provide a prior distribution over assignments of objects to latent categories.
  • The Indian buffet process

In graphical models, latent variables are represented by ovals/circles and observed values are represented by squares/rectangles:

enter image description here

See also https://en.wikipedia.org/wiki/Hidden_variable

386 questions
43
votes
5 answers

LDA vs word2vec

I am trying to understand what is similarity between Latent Dirichlet Allocation and word2vec for calculating word similarity. As I understand, LDA maps words to a vector of probabilities of latent topics, while word2vec maps them to a vector of…
42
votes
3 answers

Latent Class Analysis vs. Cluster Analysis - differences in inferences?

What are the differences in inferences that can be made from a latent class analysis (LCA) versus a cluster analysis? Is it correct that a LCA assumes an underlying latent variable that gives rise to the classes, whereas the cluster analysis is an…
Brian P
  • 455
  • 1
  • 6
  • 12
25
votes
3 answers

How to choose an optimal number of latent factors in non-negative matrix factorization?

Given a matrix $\mathbf V^{m \times n}$, Non-negative Matrix Factorization (NMF) finds two non-negative matrices $\mathbf W^{m \times k}$ and $\mathbf H^{k \times n}$ (i.e. with all elements $\ge 0$) to represent the decomposed matrix as: $$\mathbf…
24
votes
5 answers

How to get started with applying item response theory and what software to use?

Context I have been reading about item response theory, and I find it fascinating. I believe I understand the basics, but I am left wondering how to apply statistical techniques related to the area. Below are two articles that are similar to the…
Behacad
  • 4,916
  • 8
  • 30
  • 48
22
votes
1 answer

Latent variable interpretation of generalized linear models (GLMs)

Short version: We know that logistic regression and probit regression can be interpreted as involving a continuous latent variable that gets discretized according to some fixed threshold prior to observation. Is a similar latent variable…
19
votes
1 answer

Parameters vs latent variables

I have asked about this before and have really been struggling with identifying what makes a model parameter and what makes it a latent variable. So looking at various threads on this topic on this site, the main distinction seems to be: Latent…
Luca
  • 4,410
  • 3
  • 30
  • 52
15
votes
1 answer

What is principal subspace in probabilistic PCA?

if $X$ is observed data matrix and $Y$ is latent variable then $$X=WY+\mu+\epsilon$$ Where $\mu$ is the mean of observed data, and $\epsilon$ is the Gaussian error/noise in data, and $W$ is called principal subspace. My question is when normal PCA…
user3086871
  • 583
  • 4
  • 11
12
votes
2 answers

How to reduce number of items using factor analysis, internal consistency, and item response theory in conjunction?

I am in the process of empirically developing a questionnaire and I will be using arbitrary numbers in this example to illustrate. For context, I am developing a psychological questionnaire aimed at assessing thought patterns commonly identified in…
12
votes
1 answer

How do you use the EM algorithm to calculate MLEs for a latent variable formulation of a zero inflated Poisson model?

The zero inflated Poisson regression model is defined for a sample $(y_1,\ldots,y_n)$ by $$ Y_i = \begin{cases} 0 & \text{with probability} \ p_i+(1-p_i)e^{-\lambda_i}\\ k & \text{with probability} \ (1-p_i)e^{-\lambda_i}…
12
votes
1 answer

What's the difference between a MIMIC factor and a composite with indicators (SEM)?

In structural equation modeling with latent variables (SEM), a common model formulation is "Multiple Indicator, Multiple Cause" (MIMIC) where a latent variable is caused by some variables and reflected by others. Here's a simple…
dmp
  • 283
  • 3
  • 10
10
votes
2 answers

How to compute the confidence intervals on regression coefficients in PLS?

The underlying model of PLS is that a given $n \times m$ matrix $X$ and $n$ vector $y$ are related by $$X = T P' + E,$$ $$y = T q' + f,$$ where $T$ is a latent $n \times k$ matrix, and $E, f$ are noise terms (sssuming $X, y$ are centered). PLS…
10
votes
1 answer

What is the difference between VAE and Stochastic Backpropagation for Deep Generative Models?

What is the difference between Auto-encoding Variational Bayes and Stochastic Backpropagation for Deep Generative Models? Does inference in both methods lead to the same results? I'm not aware of any explicit comparisons between the two methods,…
9
votes
2 answers

EM algorithm Practice Problem

This is a practice problem for a midterm exam. The problem is an EM algorithm example. I am having trouble with part (f). I list parts (a)-(e) for completion and in case I made a mistake earlier. Let $X_1,\ldots,X_n$ be independent exponential…
9
votes
2 answers

Expectation maximization on Bayesian networks with latent variables

I am trying to determine parameters in a bayesian network with two latent variables (in blue). Every variable is discrete with 2-4 categories. The latent variables have 3 categories each. I am trying to find an R package that will let me define the…
9
votes
1 answer

Latent variables, overparameterization and MCMC convergence in bayesian models

Sometimes I have a large number of latent variables in a Bayesian hierarchical model to which, but I am only interested in estimating projected transformations of those latent variables (for example, I will parameterize a binomial parameter as an…
1
2 3
25 26