Questions tagged [gaussian-mixture-distribution]

A type of mixture distribution or model that assumes subpopulations follow Gaussian distributions.

Gaussian mixture can refer to a distribution or to a model that assumes subpopulations follow Gaussian distributions. A Gaussian mixture distribution $\mathcal{P}(x)$ can generally be written as a weighted sum of $n$ individual Gaussian densities $\mathcal{N}$:

$$\mathcal{P}(x) = \sum_{i=1}^n w_i\, \mathcal{N}(x \mid \mu_i,\sigma_i^2)$$

where $w_i \ge 0$ and $\sum_{i=1}^{n}w_i = 1$.
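As an illustrative sketch, the mixture density above can be evaluated numerically; the weights and component parameters below are arbitrary choices, not taken from any particular question:

```python
import numpy as np
from scipy.stats import norm

# Hypothetical parameters for a two-component mixture (illustrative only).
w = np.array([0.3, 0.7])      # mixing weights, sum to 1
mu = np.array([-1.0, 2.0])    # component means
sigma = np.array([0.5, 1.0])  # component standard deviations

def mixture_pdf(x):
    """Weighted sum of the component Gaussian densities."""
    return sum(wi * norm.pdf(x, mi, si) for wi, mi, si in zip(w, mu, sigma))

# Because the weights sum to 1, the mixture density integrates to 1.
xs = np.linspace(-10.0, 10.0, 20001)
area = np.sum(mixture_pdf(xs)) * (xs[1] - xs[0])
```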

GMMs are often used in unsupervised learning, where we do not know to which subpopulation a data point belongs. In this case we seek to maximize the likelihood function (which assumes the data points are independent):

$$p(X \mid w,\mu,\sigma) = \prod_{j=1}^{N} p(x_j \mid w,\mu,\sigma)$$

The parameters $\{w,\mu,\sigma^2\}$ are typically estimated using either Expectation-Maximization (EM) or maximum a posteriori (MAP) estimation.
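A minimal sketch of the EM updates for a one-dimensional, two-component mixture (the simulated data, initial values, and iteration count are arbitrary assumptions for illustration, not a definitive implementation):

```python
import numpy as np

rng = np.random.default_rng(0)
# Simulated data from an assumed two-component mixture (for illustration).
x = np.concatenate([rng.normal(-2.0, 0.7, 300), rng.normal(3.0, 1.2, 700)])

# Initial parameter guesses.
w = np.array([0.5, 0.5])      # mixing weights
mu = np.array([-1.0, 1.0])    # means
var = np.array([1.0, 1.0])    # variances

def component_pdf(x, mu, var):
    # Gaussian density of each point under each component, shape (N, k).
    return np.exp(-(x[:, None] - mu) ** 2 / (2.0 * var)) / np.sqrt(2.0 * np.pi * var)

for _ in range(200):
    # E-step: posterior responsibility of each component for each point.
    dens = w * component_pdf(x, mu, var)
    resp = dens / dens.sum(axis=1, keepdims=True)
    # M-step: weighted maximum-likelihood parameter updates.
    nk = resp.sum(axis=0)
    w = nk / len(x)
    mu = (resp * x[:, None]).sum(axis=0) / nk
    var = (resp * (x[:, None] - mu) ** 2).sum(axis=0) / nk
```

With well-separated components like these, the fitted means and weights land close to the generating values.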

Gaussian mixtures can also be used to simulate outliers and thereby test outlier-detection and robust statistical methods.
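For example, a contamination mixture is a common way to generate such test data; the 95/5 split and component scales below are arbitrary choices for the sketch:

```python
import numpy as np

rng = np.random.default_rng(42)  # seed chosen arbitrarily
n = 1000

# 5% of points come from a wide "outlier" component, the rest from N(0, 1).
is_outlier = rng.random(n) < 0.05
x = np.where(is_outlier, rng.normal(0.0, 10.0, n), rng.normal(0.0, 1.0, n))

# A robust location estimate such as the median barely moves under this
# contamination, while the sample standard deviation is visibly inflated.
```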

579 questions
38
votes
5 answers

Clustering a dataset with both discrete and continuous variables

I have a dataset X which has 10 dimensions, 4 of which are discrete values. In fact, those 4 discrete variables are ordinal, i.e. a higher value implies a higher/better semantic. 2 of these discrete variables are categorical in the sense that for…
26
votes
1 answer

Different covariance types for Gaussian Mixture Models

While trying Gaussian Mixture Models here, I found these 4 types of covariances. 'full' (each component has its own general covariance matrix), 'tied' (all components share the same general covariance matrix), 'diag' (each component has its own…
Bee
25
votes
2 answers

If k-means clustering is a form of Gaussian mixture modeling, can it be used when the data are not normal?

I'm reading Bishop on the EM algorithm for GMMs and the relationship between GMMs and k-means. The book says that k-means is a hard-assignment version of GMM. I'm wondering whether that implies that, if the data I'm trying to cluster are not Gaussian, I…
22
votes
2 answers

EM algorithm manually implemented

I want to implement the EM algorithm manually and then compare it to the results of normalmixEM from the mixtools package. Of course, I would be happy if they both led to the same results. The main reference is Geoffrey McLachlan (2000), Finite…
20
votes
5 answers

Singularity issues in Gaussian mixture model

In chapter 9 of the book Pattern Recognition and Machine Learning, there is this part about the Gaussian mixture model: To be honest, I don't really understand why this would create a singularity. Can anyone explain this to me? I'm sorry but I'm just…
Dang Manh Truong
18
votes
2 answers

Why is optimizing a mixture of Gaussian directly computationally hard?

Consider the log likelihood of a mixture of Gaussians: $$l(S_n; \theta) = \sum^n_{t=1}\log f(x^{(t)}|\theta) = \sum^n_{t=1}\log\left\{\sum^k_{i=1}p_i f(x^{(t)}|\mu^{(i)}, \sigma^2_i)\right\}$$ I was wondering why it was computationally hard to…
17
votes
1 answer

Quantiles from the combination of normal distributions

I have information on the distributions of anthropometric dimensions (like shoulder span) for children of different ages. For each age and dimension, I have mean, standard deviation. (I also have eight quantiles, but I don't think I'll be able to…
16
votes
2 answers

Why Expectation Maximization is important for mixture models?

Much of the literature emphasizes the Expectation Maximization method for mixture models (mixtures of Gaussians, Hidden Markov Models, etc.). Why is EM important? EM is just a way to do optimization and is not as widely used as gradient-based methods…
15
votes
2 answers

Finding category with maximum likelihood method

Let's say that we had information on men's and women's heights. R code: set.seed(1) Women=rnorm(80, mean=168, sd=6) Men=rnorm(120, mean=182, sd=7) par(mfrow=c(2,1)) hist(Men, xlim=c(150, 210), col="skyblue") hist(Women, xlim=c(150, 210),…
15
votes
4 answers

In cluster analysis, how does Gaussian mixture model differ from K Means when we know the clusters are spherical?

I understand that the main difference between k-means and the Gaussian mixture model (GMM) is that k-means only detects spherical clusters while a GMM can adjust itself to elliptical clusters. However, how do they differ when the GMM has a spherical covariance…
daisybeats
15
votes
2 answers

How to fit mixture model for clustering

I have two variables, X and Y, and I need at most (and optimally) 5 clusters. Suppose the ideal plot of the variables looks like the following: I would like to make 5 clusters of this, something like this. Thus I think this is a mixture model with 5 clusters.…
rdorlearn
14
votes
2 answers

Why do we use Gaussian distributions in Variational Autoencoder?

I still don't understand why we force the distribution of the hidden representation of a Variational Autoencoder (VAE) to follow a multivariate normal distribution. Why this specific distribution and not another? This may be linked with…
14
votes
3 answers

References that justify use of Gaussian Mixtures

Gaussian mixture models (GMMs) are appealing because they are simple to work with both analytically and in practice, and are capable of modeling some exotic distributions without too much complexity. There are a few analytic properties we should…
14
votes
3 answers

Relation between sum of Gaussian RVs and Gaussian Mixture

I know that a sum of Gaussians is Gaussian. So, how is a mixture of Gaussians different? I mean, a mixture of Gaussians is just a sum of Gaussians (where each Gaussian is multiplied by the respective mixing coefficient), right?
13
votes
3 answers

Distance between two Gaussian mixtures to evaluate cluster solutions

I'm running a quick simulation to compare different clustering methods, and currently hit a snag trying to evaluate the cluster solutions. I know of various validation metrics (many found in cluster.stats() in R), but I assume those are best used…
dmartin