Questions tagged [dirichlet-process]

A family of stochastic processes whose realizations are probability distributions

From "Dirichlet Process" by Yee Whye Teh:

The Dirichlet process is a stochastic proces used in Bayesian nonparametric models of data, particularly in Dirichlet process mixture models (also known as infinite mixture models). It is a distribution over distributions, i.e. each draw from a Dirichlet process is itself a distribution. It is called a Dirichlet process because it has Dirichlet distributed finite dimensional marginal distributions, just as the Gaussian process, another popular stochastic process used for Bayesian nonparametric regression, has Gaussian distributed finite dimensional marginal distributions. Distributions drawn from a Dirichlet process are discrete, but cannot be described using a finite number of parameters, thus the classification as a nonparametric model.

From Wikipedia:

Dirichlet processes are usually used when modeling data that tends to repeat previous values in a "rich get richer" fashion. Specifically, suppose that the generation of values $X_{{1}},X_{{2}},\dots$ can be simulated by the following algorithm.

Input: $H$ (a probability distribution called base distribution), $\alpha$ (a positive real number called concentration parameter) Draw $X_{{1}} $ from the distribution $H$. For $n>1$:

  1. With probability ${\frac {\alpha }{\alpha +n-1}}$ draw $X_{{n}}$ from $H$.
  2. With probability ${\frac {n_{{x}}}{\alpha +n-1}}$ set $X_{{n}}=x$, where $n_{{x}}$ is the number of previous observations $X_{{j}},j \lt > n$, such that $X_{{j}}=x$.

At the same time, another common model for data is that the observations $X_{{1}},X_{{2}},\dots$ are assumed to be independent and identically distributed (i.i.d.) according to some distribution $P$. The goal in introducing Dirichlet processes is to be able to describe the procedure outlined above in this i.i.d. model.

124 questions
25
votes
3 answers

Is there a Bayesian approach to density estimation

I am interested to estimate the density of a continuous random variable $X$. One way of doing this that I learnt is the use of Kernel Density Estimation. But now I am interested in a Bayesian approach that along the following lines. I initially…
23
votes
2 answers

Topic stability in topic models

I am working on a project where I want to extract some information about the content of a series of open-ended essays. In this particular project, 148 people wrote essays about a hypothetical student organization as part of a larger experiment. …
15
votes
3 answers

Multinomial-Dirichlet model with hyperprior distribution on the concentration parameters

I will try to describe the problem at hand as general as possible. I am modeling observations as a categorical distribution with a parameter probability vector theta. Then, I assume the parameter vector theta follows a Dirichlet prior distribution…
14
votes
2 answers

Dirichlet Processes for clustering: how to deal with labels?

Q: What is the standard way to cluster data using a Dirichlet Process? When using Gibbs sampling clusters appear and dissapear during the sampling. Besides, we have a identifiability problem since the posterior distribution is invariant to cluster…
13
votes
4 answers

Understanding and Implementing a Dirichlet Process model

I am trying to implement and learn a Dirichlet Process to cluster my data (or as machine learning people speak, estimate the density). I read a lot of paper in the topic and sort of got the idea. But I am still confused; here are a series of…
Areza
  • 1,058
  • 2
  • 11
  • 30
9
votes
1 answer

What does it mean to integrate over a random measure?

I'm currently looking at a paper of Dirichlet process random effects model and the model specification is as follows: $$ \begin{align*}y_{i} &= X_{i}\beta + \psi_{i} + \epsilon_{i}\\ \psi_{i} &\sim G \\ G &\sim \mathcal{DP}\left(\alpha, G_{0}\right)…
8
votes
1 answer

Why is the Dirichlet Process unsuitable for applications in Bayesian nonparametrics?

The discrete nature of the DP makes it unsuitable for general applications in Bayesian nonparametrics, but it is well suited for the problem of placing priors on mixture components in mixture modeling. This quote is from Hierarchical Dirichlet…
8
votes
2 answers

Dirichlet processes for supervised learning?

It seems when I glance around here at the fashionable learning algorithms, things like neural networks, boosted trees, support vector machines, random forests, and friends are promoted for supervised learning problems. Dirichlet processes and their…
guy
  • 7,737
  • 1
  • 26
  • 50
8
votes
3 answers

Mixture Models and Dirichlet Process Mixtures (beginner lectures or papers)

In the context of online clustering, I often find many papers talking about: "dirichlet process" and "finite/infinite mixture models". Given that I've never used or read about dirichlet process or mixture models. Do you know any suggestions of…
8
votes
1 answer

Dirichlet process mixture MCMC

I'm reading Markov Chain Sampling Methods for Dirichlet Process Mixture Models by Radford M. Neal. Equation (3.6) states that $$ \text{If } c=c_{j} \text{ for some } j\neq i: P\left(c_{i}=c\;|\;c_{-i}, y_{i}, \boldsymbol{\phi}\right) =…
7
votes
0 answers

Label Switching in WinBugs/JAGS

I am using JAGS to estimate a Dirichlet Process Mixture of Normals. The code works well and the estimated density is accurate. However, I would like to know which component each observation is assigned to and the corresponding parameters for that…
Glen
  • 6,320
  • 4
  • 37
  • 59
6
votes
3 answers

simulate dirichlet process in R

I am reading the paper of "Dirichlet Process Mixtures of Generalized Linear Models" authored by L. A. Hannah. If I would like to simulate the following model $$\mathcal{P}\sim…
6
votes
2 answers

Understanding the construction of Dirichlet process

I'm trying to understand the construction process of DP, however, with little background in measure theory, the original papers are hard to read, but I believe the ideas behind these papers can be followed. Let $y_1, y_2, \ldots, y_n$ be i.i.d.…
6
votes
1 answer

What does the base distribution of the Dirichlet Process mean?

So far I only really understand the Dirichlet Process through its various metaphors. For the Polya Urn scheme, my understanding is that the "base distribution" is the original distribution of colors in the urn before you start the process. So if…
cgreen
  • 832
  • 6
  • 18
6
votes
1 answer

How can we convert values proportional to probabilities to Bernoulli probabilities?

According to Wikipedia, the parameter in a Bernoulli distribution should be $0
1
2 3
8 9