Questions tagged [gaussian-process]

Gaussian processes refer to stochastic processes whose realization consists of normally distributed random variables, with the additional property that any finite collection of these random variables have a multivariate normal distribution. The machinery of Gaussian processes can be employed in regression and classification problems.

Overview

Gaussian processes refer to stochastic processes whose realization consists of normally distributed random variables, with the additional property that any finite collection of these random variables have a multivariate normal distribution. The machinery of Gaussian processes can be employed in regression and classification problems.

Formulation

We consider a Gaussian process as an infinite version of a multivariate Gaussian distribution and has two parameters:

  1. mean function $\mu$ corresponding to the mean vector
  2. positive definite covariance or kernel function $K$ corresponding to a positive definite covariance matrix

Consider $x_1,...x_n$ as a finite collection of points in $\mathcal{X}$. For a Gaussian process over function $f \in \mathcal{H}$ the probability density of $\mathbf{f} = \\{f(x_1),...,f(x_n)\\}^T$ is a multivariate normal:

$$\mathbf{f} \sim MVN({\bf \mu}, {\bf \Sigma})$$

with:

  • mean vector $\mathbf{\mu} = \\{\mu(x_1),...,\mu(x_n)\\}^T$
  • covariance $\mathbf{\Sigma}_{ij} = K(x_i,x_j)$
  • $\mu(x) = E[f(x)]$
  • $K(x_i,x_j) = E[(f(x_i) - \mu(x_i))(f(x_j) - \mu(x_j)]$

Software Packages

  • mlegp, GPfit and CGP for R.
  • GPy, pygp and gaussian_processes for Python

References

946 questions
40
votes
3 answers

What is the rationale of the Matérn covariance function?

The Matérn covariance function is commonly used as kernel function in Gaussian Process. It is defined like this $$ {\displaystyle C_{\nu }(d)=\sigma ^{2}{\frac {2^{1-\nu }}{\Gamma (\nu )}}{\Bigg (}{\sqrt {2\nu }}{\frac {d}{\rho }}{\Bigg )}^{\nu…
36
votes
3 answers

Why is the mean function in Gaussian Process uninteresting?

I have just started reading about GPs and analogous to the regular Gaussian distribution it is characterized by a mean function and the covariance function or the kernel. I was at a talk and the speaker said that the mean function is usually quite…
Luca
  • 4,410
  • 3
  • 30
  • 52
33
votes
3 answers

Why are Gaussian process models called non-parametric?

I am a bit confused. Why are Gaussian processes called non parametric models? They do assume that the functional values, or a subset of them, have a Gaussian prior with mean 0 and covariance function given as the kernel function. These kernel…
user34790
  • 6,049
  • 6
  • 42
  • 64
28
votes
2 answers

Is it true that Bayesian methods don't overfit?

Is it true that Bayesian methods don't overfit? (I saw some papers and tutorials making this claim) For example, if we apply a Gaussian Process to MNIST (handwritten digit classification), but only show it a single sample, will it revert to the…
25
votes
1 answer

Wavelet-domain Gaussian processes: what is the covariance?

I've been reading Maraun et al, "Nonstationary Gaussian processes in wavelet domain: Synthesis, estimation, and significant testing" (2007) which defines a class of non-stationary GPs that can be specified by multipliers in wavelet domain. A…
22
votes
1 answer

How does a random kitchen sink work?

Last year at NIPS 2017 Ali Rahimi and Ben Recht won the test of time award for their paper "Random Features for Large-Scale Kernel Machines" where they introduced random features, later codified as the random kitchen sinks algorithm. As part of…
MachineEpsilon
  • 2,686
  • 1
  • 17
  • 29
21
votes
3 answers

Splines vs Gaussian Process Regression

I'm know that Gaussian Process Regression (GPR) is an alternative to using splines for fitting flexible nonlinear models. I would like to know in which situations would one be more suitable than the other, especially in the Bayesian regression…
ved
  • 962
  • 5
  • 15
20
votes
2 answers

What is a distribution over functions?

I am reading a textbook Gaussian Process for Machine Learning by C.E. Rasmussen and C.K.I. Williams and I am having some trouble understanding what does distribution over functions mean. In the textbook, an example is given, that one should imagine…
kamilazdybal
  • 672
  • 8
  • 20
19
votes
3 answers

Fitting multivariate, natural cubic spline

note: with no correct answers after a month, I have reposted to SO Background I have a model, $f$, where $Y=f(\textbf{X})$ $\textbf{X}$ is an $n \times m$ matrix of samples from $m$ parameters and $Y$ is the $n \times 1$ vector of model…
18
votes
1 answer

Understanding Gaussian Process Regression via infinite dimensional basis function view

It is often said that gaussian process regression corresponds (GPR) to bayesian linear regression with a (possibly) infinite amount of basis functions. I am currently trying to understand this in detail to get an intuition for what kind of models I…
Julian Karch
  • 1,433
  • 1
  • 13
  • 26
17
votes
1 answer

Relationship between Cholesky decomposition and matrix inversion?

I've been reviewing Gaussian Processes and, from what I can tell, there's some debate whether the "covariance matrix" (returned by the kernel), which needs to be inverted, should be done so through matrix inversion (expensive and numerically…
17
votes
3 answers

Confusion related to difference of kriging and gaussian processes

I am having a hard time understanding what is the difference between kriging and gaussian processes. I mean wiki says they are the same but their formulas for prediction are so different. I am a bit confused why they are called similar.…
user34790
  • 6,049
  • 6
  • 42
  • 64
16
votes
1 answer

Gaussian process : function approximation properties

I'm learning about Gaussian Process and have heard only bits and pieces. Would really appreciate comments and answers. For any set of data, is it true that a Gaussian Process function approximation would give zero or negligible fitting error at the…
oalah
  • 287
  • 1
  • 6
16
votes
1 answer

Do Gaussian process (regression) have the universal approximation property?

Can any continuous function on [a, b], where a and b are real numbers, be approximated or arbitrarily close to the function (in some norm) by Gaussian Processes (Regression)?
Michael D
  • 583
  • 1
  • 3
  • 23
16
votes
2 answers

Derivative of a Gaussian Process

I believe that the derivative of a Gaussian process (GP) is a another GP, and so I would like to know if there are closed form equations for the prediction equations of the derivative of a GP? In particular, I am using the squared exponential (also…
user30490
1
2 3
63 64