Questions tagged [monte-carlo]

Using (pseudo-)random numbers and the Law of Large Numbers to simulate the random behavior of a real system.

Monte Carlo methods are used to mimic the random behavior of real random phenomena by drawing sequences of random or pseudo-random numbers, usually via a computer code. These numbers are incorporated as inputs to the model describing the phenomena of interest, as proxy to a true random sample. They require a (pseudo-)random uniform generator; the most common implementations provide a number from 0 to 1 that mimics the random draw of a $U(0,1)$ variate, let us call this (pseudo-)random variate $U$.

For instance, to simulate a fair coin toss, draw a random number $U$ and associate values in $[0,0.5)$ with heads, and values in $[0.5,1)$, with tails. More complex models require a more involved use of the random uniform generators, as well as smart ways of transforming the basic $U[0,1]$ variates into random variables with the targeted distribution, e.g., using acceptance-rejection methods or Metropolis algorithm. In general, a random generator will be a function $\Psi$ of a random number of uniform variates, $X=\Psi(U_1,U_2,\ldots)$.

In Monte Carlo studies where time and budget are an issue, an efficient use of variance reduction techniques is often warranted. These include antithetic samples, importance sampling, control variates, etc.

Besides the direct use of Monte Carlo methods to model random phenomena, they are also used in mathematical physics and statistics to evaluate multidimensional integrals. Such integrals can be represented as expectations of an integrand against a wide range of (importance) distributions. A point is drawn randomly from such a distribution (often with support the support of the integrand); the value of the integrand is added to the running sum; the process is repeated until the desired accuracy is achieved. The approximation is converging by virtue of the Law of Large Numbers. The advantage of the method is that the error decreases at the rate $O( N^{-1/2})$, independent of the dimension of the space. Low discrepancy, or quasi-Monte Carlo, sequences are arguably better suited for this purpose.

References:

Lemieux, C. (2009) Monte Carlo and Quasi-Monte Carlo Sampling. Springer. Amazon link.

Liu, J. S. (2001) Monte Carlo Strategies in Scientific Computing. Springer. Amazon link.

Robert, C.P. and Casella, G. (2004) Monte Carlo Statistical Methods. Springer. Amazon link

See also: tags sampling, simulation.

1171 questions
52
votes
6 answers

Rule of thumb for number of bootstrap samples

I wonder if someone knows any general rules of thumb regarding the number of bootstrap samples one should use, based on characteristics of the data (number of observations, etc.) and/or the variables included?
hoyem
  • 871
  • 1
  • 7
  • 10
46
votes
1 answer

What is the difference between Metropolis-Hastings, Gibbs, Importance, and Rejection sampling?

I have been trying to learn MCMC methods and have come across Metropolis-Hastings, Gibbs, Importance, and Rejection sampling. While some of these differences are obvious, i.e., how Gibbs is a special case of Metropolis-Hastings when we have the full…
42
votes
8 answers

Approximate $e$ using Monte Carlo Simulation

I've been looking at Monte Carlo simulation recently, and have been using it to approximate constants such as $\pi$ (circle inside a rectangle, proportionate area). However, I'm unable to think of a corresponding method of approximating the value of…
41
votes
5 answers

K-fold vs. Monte Carlo cross-validation

I am trying to learn various cross validation methods, primarily with intention to apply to supervised multivariate analysis techniques. Two I have come across are K-fold and Monte Carlo cross-validation techniques. I have read that K-fold is a…
Liam
  • 553
  • 1
  • 5
  • 6
41
votes
1 answer

How to determine significant principal components using bootstrapping or Monte Carlo approach?

I am interested in determining the number of significant patterns coming out of a Principal Component Analysis (PCA) or Empirical Orthogonal Function (EOF) Analysis. I am particularly interested in applying this method to climate data. The data…
Marc in the box
  • 3,532
  • 3
  • 33
  • 47
39
votes
2 answers

Can somebody explain to me NUTS in english?

My understanding of the algorithm is the following: No U-Turn Sampler (NUTS) is a Hamiltonian Monte Carlo Method. This means that it is not a Markov Chain method and thus, this algorithm avoids the random walk part, which is often deemed as…
user3007270
  • 521
  • 1
  • 4
  • 6
37
votes
7 answers

Are all simulation methods some form of Monte Carlo?

Is there a simulation method that is not Monte Carlo? All simulation methods involve substituting random numbers into the function to find a range of values for the function. So are all simulation methods in essence Monte Carlo methods?
Victor
  • 5,925
  • 13
  • 43
  • 67
31
votes
5 answers

Generating random numbers manually

How can I manually generate a random number from a given distribution, as for instance, 10 realisations from the standard normal distribution?
29
votes
5 answers

What are examples of statistical experiments that allow the calculation of the golden ratio?

There are some very simple experiences that can be done by a kid at home, whose result allows one to statistically approach famous numbers such as $\pi$ or $e$. An example where $\pi$ shows up is perhaps the most famous one of its kind. In Buffon's…
27
votes
5 answers

Why is the term "Monte Carlo simulation" used instead of "Random simulation"?

I always read/hear "Monte Carlo" simulations. I have done "Monte Carlo" simulations before to calculate the odds in certain gambling games as part of my job and it was nothing more than basically using an RNG to simulate random results (slot…
SpaceMonkey
  • 383
  • 3
  • 6
27
votes
5 answers

Why use Monte Carlo method instead of a simple grid?

when integrating a function or in complex simulations, I have seen the Monte Carlo method is widely used. I'm asking myself why one doesn't generate a grid of points to integrate a function instead of drawing random points. Wouldn't that bring more…
Alexander Engelhardt
  • 4,161
  • 3
  • 21
  • 25
25
votes
2 answers

What is importance sampling?

I'm trying to learn reinforcement learning and this topic is really confusing to me. I have taken an introduction to statistics, but I just couldn't understand this topic intuitively.
24
votes
3 answers

Would a Random Forest with multiple outputs be possible/practical?

Random Forests (RFs) is a competitive data modeling/mining method. An RF model has one output -- the output/prediction variable. The naive approach to modeling multiple outputs with RFs would be to construct an RF for each output variable. So we…
redcalx
  • 788
  • 2
  • 7
  • 14
22
votes
4 answers

Can Machine Learning or Deep Learning algorithms be utilised to "improve" the sampling process of a MCMC technique?

Based on the little knowledge that I have on MCMC (Markov chain Monte Carlo) methods, I understand that sampling is a crucial part of the aforementioned technique. The most commonly used sampling methods are Hamiltonian and Metropolis. Is there a…
21
votes
2 answers

What are some techniques for sampling two correlated random variables?

What are some techniques for sampling two correlated random variables: if their probability distributions are parameterized (e.g., log-normal) if they have non-parametric distributions. The data are two time series for which we can compute…
Pete
  • 587
  • 4
  • 13
1
2 3
78 79