Questions tagged [stochastic-approximation]

31 questions
10
votes
2 answers

Is Markov chain based sampling the "best" for Monte Carlo sampling? Are there alternative schemes available?

Markov Chain Monte Carlo is a method based on Markov chains that allows us to obtain samples (in a Monte Carlo setting) from non-standard distributions from which we cannot draw samples directly. My question is why Markov chain is…
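The excerpt above describes MCMC in general terms; a minimal random-walk Metropolis–Hastings sketch (target, proposal scale, and function names chosen here purely for illustration):

```python
import math
import random

def metropolis_hastings(log_target, x0, n_samples, step=1.0, seed=0):
    """Random-walk Metropolis: propose x' ~ N(x, step^2) and accept with
    probability min(1, target(x') / target(x)); otherwise keep x."""
    rng = random.Random(seed)
    x = x0
    samples = []
    for _ in range(n_samples):
        x_prop = x + rng.gauss(0.0, step)
        log_alpha = log_target(x_prop) - log_target(x)
        if log_alpha >= 0 or rng.random() < math.exp(log_alpha):
            x = x_prop
        samples.append(x)
    return samples

# Example: sample from a standard normal, given only its log-density
# up to an additive constant (the usual MCMC setting).
samples = metropolis_hastings(lambda x: -0.5 * x * x, x0=0.0, n_samples=20000)
mean = sum(samples) / len(samples)
```

The chain only needs the target density up to a normalising constant, which is exactly why MCMC applies to non-standard distributions we cannot sample directly.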
6
votes
3 answers

Stochastic Differential Equations - A Few General Questions

I just have a few questions about stochastic differential equations. I generally did a lot of pure math but signed up for a course on probability models and stochastic differential equations because I wanted to try something different. I have really…
5
votes
1 answer

Confusion about Robbins-Monro algorithm in Bishop PRML

This is basically how Robbins-Monro is presented in chapter 2.3 of Bishop's PRML book (from his slides): In the general update equation, $$ \theta^{(N)} = \theta^{(N-1)} - \alpha_{N-1} z(\theta^{(N-1)}) $$ $z(\theta^{(N)})$ is an observed value of $z$ when…
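Bishop's running example of this update is sequential estimation of a mean; a minimal sketch under that assumption (function name and stream are illustrative, not from the book):

```python
import random

def robbins_monro_mean(stream, theta0=0.0):
    """Sequential mean estimation as a Robbins-Monro root-finding problem:
    z(theta) = theta - x is a noisy observation of f(theta) = theta - E[x],
    whose root is E[x]. Step sizes a_N = 1/N satisfy the Robbins-Monro
    conditions (sum a_N diverges, sum a_N^2 converges)."""
    theta = theta0
    for n, x in enumerate(stream, start=1):
        theta = theta - (1.0 / n) * (theta - x)
    return theta

# Feed in a stream with true mean 3; the iterate converges toward E[x].
rng = random.Random(1)
data = [rng.gauss(3.0, 1.0) for _ in range(5000)]
estimate = robbins_monro_mean(data)
```

With $\alpha_N = 1/N$ this recursion is algebraically identical to the running sample mean, which is the point of Bishop's example.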
4
votes
0 answers

Difference between Stochastic Approximation (SA) and Stochastic Gradient Descent (SGD)

I understand the intended use cases for both stochastic approximation algorithms like SPSA or FDSA, and for SGD algorithms like Adam. SPSA is intended for noisy objective functions, and Adam for randomized mini batches. So for me it looks like the…
4
votes
1 answer

Stochastic gradient descent: why randomise training set

I'm given a dataset of 200 million training examples. The stochastic gradient descent method requires me to sample these randomly, to avoid it getting 'stuck'. First of all, I don't see how it gets stuck. So the fact the sample needs to be…
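The usual practice the excerpt alludes to is reshuffling the training set once per epoch; a minimal sketch on a toy 1-D problem (all names and the toy loss are illustrative):

```python
import random

def sgd_epoch(data, w, lr, grad, rng):
    """One SGD epoch: reshuffle the index order so each pass visits the
    examples in a fresh random order, avoiding systematic ordering
    effects (e.g. all examples of one class arriving in a block)."""
    order = list(range(len(data)))
    rng.shuffle(order)
    for i in order:
        w = [wj - lr * gj for wj, gj in zip(w, grad(w, data[i]))]
    return w

# Toy example: fit a scalar w by minimising (w - x)^2 per example,
# whose minimiser is the sample mean (true mean here is 2).
rng = random.Random(0)
data = [rng.gauss(2.0, 0.5) for _ in range(500)]
grad = lambda w, x: [2.0 * (w[0] - x)]
w = [0.0]
for _ in range(20):
    w = sgd_epoch(data, w, lr=0.01, grad=grad, rng=rng)
```

Shuffling per epoch keeps the per-step gradients approximately unbiased samples of the full-batch gradient, which is the assumption behind SGD's convergence guarantees.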
4
votes
1 answer

100 m sprint world record times - lower bound

Can we find the lowest attainable bound for 100 m sprint times, i.e. the quickest it can ever be run, using past data? Every now and again the record gets broken and we can map the new record time. But surely there must be a particular…
4
votes
0 answers

Rootfinding in the presence of one-sided noise

I'm faced with a practical problem of solving for the root of a 1-D function which has noise, so I find myself in the territory of stochastic approximation (and I am well out of my comfort zone here!). I know there is some literature on the problem, but I've yet…
3
votes
1 answer

Root-finding via Robbins-Monro method: A real and simple example

I am looking for a real and simple example of the Robbins-Monro (RM) method, but most of the googled results are theoretical and abstract. To understand RM, I used a simple function \begin{equation*} f(x) = \frac{1}{1+\exp(-0.5x)}…
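A concrete sketch of the kind of example the excerpt asks for, using the same logistic function: find the $x^*$ where noisy observations of $f$ average to a target level. The target value, noise level, and gain constant below are illustrative choices, not from the question:

```python
import math
import random

def f(x):
    return 1.0 / (1.0 + math.exp(-0.5 * x))

def robbins_monro_root(noisy_f, target, x0=0.0, n_iter=20000, seed=0):
    """Solve E[noisy_f(x*)] = target by the Robbins-Monro recursion
    x <- x - a_n (y - target), with gain a_n = 10/n (the constant 10
    is tuned to this function's shallow slope near the root)."""
    rng = random.Random(seed)
    x = x0
    for n in range(1, n_iter + 1):
        y = noisy_f(x, rng)
        x = x - (10.0 / n) * (y - target)
    return x

# Observe f(x) corrupted by Gaussian noise; solve f(x) = 0.75.
# The exact root is 2*ln(3) ≈ 2.197, so the iterate should land nearby.
noisy = lambda x, rng: f(x) + rng.gauss(0.0, 0.1)
root = robbins_monro_root(noisy, target=0.75)
```

Note that only noisy evaluations of $f$ are used; no derivative and no noise-free oracle is needed, which is the defining feature of the RM scheme.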
3
votes
1 answer

How do I compare the sampling distribution of the minimum of a distribution across sample sizes

I saw this question (link) but when I read it, I see that it has a fixed $N$, so I thought it was asking about a finite sample size. When I read the answer of the question it was suggested to be a duplicate of (link), that answer is analytic, and in terms of…
3
votes
2 answers

Frequency distribution of Chinese Restaurant Process?

Set-up: I was simulating the Generalized Chinese Restaurant Process as shown on the Wikipedia page [link], with a discount $\alpha$ and concentration parameter $\theta$. For $n=5$ total customers being seated with $\alpha=.3$ and $\theta=1.5$ and…
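A minimal simulator for the seating rule the excerpt describes (the two-parameter, or Pitman–Yor, CRP); parameter values match the excerpt, the function name is illustrative:

```python
import random

def crp_sample(n, alpha, theta, rng):
    """Seat n customers by the two-parameter CRP: with i customers already
    seated and K tables open, occupied table k (with n_k customers) gets
    the next customer w.p. (n_k - alpha)/(i + theta), and a new table
    opens w.p. (theta + alpha*K)/(i + theta)."""
    tables = []  # customer count per table
    for i in range(n):
        if not tables:
            tables.append(1)  # first customer always opens a table
            continue
        r = rng.random() * (i + theta)
        acc = 0.0
        seated = False
        for k in range(len(tables)):
            acc += tables[k] - alpha
            if r < acc:
                tables[k] += 1
                seated = True
                break
        if not seated:  # remaining mass theta + alpha*K: open a new table
            tables.append(1)
    return tables

# The excerpt's setting: n=5 customers, discount .3, concentration 1.5.
rng = random.Random(42)
partition = crp_sample(5, alpha=0.3, theta=1.5, rng=rng)
```

Repeating the draw many times and tabulating the resulting partitions gives the frequency distribution the question asks about.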
2
votes
1 answer

Variational inference with discrete variational parameters

Typically, Variational Inference relies on taking gradient steps on the KL divergence between the variational and true posterior, or on the ELBO. This does not seem valid when variational parameters are discrete (since gradients wrt those arguments are…
2
votes
1 answer

Is it possible to combine SPSA and Adam?

In SGD algorithms such as Adam, you generally make a bad estimate of the gradient of the loss function and use that gradient to move the parameters in the desired direction. Gradient-free methods such as SPSA do basically the same: they make a bad…
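For reference, the SPSA gradient estimate the excerpt contrasts with Adam's: perturb all coordinates at once with a random $\pm 1$ vector and use two loss evaluations. A minimal sketch (the toy loss, gain schedules, and names are illustrative):

```python
import random

def spsa_step(loss, x, a, c, rng):
    """One SPSA step: simultaneous +/-1 perturbation of every coordinate,
    gradient estimated from just two (noisy) loss evaluations, then a
    plain gradient-descent move."""
    delta = [rng.choice((-1.0, 1.0)) for _ in x]
    x_plus = [xi + c * di for xi, di in zip(x, delta)]
    x_minus = [xi - c * di for xi, di in zip(x, delta)]
    g_scale = (loss(x_plus) - loss(x_minus)) / (2.0 * c)
    # Coordinate-wise estimate is g_scale / delta_i (delta_i = +/-1).
    return [xi - a * g_scale / di for xi, di in zip(x, delta)]

# Toy example: minimise a noisy quadratic with minimum at (1, -2),
# using the standard decaying gain sequences a_k ~ k^-0.602, c_k ~ k^-0.101.
rng = random.Random(0)
loss = lambda x: (x[0] - 1.0) ** 2 + (x[1] + 2.0) ** 2 + rng.gauss(0.0, 0.01)
x = [0.0, 0.0]
for k in range(1, 2001):
    x = spsa_step(loss, x, a=0.1 / k ** 0.602, c=0.1 / k ** 0.101, rng=rng)
```

In principle nothing stops this two-evaluation gradient estimate from being fed into Adam's moment accumulators instead of the plain descent step above, which is essentially what the question proposes.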
2
votes
0 answers

Log Euler simulation scheme for Cox–Ingersoll–Ross model

https://en.wikipedia.org/wiki/Cox%E2%80%93Ingersoll%E2%80%93Ross_model In this article the Cox–Ingersoll–Ross model is given. I want to design a simulation scheme for this process. A discretization of the continuous SDE is also given in the article. However…
2
votes
0 answers

SGD shows the same convergence behaviour as batch gradient descent when using adaptive learning rate?

SGD shows the same convergence behaviour as batch gradient descent when using an adaptive learning rate? I don't understand why he claimed that; I couldn't find any reference for it in any paper. However, it has been shown that when we slowly…
2
votes
1 answer

Simulation of Secretary problem: optimal pool size given k=2?

Question: Is it incorrect to think there is a "sweet spot" where more samples slightly decrease the likelihood of a "best pick" in the Secretary Problem? Details: The "Secretary Problem" from "optimal stopping" is a classic in decision theory. …
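A simulation of the kind the question is based on: one trial of the classic $1/e$ stopping rule for a given pool size, which can then be swept over $n$ to look for the conjectured "sweet spot". This sketch covers the standard $k=1$ rule only; names and trial counts are illustrative:

```python
import math
import random

def secretary_success(n, rng):
    """One trial of the classic 1/e rule: observe the first floor(n/e)
    candidates without committing, then accept the first candidate
    better than everyone seen so far. Rank 0 denotes the best candidate."""
    ranks = list(range(n))
    rng.shuffle(ranks)
    cutoff = int(n / math.e)
    best_seen = min(ranks[:cutoff], default=n)
    for r in ranks[cutoff:]:
        if r < best_seen:
            return r == 0  # committed: success iff this is the best
    return ranks[-1] == 0  # never committed: forced to take the last one

# Estimate the success probability for a pool of 20 candidates;
# the theoretical rate for moderate n is close to 1/e ≈ 0.368.
rng = random.Random(7)
trials = 20000
p_hat = sum(secretary_success(20, rng) for _ in range(trials)) / trials
```

Wrapping the estimate in a loop over pool sizes `n` gives the success-versus-pool-size curve the question wants to examine.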