Highest Voted Questions - Statistical Analysis Stack Exchange

37

votes

3 answers

Why is a likelihood-ratio test distributed chi-squared?

Why is the test statistic of a likelihood ratio test distributed chi-squared? $2(\ln \text{ L}_{\rm alt\ model} - \ln \text{ L}_{\rm null\ model} ) \sim \chi^{2}_{df_{\rm alt}-df_{\rm null}}$

distributions chi-squared-test likelihood-ratio

asked Mar 20 '13 at 15:49

Dr. Beeblebrox

1,120
1
11
16
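
A minimal sketch of the statistic in the question above, using a hypothetical coin-flip example that is not part of the original question: the null model fixes the heads probability at 0.5, the alternative estimates it freely, so the two models differ by one parameter. The chi-squared result is asymptotic, so the p-value is an approximation. For one degree of freedom the chi-squared tail probability can be written with the complementary error function, which is what the last line relies on.

```python
import math

def binomial_log_likelihood(p, heads, flips):
    """Log-likelihood of `heads` successes in `flips` trials.

    The binomial coefficient is omitted because it cancels in the ratio.
    """
    return heads * math.log(p) + (flips - heads) * math.log(1.0 - p)

# Hypothetical data: 60 heads in 100 flips.
heads, flips = 60, 100

# Alternative model: p is free, maximised at the sample proportion.
p_hat = heads / flips
log_lik_alt = binomial_log_likelihood(p_hat, heads, flips)

# Null model: p is fixed at 0.5 (no free parameters).
log_lik_null = binomial_log_likelihood(0.5, heads, flips)

# Test statistic: 2 * (ln L_alt - ln L_null), with df = 1 - 0 = 1.
lr_stat = 2.0 * (log_lik_alt - log_lik_null)

# For 1 degree of freedom, P(chi2 > x) = erfc(sqrt(x / 2)).
p_value = math.erfc(math.sqrt(lr_stat / 2.0))

print(f"LR statistic = {lr_stat:.3f}, approximate p-value = {p_value:.4f}")
```

With more than one extra parameter the same statistic would be compared against a chi-squared distribution with the corresponding degrees of freedom, which needs a statistics library rather than `math.erfc`.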

37

votes

6 answers

Is there a name for the opposite of the gambler's fallacy?

The gambler's fallacy is a fallacy because of the assumed probability and the independence of the events. However, if, after flipping a coin 100 times and obtaining heads each time, I still believe the probability of obtaining tails to be 0.5, am I…

bayesian terminology fallacy

asked Apr 09 '21 at 10:00

Igor F.

6,004
1
16
41

37

votes

3 answers

Explanation of finite population correction factor?

I understand that when sampling from a finite population and our sample size is more than 5% of the population, we need to make a correction on the sample's mean and standard error using this formula: $\hspace{10mm} FPC=\sqrt{\frac{N-n}{N-1}}$ Where…

sampling finite-population

asked Dec 05 '10 at 09:40

Sara

1,347
4
13
16
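
A small sketch of the correction factor quoted in the question above, applied to the standard error of the mean. The function names and the numbers (N = 1000, n = 100, s = 12) are illustrative assumptions, not taken from the original question.

```python
import math

def fpc(population_size, sample_size):
    """Finite population correction sqrt((N - n) / (N - 1))."""
    if population_size < 2 or not 1 <= sample_size <= population_size:
        raise ValueError("need N >= 2 and 1 <= n <= N")
    return math.sqrt((population_size - sample_size) / (population_size - 1))

def corrected_standard_error(sample_sd, population_size, sample_size):
    """Standard error of the mean, s / sqrt(n), scaled by the FPC."""
    return sample_sd / math.sqrt(sample_size) * fpc(population_size, sample_size)

# Hypothetical numbers: the sample is 10% of the population.
factor = fpc(1000, 100)
se = corrected_standard_error(12.0, 1000, 100)

print(f"FPC = {factor:.4f}, corrected SE = {se:.4f}")
```

The factor shrinks toward 0 as the sample approaches the whole population (a census has no sampling error) and is close to 1 when n is a small fraction of N, which is why the correction is often skipped for small sampling fractions.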

37

votes

4 answers

Intuitive explanation of Kolmogorov Smirnov Test

What is the cleanest, easiest way to explain the concept of the Kolmogorov-Smirnov test to someone? What does it intuitively mean? It's a concept that I have difficulty articulating, especially when explaining it to someone. Can someone please explain it…

distributions intuition cumulative-distribution-function kolmogorov-smirnov-test empirical-cumulative-distr-fn

asked Jun 12 '20 at 07:59

Pluviophile

2,381
8
18
45

37

votes

5 answers

What is a good use of the 'comment' function in R?

I just discovered the comment function in R. Example: x <- matrix(1:12, 3,4) comment(x) <- c("This is my very important data from experiment #0234", "Jun 5, 1998") x comment(x) This is the first time I came by this function and was…

r

asked Nov 09 '10 at 08:55

Tal Galili

19,935
32
133
195

37

votes

2 answers

Dealing with singular fit in mixed models

Let's say we have a model mod <- Y ~ X*Condition + (X*Condition|subject) # Y = logit variable # X = continuous variable # Condition = values A and B, dummy coded; the design is repeated # so all participants go through both…

r lme4-nlme glmm overfitting singular

asked Nov 27 '18 at 00:15

User33268

1,408
2
10
21

37

votes

7 answers

What is the minimum recommended number of groups for a random effects factor?

I'm using a mixed model in R (lme4) to analyze some repeated measures data. I have a response variable (fiber content of feces) and 3 fixed effects (body mass, etc.). My study only has 6 participants, with 16 repeated measures for each one (though…

mixed-model sample-size

asked Sep 20 '12 at 01:56

Chris

799
1
7
15

37

votes

3 answers

Difference between generalized linear models & generalized linear mixed models

I am wondering what the differences are between mixed and unmixed GLMs. For instance, in SPSS the drop down menu allows users to fit either: analyze-> generalized linear models-> generalized linear models & analyze-> mixed models-> generalized…

mixed-model generalized-linear-model glmm generalized-estimating-equations

asked Jul 16 '12 at 23:47

user9203

629
1
8
13

37

votes

5 answers

Timing functions in R

I would like to measure the time that it takes to repeat the running of a function. Are replicate() and using for-loops equivalent? For example: system.time(replicate(1000, f())); system.time(for(i in 1:1000){f()}); Which is the preferred…

r

asked Oct 01 '10 at 11:46

Tim

1
29
102
189

37

votes

3 answers

Linearity of PCA

PCA is considered a linear procedure, however: $$\mathrm{PCA}(X)\neq \mathrm{PCA}(X_1)+\mathrm{PCA}(X_2)+\ldots+\mathrm{PCA}(X_n),$$ where $X=X_1+X_2+\ldots+X_n$. This is to say that the eigenvectors obtained by the PCAs on the data matrices $X_i$…

pca linear

asked Jul 10 '17 at 12:14

AlphaOmega

667
7
13

37

votes

10 answers

What are the most useful sources of economics data?

When doing research in economics, one frequently needs to verify theoretical conclusions on real data. What are reliable data sources to use and cite? I am mainly interested in sources that provide various statistical data such as GDP, population,…

references

asked Oct 15 '11 at 15:12

Karel Petranek

341
1
3
3

37

votes

3 answers

How to estimate shrinkage parameter in Lasso or ridge regression with >50K variables?

I want to use Lasso or ridge regression for a model with more than 50,000 variables. I want to do so using a software package in R. How can I estimate the shrinkage parameter ($\lambda$)? Edits: Here is the point I got up to: set.seed (123) Y <- runif…

r lasso ridge-regression high-dimensional

asked Apr 16 '12 at 12:02

John

2,088
6
27
37

37

votes

4 answers

How does one measure the non-uniformity of a distribution?

I'm trying to come up with a metric for measuring non-uniformity of a distribution for an experiment I'm running. I have a random variable that should be uniformly distributed in most cases, and I'd like to be able to identify (and possibly measure…

distributions variance random-variable uniform-distribution

asked Apr 04 '12 at 09:00

JJC

473
1
4
7

37

votes

4 answers

Why does logistic regression become unstable when classes are well-separated?

Why is it that logistic regression becomes unstable when classes are well-separated? What does "well-separated classes" mean? I would really appreciate it if someone could explain with an example.

r regression logistic separation

asked Jan 02 '17 at 08:44

Jane Dow

471
1
4
3

37

votes

2 answers

Quantile regression: Loss function

I am trying to understand quantile regression, but one thing that troubles me is the choice of the loss function. $\rho_\tau(u) = u(\tau-1_{\{u<0\}})$ I know that the minimum of the expectation of $\rho_\tau(y-u)$ is equal to the…

quantiles loss-functions quantile-regression

asked Dec 14 '16 at 17:14

CDO

473
1
4
6
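
A minimal sketch of the loss $\rho_\tau(u) = u(\tau-1_{\{u<0\}})$ from the question above, showing on a hypothetical data set that the value minimising the average loss is a sample quantile. The data and function names are assumptions made for illustration. The search is restricted to the observed values, which is enough here because the average loss is convex and piecewise linear with kinks only at the data points.

```python
def pinball_loss(u, tau):
    """rho_tau(u) = u * (tau - 1{u < 0})."""
    indicator = 1.0 if u < 0 else 0.0
    return u * (tau - indicator)

def quantile_by_loss(ys, tau):
    """Return the observed value q minimising the mean of rho_tau(y - q)."""
    def mean_loss(q):
        return sum(pinball_loss(y - q, tau) for y in ys) / len(ys)
    return min(ys, key=mean_loss)

# Hypothetical sample of 11 observations (already sorted for readability).
data = [2, 3, 5, 7, 8, 10, 12, 15, 18, 21, 30]

median = quantile_by_loss(data, 0.5)           # 6th smallest value
lower_quartile = quantile_by_loss(data, 0.25)  # 3rd smallest value

print(median, lower_quartile)
```

For tau = 0.5 the loss is half the absolute error, so the minimiser is the median; other values of tau tilt the penalty so that under- and over-predictions are weighted by tau and 1 - tau respectively.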
