Highest Voted Questions - Statistical Analysis Stack Exchange

45

votes

2 answers

Difference between naive Bayes & multinomial naive Bayes

I've dealt with Naive Bayes classifier before. I've been reading about Multinomial Naive Bayes lately. Also Posterior Probability = (Prior * Likelihood)/(Evidence). The only prime difference (while programming these classifiers) I found between…

bayesian classification text-mining naive-bayes

asked Jul 27 '12 at 14:17

garak

2,033
4
26
31

45

votes

7 answers

Why does Andrew Ng prefer to use SVD and not EIG of covariance matrix to do PCA?

I am studying PCA from Andrew Ng's Coursera course and other materials. In the Stanford NLP course cs224n's first assignment, and in the lecture video from Andrew Ng, they do singular value decomposition instead of eigenvector decomposition of…

pca linear-algebra svd eigenvalues numerics

asked Nov 16 '17 at 10:11

DongukJu

593
1
5
5

45

votes

1 answer

Computing Cohen's Kappa variance (and standard errors)

The Kappa ($\kappa$) statistic was introduced in 1960 by Cohen [1] to measure agreement between two raters. Its variance, however, had been a source of contradictions for quite a some time. My question is about which is the best variance…

estimation variance reliability cohens-kappa

asked Jun 17 '12 at 00:37

Cesar

984
1
9
21

45

votes

5 answers

What is the difference between a population and a sample?

What is the difference between a population and a sample? What common variables and statistics are used for each one, and how do those relate to each other?

standard-deviation variance sample population

asked Jul 20 '10 at 11:07

Baltimark

2,028
3
19
20

45

votes

3 answers

What kind of information is Fisher information?

Suppose we have a random variable $X \sim f(x|\theta)$. If $\theta_0$ were the true parameter, the the likelihood function should be maximized and the derivative equal to zero. This is the basic principle behind the maximum likelihood estimator. As…

bayesian maximum-likelihood likelihood intuition fisher-information

asked Feb 14 '16 at 21:42

Stan Shunpike

3,623
2
27
36

45

votes

3 answers

What does the term saturating nonlinearities mean?

I was reading the paper ImageNet Classification with Deep Convolutional Neural Networks and in section 3 were they explain the architecture of their Convolutional Neural Network they explain how they preferred using: non-saturating nonlinearity…

machine-learning neural-networks terminology conv-neural-network

asked Sep 26 '15 at 19:45

Charlie Parker

5,836
11
57
113

45

votes

10 answers

What exactly is Big Data?

I have been asked on several occasions the question: What is Big-Data? Both by students and my relatives that are picking up the buzz around statistics and ML. I found this CV-post. And I feel that I agree with the only answer there. The…

large-data

asked Sep 18 '15 at 12:10

Gumeo

3,551
1
21
31

45

votes

1 answer

How to interpret and report eta squared / partial eta squared in statistically significant and non-significant analyses?

I have data that has eta squared values and partial eta squared values calculated as a measure of effect size for group mean differences. What is the difference between eta squared and partial eta squared? Can they both be interpreted using the…

anova statistical-significance effect-size

asked Sep 23 '11 at 07:22

Short Elizabeth

731
2
10
12

45

votes

4 answers

How to calculate a confidence level for a Poisson distribution?

Would like to know how confident I can be in my $\lambda$. Anyone know of a way to set upper and lower confidence levels for a Poisson distribution? Observations ($n$) = 88 Sample mean ($\lambda$) = 47.18182 what would the 95% confidence look…

poisson-distribution confidence-interval

asked Sep 09 '11 at 12:25

Travis

721
1
8
15

45

votes

4 answers

Compendium of cross-validation techniques

I'm wondering if anybody knows of a compendium of cross-validation techniques with a discussion of the differences between them and a guide on when to use each of them. Wikipedia has a list of the most common techniques, but I'm curious if there are…

cross-validation

asked Aug 18 '11 at 15:53

Amelio Vazquez-Reina

17,546
26
74
110

45

votes

4 answers

Why does inversion of a covariance matrix yield partial correlations between random variables?

I heard that partial correlations between random variables can be found by inverting the covariance matrix and taking appropriate cells from such resulting precision matrix (this fact is mentioned in http://en.wikipedia.org/wiki/Partial_correlation,…

covariance covariance-matrix linear-algebra partial-correlation matrix-inverse

asked Mar 03 '15 at 06:48

michal

1,138
3
11
14

45

votes

3 answers

How to interpret F- and p-value in ANOVA?

I am new to statistics and I currently deal with ANOVA. I carry out an ANOVA test in R using aov(dependendVar ~ IndependendVar) I get – among others – an F-value and a p-value. My null hypothesis ($H_0$) is that all group means are equal. There…

r anova interpretation

asked Jun 27 '11 at 14:34

JanD

561
1
5
6

45

votes

6 answers

How to determine the optimal threshold for a classifier and generate ROC curve?

Let say we have a SVM classifier, how do we generate ROC curve? (Like theoretically) (because we are generate TPR and FPR with each of the threshold). And how do we determine the optimal threshold for this SVM classifier?

machine-learning svm roc

asked Nov 07 '14 at 19:20

RockTheStar

11,277
31
63
89

45

votes

8 answers

How can I test if given samples are taken from a Poisson distribution?

I know of normality tests, but how do I test for "Poisson-ness"? I have sample of ~1000 non-negative integers, which I suspect are taken from a Poisson distribution, and I would like to test that.

hypothesis-testing distributions poisson-distribution goodness-of-fit

asked Aug 03 '10 at 13:54

David B

1,161
2
12
15

45

votes

6 answers

What is your favorite statistical graph?

This is a favorite of mine This example is in a humorous vein (credit goes to a former professor of mine, Steven Gortmaker), but I am also interested in graphs that you feel beautifully capture and communicate a statistical insight or method, along…

data-visualization

asked Jul 23 '14 at 18:52

Alexis

26,219
5
78
131

Most Popular