Most Popular

1500 questions
32
votes
4 answers

The difference of kernels in SVM?

Can someone please tell me the difference between the kernels in SVM: Linear Polynomial Gaussian (RBF) Sigmoid Because as we know that kernel is used to mapped our input space into high dimensionality feature space. And in that feature…
user3378327
  • 951
  • 2
  • 8
  • 11
32
votes
4 answers

Intuition behind standard deviation

I'm trying to gain a better intuitive understanding of standard deviation. From what I understand it is representative of the average of the differences of a set of observations in a data set from the mean of that data set. However it is NOT…
sonicboom
  • 780
  • 1
  • 6
  • 13
32
votes
3 answers

Can ANOVA be significant when none of the pairwise t-tests is?

Is it possible for one-way (with $N>2$ groups, or "levels") ANOVA to report a significant difference when none of the $N(N-1)/2$ pairwise t-tests does? In this answer @whuber wrote: It is well known that a global ANOVA F test can detect a…
amoeba
  • 93,463
  • 28
  • 275
  • 317
32
votes
3 answers

How to judge if a supervised machine learning model is overfitting or not?

Can anyone tell me how to judge if a supervised machine learning model is overfitting or not? If I don't have an external validation dataset, I want to know if I can use ROC of 10 fold cross validation to explain overfitting. If I have an external…
user22062
  • 1,319
  • 3
  • 15
  • 21
32
votes
3 answers

Visualizing a million, PCA edition

Is it possible to visualize the output of Principal Component Analysis in ways that give more insight than just summary tables? Is it possible to do it when the number of observations is large, say ~1e4? And is it possible to do it in R [other…
gappy
  • 5,390
  • 3
  • 28
  • 50
32
votes
2 answers

Three versions of discriminant analysis: differences and how to use them

Can anybody explain differences and give specific examples how to use these three analyses? LDA - Linear Discriminant Analysis FDA - Fisher's Discriminant Analysis QDA - Quadratic Discriminant Analysis I searched everywhere, but couldn't find real…
Andrius
  • 457
  • 1
  • 9
  • 11
32
votes
3 answers

Accommodating entrenched views of p-values

Sometimes in reports I include a disclaimer about the p-values and other inferential statistics I've provided. I say that since the sample wasn't random, then such statistics would not strictly apply. My specific wording is usually given in a…
rolando2
  • 11,645
  • 1
  • 39
  • 60
32
votes
1 answer

Bound for Arithmetic Harmonic mean inequality for matrices?

NOTE: This question has originally been posted in MSE, but it did not generate any interest. It was first posted there, because the question itself is a pure matrix-algebra question. Nevertheless, since the motive has to do with statistics and…
32
votes
12 answers

What is the hardest statistical concept to grasp?

This is a similar question to the one here, but different enough I think to be worthwhile asking. I thought I'd put as a starter, what I think one of the hardest to grasp is. Mine is the difference between probability and frequency. One is at the…
probabilityislogic
  • 22,555
  • 4
  • 76
  • 97
32
votes
2 answers

Likelihood ratio test in R

Suppose I am going to do a univariate logistic regression on several independent variables, like this: mod.a <- glm(x ~ a, data=z, family=binominal("logistic")) mod.b <- glm(x ~ b, data=z, family=binominal("logistic")) I did a model comparison…
lokheart
  • 2,999
  • 7
  • 37
  • 47
32
votes
1 answer

Difference between standard and spherical k-means algorithms

I would like to understand, what is the major implementation difference between standard and spherical k-means clustering algorithms. In each step, k-means computes distances between element vectors and cluster centroids, and reassigns document to…
user1315305
  • 1,199
  • 4
  • 14
  • 15
32
votes
4 answers

Why are lower p-values not more evidence against the null? Arguments from Johansson 2011

Johansson (2011) in "Hail the impossible: p-values, evidence, and likelihood" (here is also link to the journal) states that lower $p$-values are often considered as stronger evidence against the null. Johansson implies that people would consider…
luciano
  • 12,197
  • 30
  • 87
  • 119
32
votes
3 answers

Visualizing the intersections of many sets

Is there a visualization model that is good for showing the intersection overlap of many sets? I am thinking something like Venn diagrams but that somehow might lend itself better to a larger number of sets such as 10 or more. Wikipedia does show…
Kyle Brandt
  • 737
  • 1
  • 6
  • 17
32
votes
2 answers

libsvm data format

I'm using the libsvm (http://www.csie.ntu.edu.tw/~cjlin/libsvm/) tool for support vector classification. However, I'm confused about the format of the input data. From the README: The format of training and testing data file is:
Siddhant Saraf
  • 421
  • 1
  • 4
  • 6
32
votes
4 answers

How to perform dimensionality reduction with PCA in R

I have a big dataset and I want to perform a dimensionality reduction. Now everywhere I read that I can use PCA for this. However, I still don't seem to get what to do after calculating/performing the PCA. In R this is easily done with the command…
Floris Devriendt
  • 493
  • 1
  • 6
  • 9