Highest Voted Questions - Statistical Analysis Stack Exchange

34

votes

1 answer

Is there Factor analysis or PCA for ordinal or binary data?

I have completed the principal component analysis (PCA), exploratory factor analysis (EFA), and confirmatory factor analysis (CFA), treating data with likert scale (5-level responses: none, a little, some,..) as a continuous variable. Then, using…

pca factor-analysis ordinal-data binary-data likert

asked May 30 '16 at 15:41

user116948

383
1
4
6

34

votes

3 answers

When is it appropriate to use an improper scoring rule?

Merkle & Steyvers (2013) write: To formally define a proper scoring rule, let $f$ be a probabilistic forecast of a Bernoulli trial $d$ with true success probability $p$. Proper scoring rules are metrics whose expected values are minimized if…

classification forecasting scoring-rules

asked Apr 21 '16 at 06:14

user1205901 - Reinstate Monica

11,303
26
77
152

34

votes

7 answers

What is normality?

In many different statistical methods there is an "assumption of normality". What is "normality" and how do I know if there is normality?

distributions normality-assumption

asked Jul 19 '10 at 19:12

A Lion

1,081
2
12
12

34

votes

3 answers

How to decide which glm family to use?

I have fish density data that I am trying to compare between several different collection techniques, the data has lots of zeros, and the histogram looks vaugley appropriate for a poisson distribution except that, as densities, it is not integer…

regression distributions generalized-linear-model link-function

asked Jan 14 '16 at 23:57

C. Denney

575
1
5
9

34

votes

3 answers

Is p-value a point estimate?

Since one can calculate confidence intervals for p-values and since the opposite of interval estimation is point estimation: Is p-value a point estimate?

confidence-interval estimation p-value estimators point-estimation

asked Nov 13 '15 at 12:56

00schneider

1,202
1
14
26

34

votes

3 answers

(Why) Has Kohonen-style SOM fallen out of favor?

As far as I can tell, Kohonen-style SOMs had a peak back around 2005 and haven't seen as much favor recently. I haven't found any paper that says that SOMs have been subsumed by another method, or proven equivalent to something else (at higher…

clustering self-organizing-maps

asked Oct 19 '15 at 02:36

Wayne

19,981
4
50
99

34

votes

7 answers

Why is it bad to teach students that p-values are the probability that findings are due to chance?

Can someone please offer a nice succinct explanation why it is not a good idea to teach students that a p-value is the prob(their findings are due to [random] chance). My understanding is that a p-value is the prob(getting more extreme data | null…

p-value randomness teaching

asked Oct 13 '11 at 02:55

Patrick

1,381
1
15
21

34

votes

2 answers

Degrees of freedom of $\chi^2$ in Hosmer-Lemeshow test

The test statistic for the Hosmer-Lemeshow test (HLT) for goodness of fit (GOF) of a logistic regression model is defined as follows: The sample is then split into $d=10$ deciles, $D_1, D_2, \dots , D_{d}$, per decile one computes the following…

regression logistic goodness-of-fit degrees-of-freedom hosmer-lemeshow-test

asked Aug 17 '15 at 14:00

user83346

34

votes

3 answers

How to find confidence intervals for ratings?

Evan Miller's "How Not to Sort by Average Rating" proposes using the lower bound of a confidence interval to get a sensible aggregate "score" for rated items. However, it's working with a Bernoulli model: ratings are either thumbs up or thumbs…

confidence-interval estimation

asked Sep 23 '11 at 16:41

Peter Taylor

393
3
11

34

votes

6 answers

Interpretation of Shapiro-Wilk test

I'm pretty new to statistics and I need your help. I have a small sample, as follows: H4U 0.269 0.357 0.2 0.221 0.275 0.277 0.253 0.127 0.246 I ran the Shapiro-Wilk test using R: shapiro.test(precisionH4U$H4U) and I got the…

r distributions interpretation goodness-of-fit normality-assumption

asked Sep 17 '11 at 12:18

Jakub

707
3
7
8

34

votes

3 answers

Coordinate vs. gradient descent

I was wondering what the different use cases are for the two algorithms, Coordinate Descent and Gradient Descent. I know that coordinate descent has problems with non-smooth functions but it is used in popular algorithms like SVM and LASSO. Gradient…

optimization gradient-descent

asked Apr 14 '15 at 14:38

Bar

2,492
3
19
31

34

votes

3 answers

What are the benefits of using ReLU over softplus as activation functions?

It is often mentioned that rectified linear units (ReLU) have superseded softplus units because they are linear and faster to compute. Does softplus it still have the advantage of inducing sparsity or is that restricted to the ReLU? The reason I ask…

machine-learning neural-networks

asked Apr 13 '15 at 04:21

brockl33

441
4
5

34

votes

7 answers

Inference vs. estimation?

What are the differences between "inference" and "estimation" under the context of machine learning? As a newbie, I feel that we infer random variables and estimate the model parameters. Is my this understanding right? If not, what are the…

machine-learning inference terminology

asked Jan 01 '15 at 08:12

Sibbs Gambling

2,208
5
20
42

34

votes

4 answers

What is the weak side of decision trees?

Decision trees seems to be a very understandable machine learning method. Once created it can be easily inspected by a human which is a great advantage in some applications. What are the practical weak sides of Decision Trees?

machine-learning nonparametric cart

asked Aug 05 '10 at 10:42

Łukasz Lew

1,312
2
14
24

34

votes

3 answers

MSE decomposition to Variance and Bias Squared

In showing that MSE can be decomposed into variance plus the square of Bias, the proof in Wikipedia has a step, highlighted in the picture. How does this work? How is the expectation pushed in to the product from the 3rd step to the 4th step? If the…

random-variable expected-value mse

asked Nov 09 '14 at 19:28

statBeginner

1,251
2
17
22

Most Popular