Questions tagged [nonparametric]

Use this tag to ask about the nature of nonparametric or parametric methods, or the difference between the two. Nonparametric methods generally rely on few assumptions about the underlying distributions, whereas parametric methods make assumptions that allow data to be described by a small number of parameters.

Most statistical procedures derive their justification from a probability model of the observations to which they are applied. Such a model posits that the data appear to be related in a specific way to draws from some probability distribution that is an unknown member of some family of distributions. The family of distributions for a parametric procedure can be described in a natural way by a finite set of real numbers, the "parameters." Examples include the family of Binomial distributions (which can be parameterized by the chance of a "success") and the family of Normal distributions (usually parameterized by an expectation $\mu$ and variance $\sigma^2$). When such a description is not possible, the procedure is termed "nonparametric." Wikipedia provides a list of some non-parametric procedures.

1900 questions
124
votes
7 answers

How to choose between t-test or non-parametric test e.g. Wilcoxon in small samples

Certain hypotheses can be tested using Student's t-test (maybe using Welch's correction for unequal variances in the two-sample case), or by a non-parametric test like the Wilcoxon paired signed rank test, the Wilcoxon-Mann-Whitney U test, or the…
91
votes
5 answers

Kendall Tau or Spearman's rho?

In which cases should one prefer the one over the other? I found someone who claims an advantage for Kendall, for pedagogical reasons, are there other reasons?
Tal Galili
  • 19,935
  • 32
  • 133
  • 195
85
votes
14 answers

Why haven't robust (and resistant) statistics replaced classical techniques?

When solving business problems using data, it's common that at least one key assumption that under-pins classical statistics is invalid. Most of the time, no one bothers to check those assumptions so you never actually know. For instance, that so…
doug
  • 9,901
  • 1
  • 22
  • 26
73
votes
15 answers

Why would parametric statistics ever be preferred over nonparametric?

Can someone explain to me why would anyone choose a parametric over a nonparametric statistical method for hypothesis testing or regression analysis? In my mind, it's like going for rafting and choosing a non-water resistant watch, because you may…
61
votes
6 answers

Which permutation test implementation in R to use instead of t-tests (paired and non-paired)?

I have data from an experiment that I analyzed using t-tests. The dependent variable is interval scaled and the data are either unpaired (i.e., 2 groups) or paired (i.e., within-subjects). E.g. (within subjects): x1 <- c(99, 99.5, 65, 100, 99,…
Henrik
  • 13,314
  • 9
  • 63
  • 123
53
votes
9 answers

Are all models useless? Is any exact model possible -- or useful?

This question has been festering in my mind for over a month. The February 2015 issue of Amstat News contains an article by Berkeley Professor Mark van der Laan that scolds people for using inexact models. He states that by using models, statistics…
48
votes
3 answers

Bootstrap vs. permutation hypothesis testing

There are several popular resampling techniques, which are often used in practice, such as bootstrapping, permutation test, jackknife, etc. There are numerous articles & books discuss these techniques, for example Philip I Good (2010) Permutation,…
Tu.2
  • 2,627
  • 6
  • 26
  • 26
41
votes
1 answer

Intuition behind tensor product interactions in GAMs (MGCV package in R)

Generalized additive models are those where $$ y = \alpha + f_1(x_1) + f_2(x_2) + e_i $$ for example. the functions are smooth, and to be estimated. Usually by penalized splines. MGCV is a package in R that does so, and the author (Simon Wood)…
generic_user
  • 11,981
  • 8
  • 40
  • 63
41
votes
2 answers

Difference between LOESS and LOWESS

What is the difference between LOESS (locally estimated scatterplot smoothing) and LOWESS (locally weighted scatterplot smoothing)? From Wikipedia I can only see that LOESS is a generalization of LOWESS. Do they have slightly different parameters?
pir
  • 4,626
  • 10
  • 38
  • 73
36
votes
2 answers

Is there a reliable nonparametric confidence interval for the mean of a skewed distribution?

Very skewed distributions such as the log-normal do not result in accurate bootstrap confidence intervals. Here is an example showing that the left and right tail areas are far from the ideal 0.025 no matter which bootstrap method you try in…
Frank Harrell
  • 74,029
  • 5
  • 148
  • 322
34
votes
4 answers

What is the weak side of decision trees?

Decision trees seems to be a very understandable machine learning method. Once created it can be easily inspected by a human which is a great advantage in some applications. What are the practical weak sides of Decision Trees?
Łukasz Lew
  • 1,312
  • 2
  • 14
  • 24
33
votes
3 answers

Why are Gaussian process models called non-parametric?

I am a bit confused. Why are Gaussian processes called non parametric models? They do assume that the functional values, or a subset of them, have a Gaussian prior with mean 0 and covariance function given as the kernel function. These kernel…
user34790
  • 6,049
  • 6
  • 42
  • 64
33
votes
4 answers

What exactly is the difference between a parametric and non-parametric model?

I am confused with the definition of non-parametric model after reading this link Parametric vs Nonparametric Models and Answer comments of my another question. Originally I thought "parametric vs non-parametric" means if we have distribution…
Haitao Du
  • 32,885
  • 17
  • 118
  • 213
32
votes
2 answers

StackExchange fires a moderator, and now in response hundreds of moderators resign: is the increase in resignations statistically significant?

I am doing a study on StackExchange. The management of StackExchange has demodded (for unclear reasons) a moderator, and now the network is on fire. Currently many moderators resign or suspend their activities because they are dissatisfied. I wish…
29
votes
3 answers

Why does the Kolmogorov-Smirnov test work?

In reading about the 2-sample KS test, I understand exactly what it is doing but I don't understand why it works. In other words, I can follow all the steps to compute the empirical distribution functions, find the maximum difference between the two…
1
2 3
99 100