I was looking at the outlier detection formula which uses the IQR and I wonder why it should be multiplied by 1.5? Can the constant be increased i.e 3 or 6 to be more "acid" if so under what criteria?
Asked
Active
Viewed 7,698 times
6
-
The criteria are yours to choose. Why would you declare any observation to be an outlier? – Glen_b Oct 21 '13 at 01:06
-
1Ok thanks. I just looked at some text books and found that the IQR is one of the myriad of methods to spot an outlier. – Aureon Oct 21 '13 at 02:13
-
1It's really your model (at least in the loose sense) that makes an outlier an outlier (because what is an outlier but something that 'doesn't fit'?) – Glen_b Oct 21 '13 at 02:42
-
4John Tukey, who invented the approach from which this method appears derived, used *two* multipliers: He set one "fence" at 1.5 times (an analog of) the IQR away from each quartile and another fence at 3 times the IQR from each quartile. Values beyond the first fence were "out" and values beyond the second were considered "far out" (those who remember the '60s will understand this terminology). If you think you need more extreme fences, then most likely you should consider *re-expressing* your data rather than changing the fences. – whuber Oct 21 '13 at 13:42
-
Question was also asked here: http://mathoverflow.net/questions/26434/why-1-5iqr-whiskers-in-boxplot and one of the answers is that such rule in case of normal distribution marks cases that appear with less then probability smaller then 1%. – Tim Oct 05 '16 at 11:32
-
see some discussion [here](http://stats.stackexchange.com/questions/125521/why-does-tableaus-box-whisker-plot-show-outliers-automatically-and-how-can-i-ge) and [here](http://stats.stackexchange.com/questions/129274/outlier-detection-on-skewed-distributions) that may be somewhat related – Glen_b Oct 05 '16 at 22:33
1 Answers
7
Certainly you can change the criterion.
The 1.5 multiplier is so that a certain proportion of the sample in a normal population will be outside it. But there is nothing sacred about it.
However, I would caution against any automatic method of selecting outliers.

Peter Flom
- 94,055
- 35
- 143
- 276
-
1Thank you, what I understood is that the criterion can be changed according to the data. – Aureon Oct 21 '13 at 02:15
-
1
-
To be specific, 0.7% of the data. These commonly used limits make sure that '0.7%' of data is treated as an outlier; if there's any data point in that region. – Dr Nisha Arora Nov 01 '19 at 15:13