Questions tagged [mad]

Median Absolute Deviation (MAD) is a measure of variability in a sample of data, and is often used as an alternative to measures like standard deviation since it is more resistant to outliers. Use the [mae] tag if you are asking about the point forecast accuracy measure called MAD or MAE.

The MAD (Median Absolute Deviation from the median) of a sample $X_1, X_2, ..., X_n$ is

$$ {\rm MAD} = {\rm median}_{i} (|X_i - M|) $$

where

$$ M = {\rm median}( \{X_1, X_2, ..., X_n \} ) $$

It is used as a measure of dispersion in a data set, serving as a robust alternative to measures like the sample standard deviation, since it is less sensitive to outliers. It is not to be confused with the mean absolute deviation, $\mathbb E^X\{|X-\mathbb E^X[X]|\}$, or its empirical version.
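As a rough illustration of the definition above, here is a minimal Python sketch using only the standard library. The function names `mad` and `mean_abs_dev` and the two toy samples are assumptions made up for this example and are not part of any particular package. The sketch computes the unscaled MAD exactly as in the formula, and contrasts it with the empirical mean absolute deviation and the standard deviation on a sample with and without one gross outlier.

```python
from statistics import median, pstdev


def mad(xs):
    """Unscaled median absolute deviation from the median."""
    m = median(xs)
    return median(abs(x - m) for x in xs)


def mean_abs_dev(xs):
    """Empirical mean absolute deviation from the mean (a different statistic)."""
    mu = sum(xs) / len(xs)
    return sum(abs(x - mu) for x in xs) / len(xs)


# Hypothetical toy data: the second sample replaces one value with an outlier.
clean = [1, 2, 3, 4, 5]
dirty = [1, 2, 3, 4, 500]

print("MAD:          ", mad(clean), mad(dirty))
print("mean abs dev: ", mean_abs_dev(clean), mean_abs_dev(dirty))
print("std deviation:", pstdev(clean), pstdev(dirty))
```

In this toy case the MAD is the same for both samples, while the mean absolute deviation and the standard deviation both grow by roughly two orders of magnitude once the outlier is introduced.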

59 questions
16
votes
2 answers

Median absolute deviation (MAD) and SD of different distributions

For normally distributed data, the standard deviation $\sigma$ and the median absolute deviation $\text{MAD}$ are related by: $\sigma=\text{MAD}/\Phi^{-1}(3/4)\approx1.4826\cdot\text{MAD},$ where $\Phi()$ is the cumulative distribution function…
vic
14
votes
1 answer

Does a median-unbiased estimator minimize mean absolute deviance?

This is a follow-up to, but also a different question from, my previous one. I read on Wikipedia that "A median-unbiased estimator minimizes the risk with respect to the absolute-deviation loss function, as observed by Laplace." However, my Monte Carlo…
Zhenglei
13
votes
1 answer

Are there non-trivial settings where the MAD statistic has a closed-form density?

The MAD statistic of an iid sample $(x_1,\ldots,x_n)$ is defined as the median of the absolute deviation from the median: $$ \text{mad}(x_1,\ldots,x_n)=\text{med}\left\{|x_i-\text{med}(x_1,\ldots,x_n)|;\ i=1,\ldots,n \right\}\,. $$ I wonder if there…
13
votes
2 answers

MAD formula for outlier detection

Does anyone know the name of this formula? $$M_i = \displaystyle\frac{0.6745(x_i - \hat{x})}{\mathrm{MAD}}$$ where $\textrm{MAD}$ is the median absolute deviation and $\hat{x}$ is the median of $x$. Does it appear in some scientific…
synonym
12
votes
1 answer

What distribution has the maximum entropy for a known mean absolute deviation?

I was reading the discussion on Hacker News about the use of the standard deviation as opposed to other metrics such as the mean absolute deviation. So, if we were to follow the principle of maximum entropy, what kind of distribution would we use…
Dietrich Epp
12
votes
4 answers

Is there a version of the correlation coefficient that is less-sensitive to outliers?

The correlation coefficient is: $$ r = \frac{\sum_k \frac{(x_k - \bar{x}) (y_k - \bar{y})}{s_x s_y}}{n-1} $$ The sample mean and the sample standard deviation are sensitive to outliers. As well, the mechanism where, $$ r = \frac{\sum_k…
11
votes
3 answers

Mean$\pm$SD or Median$\pm$MAD to summarise a highly skewed variable?

I'm working on highly skewed data, so I'm using the median instead of the mean to summarise the central tendency. I'd like to have a measure of dispersion. While I often see people reporting mean $\pm$ standard deviation or median $\pm$ quartiles to…
Mulone
8
votes
2 answers

Median + MAD for skewed data

I am trying to figure out what happens if you apply Hampel's outlier detection technique based on the median and the MAD to data that is skewed. Apparently, the advantage of Hampel's method over z-scores is that it is much less influenced by…
user969113
8
votes
1 answer

Using MAD as a way of defining a threshold for significance testing

If I have a set of terms, each with a particular frequency associated with it (the number of times the term has appeared in a fixed corpus of papers), is the following method of significance testing valid? Calculate the median absolute…
user1447630
8
votes
1 answer

Determine outliers using IQR or standard deviation?

Similar to ... which doesn't have the answer I'm looking for. The data set has a normal distribution. For the project I'm working on, outliers have to be determined over residuals of breeding data. We want to remove data which is most likely…
Bas
7
votes
1 answer

Can Median Absolute Deviation (MAD)/SD be used to determine if a distribution is normal or not?

I have recently come across this post on Median Absolute Deviation (MAD). According to the Wikipedia article, when used as an estimator, the standard deviation of the distribution is 'k' times the MAD, where the value of 'k' depends on the distribution.…
Ironluca
6
votes
1 answer

Median of medians as robust mean of means?

The location and scale of normally distributed data can be estimated by sampling the data and then taking the mean of the sample means and of the sample standard deviations, respectively. For non-normal (heavy-tailed) data, is it correct to take the median of the…
Christian Alis
5
votes
2 answers

MAD in relation to 95% confidence

MAD (Median Absolute Deviation) is: $\text{MAD} = M_i(|x_i-M_j(x_j)|)$ where $M()$ is the median operator ($M_i(x_i) = \text{median}(x_1,...,x_n)$). I'd like to scale the MAD in such a way as to include (say) 95% of a distribution around the…
GK89
5
votes
3 answers

How to normalize if MAD equals zero?

A known way to normalize our feature vectors is: $$\frac{x_i - \operatorname{median}( X^{(j)})}{\operatorname{MAD}^{(j)}},$$ where $\operatorname{MAD}^{(j)}$ is the median absolute deviation of feature $j$. However, if more than $50\%$ of a feature…
Low Yield Bond
5
votes
1 answer

Median absolute deviation: impact/bias when forcing a median of zero

I want to estimate the median absolute deviation (MAD) of a signal. The MAD is defined as the median of the absolute difference between the signal and its median. Now I have a signal that I know for a fact has a median of zero. Its empirical median…
P-Gn