Whether or not something is an outlier depends on the physics of the process that generates the data. To see why, consider that some distributions naturally have heavy tails. For example, the Cauchy distribution has tails so heavy that one cannot take a mean value and expect it to converge to a stable value as the number of realizations increases. In that case, one can use the median, which is stable, or a truncated (trimmed) mean, formed by leaving out a percentage, e.g., 22%, of the area under the distribution's tails. That latter process, leaving out samples, does not signify that they are outliers, and if performed properly the truncated mean will converge faster than the median as the sample size grows.
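To make this concrete, here is a minimal sketch in Python (my choice of language; the synthetic Cauchy data and the 11%-per-tail trimming, echoing the 22% total above, are illustrative assumptions, not part of your problem):

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(seed=1)

# Compare location estimates on growing samples from a standard Cauchy
# distribution (true location 0). The mean never settles down because
# the Cauchy distribution has no finite mean.
for n in (100, 10_000, 1_000_000):
    x = rng.standard_cauchy(n)
    mean = x.mean()                                   # does not converge
    median = np.median(x)                             # converges to 0
    tmean = stats.trim_mean(x, proportiontocut=0.11)  # cut 11% per tail (22% total)
    print(f"n={n:>9,}  mean={mean:10.3f}  median={median:8.4f}  trimmed={tmean:8.4f}")
```

On repeated runs the mean bounces around wildly even at a million draws, while the median and trimmed mean settle near the true location of 0.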
There are other cases in which outliers are created by an imperfect measurement system. Distinguishing between measurement-system problems and natural variability takes work. I would prefer not to call something an outlier unless I can identify some problem with the measurement system that justifies that claim. In other words, if I cannot say "This is an outlier because..." then I wouldn't call it one. As the truncated-mean example above suggests, some statistical parameters are not sensitive to truncation of the data, while others, like the variance, are.
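As a small illustration of that last point (synthetic normal data in Python; the 11%-per-tail truncation is again just an example, not a recommendation):

```python
import numpy as np

rng = np.random.default_rng(seed=2)
x = np.sort(rng.normal(loc=10.0, scale=2.0, size=100_000))

# Symmetrically truncate 11% from each tail (22% of the sample in total).
k = int(0.11 * x.size)
xt = x[k:-k]

print(f"median:   full={np.median(x):7.3f}  truncated={np.median(xt):7.3f}")
print(f"variance: full={x.var(ddof=1):7.3f}  truncated={xt.var(ddof=1):7.3f}")
```

The median is nearly unchanged, but the variance is biased sharply downward, so truncation is harmless for some purposes and not for others.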
For your data, I would suggest fitting several different distributions to see which distribution types are likely. Some software will automatically attempt to find the best-fitting distribution type; for example, the FindDistribution routine in the Mathematica language does that. It is my personal observation that when I use a better measurement system or a more accurate model, the parameters tend to be more nearly normally distributed.
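If you are not working in Mathematica, a rough sketch of the same idea in Python might look like the following; the candidate list, the synthetic lognormal data, and the use of AIC for ranking are my assumptions, not a prescription:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(seed=3)
data = rng.lognormal(mean=0.5, sigma=0.4, size=500)  # stand-in for real measurements

# Candidate families to compare; extend the dict to suit your data.
candidates = {
    "normal":    stats.norm,
    "lognormal": stats.lognorm,
    "gamma":     stats.gamma,
    "cauchy":    stats.cauchy,
}

results = []
for name, dist in candidates.items():
    params = dist.fit(data)                    # maximum-likelihood fit
    loglik = dist.logpdf(data, *params).sum()  # log-likelihood at the fit
    aic = 2 * len(params) - 2 * loglik         # Akaike information criterion
    results.append((aic, name))

for aic, name in sorted(results):
    print(f"{name:>9}: AIC = {aic:8.1f}")
```

Lower AIC means a better trade-off between fit quality and number of parameters; on this synthetic data the lognormal family should win.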
Then, once one has identified the distribution type, one can calculate the probability of generating data values as extreme as the ones seen. Often, that probability is too large to support calling the values outliers, but I also occasionally find outliers that appear to be real because, for example, an assay becomes unstable at very low concentrations of the substance assayed. There are formal tests for outliers, e.g., see https://stats.stackexchange.com/a/28221/99274, but I wouldn't use their results without thinking in terms of cause and effect.
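Here is a sketch of that calculation, assuming for illustration only a normal fit and one planted, seemingly extreme value:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(seed=4)
data = rng.normal(loc=100.0, scale=5.0, size=200)
data = np.append(data, 115.0)  # a value that looks extreme at first glance

# Fit the presumed family (including the suspect point, a conservative
# choice) and ask how surprising the largest observation really is.
mu, sigma = stats.norm.fit(data)
extreme = data.max()
p_single = stats.norm.sf(extreme, mu, sigma)  # P(one draw >= extreme)
n = data.size
p_any = 1 - (1 - p_single) ** n               # P(at least one of n draws >= extreme)

print(f"P(one draw >= {extreme:.1f})        = {p_single:.2e}")
print(f"P(any of {n} draws >= {extreme:.1f}) = {p_any:.4f}")
```

A single-draw tail probability can look alarmingly small while the probability that at least one of n draws is that extreme is unremarkable, which is one reason formal tests like those in the linked answer account for the sample size.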
Now, there is no blanket answer for clinical relevancy; it is generally a matter of opinion. True enough, physicians tend to regard relevancy in absolute, yes/no terms, but when such opinions are dissected they cease to be so clear-cut. Consider, for example, this commentary on metformin dosing for dialysis patients: Comment on: "The pharmacokinetics of metformin in patients receiving intermittent haemodialysis" by Sinnappah et al. That represents a difference of peer-reviewed expert clinical opinion concerning what is relevant to consider for dosing. So, in general, what research can do is change what is considered clinically relevant. In your data, there may be more difficulty measuring whatever it is in the occipital and temporal regions, which might, for example, be expected on CT scanning in regions of thicker bone. But you need to do some testing, and it never hurts to explain exactly what you are doing.