Possible Duplicate:
Threshold for correlation coefficient to indicate statistical significance of a correlation in a correlation matrix
Context
I am doing an exploratory study to investigate the relationship between a drug (actually measured in two ways - by direct and indirect methods) and 15 various parameters. There are three groups of individuals, on which I compute correlation statistics separately ($n=11$, $n=11$ and $n=24$ - the first two are paired). Obviously, the number of statistical tests is large, and I would like to control for multiple testing.
I am using Spearman correlation as data are limited and do not (always) follow a normal distribution. I am working in R.
Questions
- How should I control for multiplicity of testing?
This is an hypothesis generating study, and I would rather prefer to use less conservative methods.
I have been looking into Benjamini–Hochberg's procedure (known under R as "BH"
or "fdr"
) which allows to control the false discovery rate. However, I am not sure if I would violate dependence assumptions. And I am not sure if this could be used in correlation statistics.
Perhaps I should not adjust for multiple testing at all. In my mind, if both direct and indirect methods give the same association, false positives are highly unlikely.