I've been going down the rabbit-hole of multiple-testing, and I haven't figured it out quite yet. So let's take a simplified example. Let's say I have 200 drugs and I want to test their effectiveness.
- Option 1 is an ANOVA, but that only tests group differences.
- Option 2 is ANOVA + a post-hoc pairwise tests with a correction like Scheffe. But since I'm testing SO MANY drugs, its so unlikely to find any of them significant at all.
- Option 3 is a linear regression with where drug is a 199 term with a baseline control (-reg y i.drug). Then I do a multiple-test correction (-test drug, mtest(bonferroni)-). But this has the same problem as option 2 in that nothing will be found, AND it only compares the drugs to controls not to each other
How do I address this? I'm at a loss. Seems like either I fish for p-values with data mining or I do a correction that pretty much guarantees I find all null results. Neither of those seems reflective of reality.