I looking for the best statistical test to apply in a particular situation and I hope I can find here the answer(s) I'm looking for.
First of all some details:
I'm studying 33 different mutants of a particular protein and I've partitioned these mutants in 4 small groups on the basis of their severity:
- Group A has 11 mutants
- Group B has 8 mutants
- Group C has 6 mutants
- Group D has 8 mutants
I can test these mutants for the presence/absence of a series of particular internal interactions and I want to know if there is a statistical difference among the 4 groups. These internal interactions are essentially independent binary variables: 0 the mutant does not have a particular interaction or 1 the mutant has the interaction.
Basically, what I want to do is checking if there is a significant statistical difference in the percentage of mutants of each group that sport or not a series of these interactions.
My final goal is to correlate the presence/absence of some of these interactions with the severity of the mutations and find out which of these interactions are peculiar of a given group.
This is an example with real data:
Interaction #1
- 27.3% of the mutants in Gourp A has this interaction
- 12.5% of the mutants in Gourp B has this interaction
- 83.3% of the mutants in Gourp C has this interaction
- 50.0% of the mutants in Gourp D has this interaction
My question is: Which statistical test should I use to check if the differences in these percentages are statistically significant?
Thank you
[edit]
As suggested by @AndrewM, here are some more details about what I'm trying to do.
I've ~150 interactions and only a few of them are missing solely in mutants of GroupD (highest severity), while the vast majority are variably missing by mutants in all clusters.
What I need is an unbiased way to highlight those interactions that, even if also missing in a small number of other mutants in other clusters, could be defined are typically missing in GroupD.
My final aim is to test if I can explain, at least partially, the severity of these mutants looking at their missing interactions and then correlate mutant severity with presence/absence of the interactions.
Thanks