The experimental set-up is an almost typical control vs treatment, in which we are counting how many reads map to a particular class of genes in each condition (RNA-seq), done in triplicate. An example table would look like this:
feature c1 c2 c3 t1 t2 t3
A 10 7 11 5 3 4
B 9 10 15 6 7 8
C 15 6 10 11 5 9
(.. up to several hundreds)
The question we are asking is:
"Is the distribution of counts in the control different from the treatment?".
Which would be the most appropriate test for this? From a previous question, an anova sounds like the way to go. Knowing R I could melt the data.table and run aov(Counts~Conditions, data)
. Would this be acceptable?
Extra points for suggestions on how to represent the data, including the replicates. Since there is more than one treatment, something like this, or boxplots for each replicate will become too cluttered.