I hope everyone who reads this is fine. I come to you for help, I am a student in data analysis, but I am not yet familiar with everything, especially regarding tests.
I have 3 groups of independent objects and their durability in hours (they are not totally independent due to their use, but in practice they are considered as such. If other alternatives exist are possible I'm interested in, they are not paired, object 1 can be used 50 hours tied up with object 2 then 50 hours with object 3).
The 3 groups (or more) are: the currently sold object, a new one to test and one from a competitor. They do not follow normal distribution, and there is a high chance that they follow Weibull distributions. Their variances are probably heterogeneous. Here is an R script to generate a similar sample:
library(stats)
library(lawstat)
# Number of observation for each sample
# What test to use under 30 ? with different numbers ?
n = rep(30, 3)
# Generating simple samples
# Current model
current = data.frame(rep("cur", n[1]), rweibull(n[1], 12, 7500))
# New model
new = data.frame(rep("new", n[2]), rweibull(n[2], 16, 8000))
# Competitor
comp = data.frame(rep("com", n[3]), rweibull(n[3], 13, 7350))
# Merging
colnames(current) = c("Type", "Value")
colnames(new) = c("Type", "Value")
colnames(comp) = c("Type", "Value")
df = rbind(current, new)
df = rbind(df, comp)
# Non paramtric test of homogeneité of variance
fligner.test(Value~Type, data=df)
I would like to study the existence of a mean/median difference between the durability of those groups, but I don’t know which test to use. This journey in search for the right test has made me learn a lot, but it also has made me question myself a lot.
1.) I considered Kruskal test and then post hoc Dunn (pairwise), am I right? Could I use a pairwise Mann-Whitney-Wilcoxon Test?
2.) I would like to compare the objects in terms of % of performance. How can I create a confidence interval for this type of test (I would like to get something like “new is 15% better than cur, with a 95% confidence interval of [7%, 25%]”)?
2.bis) If I plan on comparing them only in pairs (new to cur, new to comp, cur to comp) and not in their globality (new to cur to comp), is it okay to do not adjust the method, and to simply do multiple tests ?
3.) If the variances were homogeneous, could I use another test?
4.) Can I do a post hoc test (I am interested in the differences between the groups) without doing an ANOVA or a Kruskal test? Like if the normality assumption was fulfilled, would it be okay to do a pairwise t.test (corrected) instead of an ANOVA + Tukey HSD?
5.) Can I compare 2 Weibull distribution with a t.test (eventually Welch t-test)?
6.) Is there any parametric test that doesn’t require normality but the estimated parameters of the distribution?
7.) What if the groups have different distribution? Can I still say that new might be better than cur?
8.) What if the samples are too small (5-15)?
9.) I love the plot of TukeyHSD, but I can’t replicate it with other tests :’(
I know I asked a lot of things, but I look forward to any answer. Thanks in advance.