I have two groups of buyers, A
and B
, and I want to test whether the difference between the percentage of them who would buy a product is significant.
- Group A: 271 out of 2520 bought the product (10.8%) and 2,249 didn't buy.
- Group B: 1,073,839 out of 41,873,457 bought the product (2.6%) and 40,799,618 didn't buy.
I have used chisq.test()
to conduct a $\chi^{2}$ test to answer the question of whether the percentage of buyers across my groups has a significant difference (I would say 10.8% and 2.6% are different enough).
library(vcd)
data <- rbind(x=c(271,1073839), n=c(2249, 40799618))
chisq.test(data)
# Pearson's Chi-squared test with Yates' continuity correction
#
# data: data
# X-squared = 672.9477, df = 1, p-value < 2.2e-16
assocstats(data)
# X^2 df P(> X^2)
# Likelihood Ratio 382.03 1 0
# Pearson 676.22 1 0
#
# Phi-Coefficient : 0.004
# Contingency Coeff.: 0.004
# Cramer's V : 0.004
So the $p$-value says there is a significant difference, the Cramer's $V$ says there isn't. How is it possible that with such a big difference in proportions (group A
has more than 4 times more in sales) there is no significant difference according to Cramér's $V$?