I need to compare means of two data sets that binary. For example:
a = [1,1,0,0,0,0,0,0,0,1]
b = [1,0,1,1,1,0,0,1,1,0]
All I need to know is whether the means are statistically significantly different between the two datasets, in other words the order in which 1
are arranged does not matter. And I do know that all values are either 0
or 1
. Also in my case sizes of and be are fairly large, greater than 10,000
and number of 1
is about 10
to 100
.
What is the best test to use in this case?
I know that I cannot use t-test
because my data is not normally distributed.