I want to perform a chi-square test on some data. To generate the histogram, I can use python's y,x = np.histogram(data)
.
This gives me the height of the histogram, y
, and the bin edges x
(I can find the bin centres easily).
I have the choice of normalizing the histogram. My question is: If I plan to perform a chi-square test, should I normalize my data or should I use the original frequencies?