I'm learning about fitting methods and have a question about finding a meaningful fitting value.
Let's assume that we have a distribution of the weights of some people. If the distribution looks like Gaussian, then we can make a hypothesis that their mean weight is 70kg and the standard deviation is 5kg. Now we generate a histogram by taking into account this model and the total number of the people. After that we can compare the two histograms (measurements and simulation), by calculating the $\chi^{2}$ statistic. Given the number of the bins, $k$, we would know the degrees of freedom, $k-1$. Then using the $\chi^{2}$ and the degrees of freedom, we can check the $\chi^{2}$-distribution with $k-1$ degrees of freedom and find the p-value corresponding to our $\chi^{2}$ value. If we repeat this work for five different hypotheses, for instance, five different mean values with the standard deviation. Then we should have five p-values.
In this case, if the five p-values are all smaller than 1%, then can we say all of these models are rejected, and the best fit value(mean weight) among the five is anyway useless? Then, in order to achieve a good fitting, we would have to find models that result in large p-values and then find the best value among them?