I build a few models, each model will produce a normal distribution for the value of a future event. For example, model M1 will produce a normal distribution $n(30, 5^2)$, and the value of the future event observed is 24. Likewise,
M2-> $n(40, 10^2)$, observation: 35
M3-> $n(10, 2^2)$, observation: 20
How do I evaluate the underlying theory used to build M1, M2, M3?