A new market recently appeared on PredictIt, for the chance of Trump winning more than 50% of the vote in each of the April 26th primaries.
This market seemed overvalued to me. I tried to get a handle on how overvalued it was by calculating the joint probability of Trump winning majorities in all five states (explanation of my method here).
In doing this, I assumed that the primaries were statistically independent. My reasoning was that, because they are occurring simultaneously, there is no way they could affect each other, so it's safe to assume independence.
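To make the calculation concrete, here is a minimal sketch of the independence-based estimate. The per-state probabilities and the state labels are made-up placeholders for illustration, not my actual estimates; under the independence assumption, the joint probability is just the product of the individual probabilities.

```python
from math import prod

# Hypothetical per-state probabilities that Trump wins a majority.
# These numbers and labels are placeholders, not real estimates.
p_majority = {
    "state_1": 0.9,
    "state_2": 0.8,
    "state_3": 0.7,
    "state_4": 0.6,
    "state_5": 0.5,
}

# Under the (possibly wrong) independence assumption:
# P(all five) = P(state_1) * P(state_2) * ... * P(state_5)
joint_independent = prod(p_majority.values())

print(f"Joint probability assuming independence: {joint_independent:.4f}")
```

Note that the product is smaller than every individual probability, which is why the market looked overvalued to me under this assumption.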
A couple of people have pointed out that this independence assumption is incorrect. I'm realizing that I don't have a good grasp of what independence means.
Some questions I have about this:
- Why are the primaries not independent? Is it because they are correlated somehow? (That is, if events are correlated, does that mean they cannot be independent?)
- When two variables are both influenced by a third, confounding variable, can we assume that the two variables are independent because they don't affect each other, even though they are correlated?
- If the primaries are not independent, how can I calculate the joint probability that Trump wins >50% in each? Is it unsafe to assume independence for the purpose of getting an estimate? Is there a method for calculating the joint probability of dependent events?
- For future cases, how can I assess whether events are independent? If there isn't a clear mechanism linking two events, should I assume independence?
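
To illustrate the confounder question, here is a toy simulation I put together. It is only a sketch under assumptions I made up: each state's vote share is a base of 50% plus a shared national swing plus state-specific noise, all normally distributed. The function name `simulate` and every parameter value are hypothetical. No state affects any other in this model, yet the frequency of winning all five can be compared against the product of the individual frequencies.

```python
import random

def simulate(n_trials=20000, n_states=5, shared_sd=0.05,
             local_sd=0.02, base=0.50, seed=0):
    """Toy model: vote share = base + shared swing + state-specific noise."""
    rng = random.Random(seed)
    wins_per_state = [0] * n_states
    all_wins = 0
    for _ in range(n_trials):
        # The shared swing plays the role of the confounding variable.
        swing = rng.gauss(0, shared_sd)
        wins = [base + swing + rng.gauss(0, local_sd) > 0.5
                for _ in range(n_states)]
        for i, won in enumerate(wins):
            wins_per_state[i] += won
        all_wins += all(wins)
    marginals = [w / n_trials for w in wins_per_state]
    product = 1.0
    for m in marginals:
        product *= m
    return marginals, product, all_wins / n_trials

marginals, product_of_marginals, joint_frequency = simulate()
print("Per-state win frequencies:", [round(m, 3) for m in marginals])
print("Product of those frequencies:", round(product_of_marginals, 4))
print("Simulated frequency of winning all five:", round(joint_frequency, 4))
```

Setting `shared_sd=0.0` removes the common swing, which gives a way to compare the two cases side by side in this toy model.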