There is no such math, because it would be invalid. Statistics is the crowbar that is applied to natural philosophy to make science, to make sense of the universe. There is no "silver bullet" that makes the universe fall into perfect understanding.
There are infinitely many cases. Some are inverses of each other. Some are extreme cases. Some are similar to each other. For every case where a given assumption is true, there are infinitely many cases where that assumption is false.
Heuristics are often experience-based, but they have the advantage that they tend to work in many cases, often the more frequently observed ones. They are a generalized abstraction.
Approaches considered:
- use train-validate-test to determine the general form of the model, then use 100% of the data for the fit.
- leave-one-out validation
- cross-validation, where the split-train-validate process is repeated several times and the ensemble result is evaluated
- various splits (50/50, 80/20, 90/10)
- look at the distribution of the results, and use it to inform the split
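
To make the approaches above concrete, here is a minimal sketch in plain Python. Everything in it is an assumption made for illustration: the synthetic data, the straight-line model, and the helper names (`fit_line`, `mse`, `holdout_error`, `kfold_errors`) are hypothetical and not taken from any particular library. It compares a few hold-out splits, runs k-fold and leave-one-out validation, looks at the spread of the fold errors, and then refits on 100% of the data.

```python
import random
from statistics import mean, stdev

def fit_line(xs, ys):
    """Ordinary least-squares fit of y = a + b*x; returns (a, b)."""
    mx, my = mean(xs), mean(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    b = sxy / sxx
    return my - b * mx, b

def mse(model, xs, ys):
    """Mean squared prediction error of a fitted line on (xs, ys)."""
    a, b = model
    return mean([(y - (a + b * x)) ** 2 for x, y in zip(xs, ys)])

def holdout_error(xs, ys, train_fraction, rng):
    """Single random split: fit on the training part, score on the rest."""
    idx = list(range(len(xs)))
    rng.shuffle(idx)
    cut = int(round(train_fraction * len(idx)))
    train, test = idx[:cut], idx[cut:]
    model = fit_line([xs[i] for i in train], [ys[i] for i in train])
    return mse(model, [xs[i] for i in test], [ys[i] for i in test])

def kfold_errors(xs, ys, k, rng):
    """k-fold validation; k == len(xs) gives leave-one-out."""
    idx = list(range(len(xs)))
    rng.shuffle(idx)
    folds = [idx[i::k] for i in range(k)]  # k disjoint folds covering all points
    errors = []
    for fold in folds:
        held_out = set(fold)
        train = [i for i in idx if i not in held_out]
        model = fit_line([xs[i] for i in train], [ys[i] for i in train])
        errors.append(mse(model, [xs[i] for i in fold], [ys[i] for i in fold]))
    return errors

# Synthetic data (an assumption for this sketch): y = 2 + 0.5*x + unit Gaussian noise.
rng = random.Random(0)
xs = [rng.uniform(0, 10) for _ in range(100)]
ys = [2.0 + 0.5 * x + rng.gauss(0, 1) for x in xs]

# Various single splits: each gives one noisy estimate of the error.
for frac in (0.5, 0.8, 0.9):
    print(f"hold-out, {frac:.0%} train: MSE = {holdout_error(xs, ys, frac, rng):.3f}")

# Repeated splits: look at the distribution of the errors, not just one number.
cv = kfold_errors(xs, ys, k=5, rng=rng)
loo = kfold_errors(xs, ys, k=len(xs), rng=rng)
print(f"5-fold:        mean MSE = {mean(cv):.3f}, stdev = {stdev(cv):.3f}")
print(f"leave-one-out: mean MSE = {mean(loo):.3f}")

# Once the form of the model is settled, refit on 100% of the data.
final_model = fit_line(xs, ys)
print(f"final fit: intercept = {final_model[0]:.3f}, slope = {final_model[1]:.3f}")
```

None of these numbers says which split is "right". They only show how much the error estimate moves when the split changes, which is the information the last bullet suggests using.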