My advice to obtain the best results, start with a randomly selected subsample of the population and working with dummy variables, determine the contrasts of interest. Then, apply your pre-selection knowledge employing said contrasts of interest as variables to perform a new analysis on the whole database excluding the previewed subsample.
Why? Here is a reference, to quote:
Planned orthogonal contrasts have the greatest amount of statistical power of any of the multiple comparison methods. That means that planned orthogonal contrasts are more likely to identify true population differences than the alternatives (such as Dunnett and Scheffé). However, they require that you be able to specify your hypotheses in the form of contrasts before the experiment, and that you are able to obtain equal group sizes. If you add even one observation to any of the groups, the correlations among the vectors will no longer be 0.0, you’ll have lost the orthogonality, and you’ll need to resort to (probably) planned nonorthogonal contrasts, which, other things equal, are less powerful.