The denominator, $\Pr(\textrm{data})$, is obtained by integrating out the parameters from the join probability, $\Pr(\textrm{data}, \textrm{parameters})$. This is the marginal probability of the data and, of course, it does not depend on the parameters since these have been integrated out.
Now, since:
- $\Pr(\textrm{data})$ does not depend on the parameters for which one wants to make inference;
- $\Pr(\textrm{data})$ is generally difficult to calculate in a closed-form;
one often uses the following adaptation of Baye's formula:
$\Pr(\textrm{parameters} \mid \textrm{data}) \propto \Pr(\textrm{data} \mid \textrm{parameters}) \Pr(\textrm{parameters})$
Basically, $\Pr(\textrm{data})$ is nothing but a "normalising constant", i.e., a constant that makes the posterior density integrate to one.