I have an ordinal categorical variable (A
, with 3 categories). There are 2 ways to include it in a regression model: 1) as a factor or as 2) a continuous variable.
I am interested in modeling the interaction of this variable with another continuous variable (B
). Including it as a continuous variable results in a significant interaction between A
and B
(Y ~ A + B + A*B
gives P(A*B)=0.001
), but including it as an ordinal variable results in less significant interaction terms (Y ~ A.1 + A.2 + B + A.1*B + A.2*B
yields P(A.1*B)=0.01; P(A.2*B)=0.1
). Purely looking at these p-values, it seems like the continuous form has a better fit and is simpler.
What are general principles to decide which interaction is the better fit? Is there a statistical way to compare the 1 term version with the 2 term version? How do you decide whether to use a more complicated model over a simpler model? (R or Stata code may be helpful.)