Suppose I have a model Y~A + B in which only A is significant (low p-value). After adding an interaction term to form a new model, Y~A + B + (AB), B is still not significant, but the interaction term (AB) is.
I have read that "Note that, an interaction term is added into the model when the main term is already in the model. For example, we should not have the term $x_1x_2$ in the model if there is no term $x_2$ or $x_1$."
Is it a problem that I keep the non-significant term (in this case B) in the model? Should I fit Y~A or Y~A + B + (AB)? I don't think it is appropriate to fit Y~A + AB.
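
To make the quoted rule concrete, here is a minimal sketch on simulated data. Everything in it is an assumption for illustration only: the sample size, the coefficients, and the helper names `rss` and `fit_both` are made up and do not come from my real data. It fits Y~A + B + (AB) and Y~A + AB by ordinary least squares with NumPy, then refits both after shifting A by a constant (as would happen if A were centered or recoded).

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200
A = rng.normal(size=n)
B = rng.normal(size=n)
# Simulated response (hypothetical coefficients): no true main effect of B.
Y = 1.0 + 2.0 * A + 1.5 * A * B + rng.normal(size=n)


def rss(columns, y):
    """Residual sum of squares of an OLS fit with an intercept."""
    X = np.column_stack([np.ones(len(y))] + columns)
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ beta
    return float(resid @ resid)


def fit_both(shift):
    """Fit both candidate models after adding `shift` to A."""
    A_s = A + shift
    full = rss([A_s, B, A_s * B], Y)   # Y ~ A + B + AB
    no_b = rss([A_s, A_s * B], Y)      # Y ~ A + AB (B dropped)
    return full, no_b


full_0, no_b_0 = fit_both(0.0)
full_5, no_b_5 = fit_both(5.0)

print("Y~A+B+AB  RSS, A unshifted vs shifted:", full_0, full_5)
print("Y~A+AB    RSS, A unshifted vs shifted:", no_b_0, no_b_5)
```

In this sketch the residual sum of squares of the full model should agree before and after the shift up to floating-point error, while the model without B should fit differently, meaning its fit depends on where zero happens to sit on the scale of A. That seems to be one reason for keeping B whenever (AB) is in the model, even if B itself is not significant.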