I am working on a fleet management system and I need to predict user request, and I use a linear model to do this.
$r = \beta*X$
I am thinking about dividing X into individual groups and fit a dedicated model for each individual group.
For example, for $X$, we have $x_{7}$ and $x_{8}$, both of which are Gaussian with mean at 0. I am thinking about dividing the training samples into 2 groups: $x_{7}*x_{8}>0$ belongs to one group and $x_{7}*x_{8}<0$ belongs to another group.
From business logic perspective, such division makes sense. In practice, having 2 individual models shows better OOS performance. However, I am not sure if it makes sense from a statistical perspective. Also, I am not sure what's the consequence of such dependency.
Can anyone give any insights here?
Thanks a lot!