I have thinking about how to model proportions for a problem with hierarchical structure.
In the problem, I have observations of users over multiple days, where each observation is a proportion of time spent on different activities in that day.
Each user belongs to a company and companies also belong to clusters.
The output of the model is to predict proportion of time they will spend on activities for the next observation.
An additional requirement is that we will need to make this prediction sometimes for new users, In that case we would like the prediction to be based on the other users in the company. Similarly, we will need to make predictions for new users in companies where we don't have data on others, and would like the prediction to be based on the cluster level.
I was looking for some sort of hierarchical multinomial-dirichlet model that might fit this problem and didn't see one. I took a stab at it with the following, but I wasn't sure if it made sense to use these dirichlet priors like so.
$$ \begin{eqnarray} \boldsymbol{y_{u,t}} &=& (y_1, y_2,...,y_k),\;\sum y_i = 1\\ \boldsymbol{y_{u, t}} &\sim& Multinomial(\boldsymbol\theta_{u,co,cl})\\ \boldsymbol{\theta_{u,co,cl}} &\sim& Dirichlet(\boldsymbol\alpha_{u,co,cl})\\ \boldsymbol{\alpha_{u,co,cl}} &\sim& Dirichlet(\boldsymbol\alpha_{co,cl})\\ \boldsymbol{\alpha_{co,cl}} &\sim& Dirichlet(\boldsymbol\alpha_{cl})\\ \boldsymbol{\alpha_{cl}} &\sim& Dirichlet(\boldsymbol\alpha)\\ \end{eqnarray} $$
I guess an alternative approach is a hiearchical glm model with cluster, company, and user level intercepts.
I'm quite confused as to how to approach this problem, any suggestions?