1

Suppose I have some outcome variable $Y$ that depends on several factors $F_1, \ldots, F_m$, which I assume to be binary for simplicity. Each time I conduct an experiment on levels of the factors, I get a result of $Y$. I would like to understand how much of the sampling variation on $Y$ is attributable to each of $F_1, \ldots, F_m$. For example, 20% of variation when I sample $Y$ is due to $F_1$, etc. What are some general methods for inferring the variance that can be attributable to various underlying factors?

kjetil b halvorsen
  • 63,378
  • 26
  • 142
  • 467
user321627
  • 2,511
  • 3
  • 13
  • 49
  • 2
    It might be helpful to construct an orthogonal design (https://support.minitab.com/en-us/minitab/18/help-and-how-to/modeling-statistics/doe/supporting-topics/basics/orthogonal-designs/). An orthogonal design will allow you to attribute a "proportion of variance explained" for each predictor (https://stats.stackexchange.com/questions/60872/how-to-split-r-squared-between-predictor-variables-in-multiple-regression) which sums to the full model r squared. – Demetri Pananos Dec 28 '21 at 03:40
  • @DemetriPananos: This looks like an answer, can you make it into one? – kjetil b halvorsen Jan 01 '22 at 03:56
  • @kjetilbhalvorsen I've copy and pasted my comment as an answer. – Demetri Pananos Jan 01 '22 at 04:18

1 Answers1

1

It might be helpful to construct an orthogonal design (https://support.minitab.com/en-us/minitab/18/help-and-how-to/modeling-statistics/doe/supporting-topics/basics/orthogonal-designs/). An orthogonal design will allow you to attribute a "proportion of variance explained" for each predictor (How to split r-squared between predictor variables in multiple regression?) which sums to the full model r squared.

Demetri Pananos
  • 24,380
  • 1
  • 36
  • 94