The Shapley Value Regression: Shapley value regression significantly ameliorates the deleterious effects of collinearity on the estimated parameters of a regression equation. The concept of the Shapley value originated in cooperative game theory, where agents form coalitions and cooperate with one another to raise the value of a game in their favour and later divide that value among themselves. Distributing the value of the game according to the Shapley decomposition has been shown to have many desirable properties (Roth, 1988: pp 1-10), including linearity, unanimity and marginalism. Following this theory of sharing the value of a game, Shapley value regression decomposes the R² of a conventional regression (taken as the value of the cooperative game) such that the mean expected marginal contributions of the predictor variables (the agents in coalition to explain the variation in y, the dependent variable) sum to R².
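In symbols, this decomposition is the standard Shapley value with the subset R² playing the role of the characteristic function. A compact statement (the notation here is mine, not from the sources cited; R²(S) denotes the R² of an OLS regression of the dependent variable on the subset S of predictors) is:

```latex
% Shapley share of predictor x_i over the k predictors in X,
% with R^2(\emptyset) = 0 by convention:
S_i \;=\; \sum_{P \subseteq X \setminus \{x_i\}}
      \frac{|P|!\,(k-|P|-1)!}{k!}
      \left[ R^2\!\left(P \cup \{x_i\}\right) - R^2(P) \right],
\qquad
\sum_{i=1}^{k} S_i \;=\; R^2(X).
```

The weight |P|!(k-|P|-1)!/k! is exactly what results from first averaging marginal contributions over all subsets of a given size and then averaging over sizes, as described below.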
The scheme of Shapley value regression is simple. Suppose y is the dependent variable and x1, x2, ..., xk ∈ X are the predictor variables, which may be strongly collinear. Let Yi ⊂ X be the set of predictors that excludes xi, so that xi ∉ Yi and Yi contains only k-1 variables. We draw r (r = 0, 1, 2, ..., k-1) variables from Yi and call the collection of variables so drawn Pr, so that Pr ⊆ Yi; for r=0, Pr is the empty set. For a given r, Pr can be drawn in L = (k-1)Cr ways. Also, let Qr = Pr ∪ {xi}. Regress (least squares) y on Qr to find R²q, and regress y on Pr to obtain R²p. The difference Dr = R²q - R²p is the marginal contribution of xi to y for that subset. This is done for all L combinations for a given r, and the arithmetic mean of the L values of Dr is computed. Once this mean is obtained for each r, its arithmetic mean over r is computed. Note that when Pr is null (r=0), Qr contains the single variable xi, and the R² of the null set is taken as zero. The result is the mean (or expected) marginal contribution of xi to y. Doing this for every xi, i=1, 2, ..., k, yields the Shapley value Si of each xi. In the regression model y = Xb + u, OLS gives a value of R², and the sum of all Si, i=1, 2, ..., k, equals that R²; thus the OLS R² has been decomposed. Once all Shapley value shares are known, one may retrieve the coefficients (with the original scale and origin) by solving an optimization problem suggested by Lipovetsky (2006) using any appropriate optimization method. A simple algorithm and computer program are available in Mishra (2016).
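The scheme above can be sketched in a few lines of Python. This is a minimal illustration, not the program of Mishra (2016); the function names r_squared and shapley_r2 are my own, and OLS is done with numpy's least-squares solver.

```python
from itertools import combinations

import numpy as np


def r_squared(y, X):
    """R-square of an OLS regression of y on the columns of X (with intercept)."""
    A = np.column_stack([np.ones(len(y)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ coef
    return 1.0 - (resid @ resid) / ((y - y.mean()) ** 2).sum()


def shapley_r2(y, X):
    """Shapley shares S_i of R-square, one per column of X.

    For each predictor i and each subset size r, average the marginal
    contribution D_r = R2(P ∪ {x_i}) - R2(P) over all (k-1 choose r)
    subsets P of the remaining predictors, then average over r.
    """
    n, k = X.shape
    shares = np.zeros(k)
    for i in range(k):
        others = [j for j in range(k) if j != i]
        for r in range(k):  # subset size r = 0, 1, ..., k-1
            combos = list(combinations(others, r))
            total = 0.0
            for P in combos:
                r2_p = r_squared(y, X[:, list(P)]) if P else 0.0  # R2 of null set is 0
                r2_q = r_squared(y, X[:, list(P) + [i]])
                total += r2_q - r2_p
            # mean over the L combinations, then (via the 1/k) over the k sizes
            shares[i] += total / len(combos) / k
    return shares
```

By the efficiency property of the Shapley value, shapley_r2(y, X).sum() reproduces r_squared(y, X), the R² of the full regression, even when the columns of X are highly collinear.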