What is the difference (interpretation) between the partial R^2 and the SHAP value for a linear regression model?

Juan · November 14, 2022, 11:27pm

Both calculate the contribution of a given variable to the output’s variability but…

To calculate the coefficient of partial determination R2 for a given variable:
We calculate the R2 with and without that variable and substract them. This implies fitting a different model with and without that variable.
I don’t know if for this calculation it’s better to use adjusted R2s or unadjusted ones.

The procedure to calculate the SHAP value is more convoluted:
The output for a given input is substracted from the output for all other possible inputs, generated by fixing one variable and randomly modifying another one, or two, or three… in different orders. And all these differences are averaged out.
This process can be repeated from different staring inputs and take the grand average.

SHAP is a black-box method useful for any model but it’s very slow. Sampling some of the data and interpolation may be used to accelerate its calculation.

Intuitively, how are the interpretation of the partial R2 and the SHAP value different when applied to a linear regression model?

Topic		Replies	Views
Measuring R squared of a linear regression with GLM and Dataframes General Usage question , glm	3	2690	November 23, 2016
Understanding Linear Regression and finding R-Squared Statistics	11	5145	March 24, 2020
Pseudo r2 in probit New to Julia statistics	1	701	February 1, 2017
JuliaDB & OnlineStats syntax for linear regression General Usage package	5	790	June 15, 2018
Accuracy_score General Usage	2	393	October 3, 2020

What is the difference (interpretation) between the partial R^2 and the SHAP value for a linear regression model?

Related topics