I am trying to implement a model (Suita Score) shared in this paper:
https://www.jstage.jst.go.jp/article/jat/21/8/21_19356/_pdf/-char/en
for predicting the 10-year probability of CHD they given the probability function:
P = 1 - S(t) ^ exp(X, M) f(X, M) = β1 * (X1 - M1) + ... + βn * (Xn - Mn)
Where:
S(t) = survival rate for the mean of values of risk factors in the Suita cohort βs = the regression coefficients Xs = the observed risk factor values Ms = the mean risk factor values in the Suita cohort
In the tables of data provided they give:
- The mean values for all risk factors in their study (Table 1)
- The cox regression model coefficients (Table 4)
They also state:
"The beta coefficients corresponding to the Cox model were multiplied 10 times for categorical covariates and were rounded. For the age category, the midpoint of each category was multiplied by the β coefficients in Table 4, and then multiplied 10 times. We added all these values corresponding to each individual risk, divided the number by 10, and then the corresponding probability of CHD was calculated from the equation: P = 1 - S(t)^exp((sum of points)/10) where S(t) is the baseline survival function of the Suita cohort."
There are a few things confusing me about re-implementing this:
- How to I calculate the baseline survival function?
- Why all the multiplication and division by 10?
- What are they doing to the age category and why?
Any replies much appreciated even if you tell me it is not possible to re-implement from this information.