I am new to survival analysis. Below is my data with very unbalanced sample size (treat group has 2 samples with 1 event, 1 censored and control group has 700+ samples). I use Cox regression in 'survival' package in R and results show 3 different tests (likelihood ratio test, log rank test and Wald test).
sample trt censor time
A7 TRT 0 1.0219178
BH TRT 1 0.6136986
SB C 0 0.7095890
SD C 0 1.1972603
SE C 0 3.6191781
.. .. .. ..
A1 C 0 4.0082192
My code:
coxph(Surv(time,censor)~trt, data=dataAll)
Result:
> coxfit
Call:
coxph(formula = Surv(time, censor) ~ trt, data = dataAll)
n= 772, number of events= 100
coef exp(coef) se(coef) z Pr(>|z|)
trtC -3.80047 0.02236 1.04854 -3.625 0.000289 ***
---
Signif. codes: 0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
exp(coef) exp(-coef) lower .95 upper .95
trtC 0.02236 44.72 0.002864 0.1746
Concordance= 0.513 (se = 0.002 )
Rsquare= 0.007 (max possible= 0.73 )
Likelihood ratio test= 5.55 on 1 df, p=0.01845
Wald test = 13.14 on 1 df, p=0.0002895
Score (logrank) test = 38.85 on 1 df, p=4.579e-10
My questions are:
- There are 3 tests giving different p values, and they look quite different with the likelihood ratio test the most conservative. Do they all test for the significant of the Cox coefficient? Which one should I choose?
- Give the fact that the treatment group has so few samples, could the p value trustable?
- Is it appropriate to apply Cox regression to unbalanced sample? If no, is there any alternative methods?
Thanks a lot!
J