I am trying to use CoxTimeVaryingFitter model in python from lifelines package, for making inference on which features have a causal impact on a success outcome. The features are time-varying so this model seems appropriate.
As time 't' increases, the success outcome rate decreases, and so the imbalance of outcome increases with time. My questions are:
Should imbalance be handled in an inference model? If so, what is the best way and should it be handled at each time 't'?
Should the rows in the train dataset be only up to a chosen max time t? ie. should entries at large time 't' where success rate is extremely small be excluded from the train dataset ? Is there a good way to choose the cutoff point for 't'?