Both conditional logistic regression and survival analysis are forms of semi-parametric inference where complex relationships between unmeasured risk factors (such as a baseline hazard function or unmeasured risk factors) are controlled by organizing data into risk sets.
Formally, a risk set in survival analysis is a collection of individuals at risk for the event at each time point in which a failure is observed. The distribution of measured risk factors of the survivng cohort is compared to those of individual who failed at the event time. This ratio allows us to control for the complex, unmeasured baseline hazard function which other factors mediate multiplicatively using a hazard ratio. We ignore the amount of time that actually elapses between each failure time, and consider each risk set to be incrementally at "greater risk" by some unknown amount due to their longer duration of follow-up.
A conditional logistic regression does not have a risk set, per se, but a matched set. These are individuals among whom all unmeasured risk factors are assumed to be the same. Conditional logistic regression iteratively predicts what the cumulative risk of events is in each matched set insofar as matched sets can be ranked in terms of their unmeasured risk. Using a Cox model, each ranked matched set is treated like a risk set in a Cox Model, and then the odds ratios for events are calculated using the same partial likelihood from the Cox Model. Using predictions from these estimated odds ratios, the ranking is updated to account for what is now known about these matched sets' risk due to unmeasured factors (since our updated predictions take better account of the measured risk factors using odds ratios). This process iterates until there is agreement (or convergence) using an expectation maximization framework. This is why clogit
takes so much longer to converge than a simple Cox Model.
Formally, because there is a "little bit of estimation" in terms of risk of unmeasured factors in the conditional logistic regression, this method is a "conditional likelihood" maximization whereas the Cox Model is a "partial likelihood" maximization.
So
Data structure) risk sets / matched sets
Why) both account for unmeasured sources of risk.