Questions tagged [survival]

Survival analysis models time to event data, typically time to death or failure time. Censored data are a common problem for survival analyses.

Survival analysis includes an array of non-parametric, semi-parametric, and fully parametric methods for analyzing time to event data. Often, these analyses aim to estimate a survival function, $S(t)$, which describes the proportion of subjects surviving at time $t$. A key feature of survival analysis is the ability to incorporate censored data, in which the event of interest does not occur during the observation period.

The most common form of censoring is "right censoring" where the event doesn't happen by the time the data are collected (e.g. patients who are still alive at the end of the study). Left censoring is when the event happens before the study starts and that is the only information on the time of the event. Interval censoring is when the event occurs at some point in the study, but the point is not precisely known.

2533 questions
54
votes
8 answers

How to tell the probability of failure if there were no failures?

I was wondering if there is a way to tell the probability of something failing (a product) if we have 100,000 products in the field for 1 year and with no failures? What is the probability that one of the next 10,000 products sold fail?
melonfresh
  • 541
  • 1
  • 5
  • 3
51
votes
5 answers

Prediction in Cox regression

I am doing a multivariate Cox regression, I have my significant independent variables and beta values. The model fits to my data very well. Now, I would like to use my model and predict the survival of a new observation. I am unclear how to do this…
Marja
  • 513
  • 1
  • 5
  • 4
49
votes
7 answers

Survival Analysis tools in Python

I am wondering if there are any packages for python that is capable of performing survival analysis. I have been using the survival package in R but would like to port my work to python.
MarkSAlen
  • 2,559
  • 5
  • 24
  • 25
42
votes
14 answers

References for survival analysis

I am looking for a good book/tutorial to learn about survival analysis. I am also interested in references on doing survival analysis in R.
MarkSAlen
  • 2,559
  • 5
  • 24
  • 25
38
votes
10 answers

Why are survival times assumed to be exponentially distributed?

I am learning survival analysis from this post on UCLA IDRE and got tripped up at section 1.2.1. The tutorial says: ... if the survival times were known to be exponentially distributed, then the probability of observing a survival time ... Why…
Haitao Du
  • 32,885
  • 17
  • 118
  • 213
34
votes
3 answers

What distribution does my data follow?

Let us say that I have 1000 components and I have been collecting data on how many times these log a failure and each time they logged a failure, I am also keeping track of how long it took my team to fix the problem. In short, I have been recording…
30
votes
3 answers

Imputation before or after splitting into train and test?

I have a data set with N ~ 5000 and about 1/2 missing on at least one important variable. The main analytic method will be Cox proportional hazards. I plan to use multiple imputation. I will also be splitting into a train and test set. Should I…
Peter Flom
  • 94,055
  • 35
  • 143
  • 276
30
votes
2 answers

In survival analysis, why do we use semi-parametric models (Cox proportional hazards) instead of fully parametric models?

I've been studying the Cox Proportional Hazards model, and this question is glossed over in most texts. Cox proposed fitting the coefficients of the Hazard function using a partial likelihood method, but why not just fit the coefficients of a…
user1956609
  • 565
  • 2
  • 5
  • 8
23
votes
2 answers

Bayesian Survival Analysis: please, write me a prior for Kaplan Meier!

Consider right-censored observations, with events at times $t_1, t_2, \dots$. The number of susceptible individuals at time $i$ is $n_i$, and the number of events at time $i$ is $d_i$. The Kaplan-Meier or product estimator arises naturally as a MLE…
Elvis
  • 11,870
  • 36
  • 56
23
votes
4 answers

Intuition for cumulative hazard function (survival analysis)

I'm trying to get intuition for each of the main functions in actuarial science (specifically for the Cox Proportional Hazards Model). Here's what I have so far: $f(x)$: starting at the start time, the probability distribution of when you will…
Jon
  • 415
  • 1
  • 3
  • 7
21
votes
4 answers

Survival analysis: continuous vs discrete time

I am confused about how to decide whether to treat time as continuous or discrete in survival analysis. Specifically, I want to use survival analysis to identify child- and household-level variables that have the largest discrepancy in their impact…
smm
  • 211
  • 1
  • 2
  • 3
21
votes
2 answers

Why does the US death rate not "match" life expectancy

From the CDC (https://www.cdc.gov/nchs/fastats/deaths.htm): Death rate: 863.8 deaths per 100,000 population Life expectancy: 78.6 years Now in a static situation I would expect that the death rate to be the reciprocal of the life expectancy or…
DavidS
  • 313
  • 2
  • 5
20
votes
2 answers

How do I interpret Exp(B) in Cox regression?

I'm a medical student trying to understand statistics(!) - so please be gentle! ;) I'm writing an essay containing a fair amount of statistical analysis including survival analysis (Kaplan-Meier, Log-Rank and Cox regression). I ran a Cox regression…
Alex
  • 459
  • 1
  • 4
  • 11
20
votes
1 answer

Basic questions about discrete time survival analysis

I am attempting to carry out a discrete time survival analysis using a logistic regression model, and I'm not sure I completely understand the process. I would greatly appreciate assistance with a few basic questions. Here is the set up: I'm…
Talbot Katz
  • 361
  • 2
  • 5
19
votes
3 answers

How to get predictions in terms of survival time from a Cox PH model?

I want to develop a prediction model (Cox PH) for all-cause mortality in a dataset of participants of whom (almost) all have died at the end of follow-up (e.g. 1-year). Instead of predicting the absolute risk of dying at a certain timepoint, I…
Rob
  • 593
  • 1
  • 4
  • 14
1
2 3
99 100