Print PDF
In describing measurement of event rates (see below) we refer to the death rate expressed either as a percentage of all patients at risk or as the rate per period risk. All these data tell us is that, for the period measured, so many patients died and so many survived [survival = 1 – mortality]. What about patients who did not die but stopped treatment, e.g., patients who were transplanted or moved to peritoneal dialysis from hemodialysis treatment? What is the probability that a patient will die having received treatment for 6 months compared to the probability of death after 6 years’ treatment? If comparing two treatments over a period of time, say two years, how do we measure the risk of failure (or success) for those who have not yet experienced an event? How can we measure the probability of an event occurring within a certain time?
To address these problems statistics offers what are called Life Table Methods (the methods were originally developed for the purpose of assessing risk by life insurance companies). There are several variants of the basic statistic; we will explore two – the Kaplan-Meier probability of survival method and the Cox proportional hazards model.
Kaplan-Meier probability of survival
This is the most commonly used life-table method in medical practice. It adequately copes with the issues raised above such as patients for whom the event has not yet occurred and for those lost to follow up from treatment. The data required by the method are the time of commencement of the treatment and the time of the measured event (e.g., death, relapse, hospitalization) occurred. Patients for whom an event other than that being measured occurred and led to drop-out from treatment and those patients still alive at the end of the study period are “censored” but the contribution that they have made to the event probability are fully accounted for.
In simple terms, what the method does is record the time since initiation of treatment at which an event occurs (e.g., 3 days, 1003 days) and counts the number of patients at risk of the event at that time. The rate of the event at that time is then one divided by the number at risk. This is repeated for each event at each time. By multiplying the rate at each time by that for the time of the previous event a cumulative rate is calculated. With appropriate calculation the cumulative probability of the event can be calculated. Patients who do not experience the event contribute to the number at risk but not to the event rate for as long as they remain on treatment.
An example of the graphical output of a Kaplan-Meier survival analysis is shown below.

Click image to enlarge
This chart is from a study comparing the effect of high-flux (HFD) dialysis to low-flux (LFD) dialysis on patient survival. One of the features of the Kaplan-Meier is the ability to compare different categories of patients or different treatments. Time (days) is seen on the X-axis and the probability of survival on the Y-axis. The points on the chart represent the probability of survival at each interval of time for HFD (upper line) and LFD (lower line).
It is clear that the survival for patients on HFD exceeds that for patients on LFD particularly where survival after about 3 years (1000 days) is concerned. The probability of a patient on HFD surviving 5 years (about 1800 days) exceeds 95% whereas that for the LFD patients is 83%. But is the difference statistically significant? The Kaplan-Meier method provides several statistical tests of significance. The test used in the above analysis (Log Rank test) reported that the p value was 0.03 – there was less than 3 in 100 probability that the difference was due to chance alone – which is statistically significant.
Cox proportional hazards model
The Kaplan-Meier method does permit comparisons between patient groups or between different therapies or even different therapies across several patient groups. It cannot, however, be used to measure the impact of a continuous variable (e.g., serum albumin concentration) on the probability of an event, nor can it definitively quantify the risk of the event according to the value of a variable. Cox proportional hazards model is a statistical method that also determines a cumulative probability of an event but additionally allows the quantification of the impact of one or many variables upon that probability. The independent variables may be either continuous or categorical. In a sense, the Cox model is a merger of logistic regression and life-table analysis.
The method is too complicated to describe here and we will confine the discussion to interpretation of the sort of results output by the model. As the name of the method suggests, one of its useful functions is the calculation of a hazard ratio (HR) or relative risk (RR). The HR is somewhat similar to, but should not be confused with the odds ratio (OR) described under logistic regression. The HR tells us the degree to which a variable impacts the probability of the event being measured. We can then state what is the relative risk of the event attributable to the variable. The most important feature of the Cox model is that the model calculates the hazard effect of each variable while fully accounting for all other variables included in the model. We will illustrate this with the data in the figure below.

Click image to enlarge
An analysis from the U.S.A. was designed to determine whether the type of dialyzer membrane used had an effect on all death and cause specific death. They compared treatment with standard cellulose membranes to treatment with either modified cellulose or synthetic membranes. Since many different patient and treatment variables affect patient death, it was necessary to build a model which included all these variables and the membrane-type variable and allowed the study group to separate the membrane effect from all the other variables. In this model the authors included factors such as patient age and gender, whether or not they had diabetes, whether or not they had other comorbid factors, etc. The model then reports the relative risk of death for all of these factors and for membrane effect and can determine whether the factors are acting independently or interdependently on the risk of death. The model also reports the significance level for the effect of each variable. Like the odds ratio in logistic regression, the significance of the RR attributable to any variable is dependent upon the confidence interval around the value of the RR. As with OR values, RR above 1 indicate increased risk and values below 1 reduced risk; where the 95% confidence interval includes 1 the effect is not significant and the p value exceeds 0.05.
In this study the use of modified cellulose or synthetic membranes (MC-S) reduced the risk of death by 18% (RR = 0.82; 1-0.82 = 0.18 = 18%) when compared to treatment with standard cellulose (SC). The reduced risk was highly significant (p = 0.002). They also showed that if deaths were characterized by the cause of death the risk reduction effect was significant for deaths due to coronary artery disease (CAD: p = 0.007) and deaths due to infection (p = 0.03) but not for deaths due to malignant disease (p > 0.05) or deaths due to cerebrovascular disease (p > 0.05). It is important to note that the reduction in risk of death associated with membrane type is independent of the effect of all the other variables that were entered in the model.
There are numerous pitfalls in using the Cox model. However, the information provided by this statistical method is useful in studying factors which impact the probability of an event occurring within a time interval.
Back to Basic Statistics Next