HAP 525:
|
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
This lecture demonstrates how to combine the effect of several causes of an event.
A few definitions will be helpful in setting up the method of combining impact of several causes. To analyze the joint effect of two or more causes, we need to have a function called "hazard" function. Before we do so, we need to define several terms that are used in the calculation of a hazard function. We begin with the simplest concept the Probability Distribution Function. This is the probability that the event of interest will occur at time period t.
The cumulative distribution function is the probability that the event would occur prior to or at time t. It is easy to calculate the cumulative distribution function from the probability distribution functions. The cumulative distribution function is the sum of the probability distribution function for current and prior time periods.
Survival function gives the probability of surviving till after time period t. Note the survival function is the complement of cumulative distributive function and can be calculated as one minus the Cumulative Distribution Function.
Now we are ready to calculate the Hazard function. It is defined to be the probability of an event occurring at time period t given that it has not occurred prior to that time period. The hazard function is calculated as the ratio of the Probability distribution function and the survival function. Table 1 shows a summary of these definitions.
|
Function Name
|
Formula
|
Definition
|
Related Terms
|
|
Probability Distribution Function
|
p(X=t)
|
Probability of event occurring at time
"t."
|
|
|
Cumulative Distribution Function
|
p((X≤t)
|
Probability of the event occurring
prior to or at time "t."
|
Sum of PDF for ≤ t
|
|
Survival Function
|
p(X>t)
|
Probability of the event not occurring
prior or at time "t."
|
1-CDF
|
|
Hazard Function
|
p(X=t|X≥t)
|
Probability of the event occurring at
time t given that it has not occurred prior to this time
|
PDF/SF
|
| Table 1: Definition of Terms | |||
When the sentinel event is rare, the hazard function is essentially the same as the probability distribution function. Think about it, when something is rare, the survival function is near one and the hazard rate and probability distribution function are nearly equal.
An example will demonstrate (see Table 2 for calculations). Suppose we know that a hip replacement surgery will last at most 5 years. Furthermore, suppose in the absence of any other information we estimate that it is equally likely for the prosthesis to fail in any of the next five years. Obviously this is an over implication as the prosthesis is more likely to fail in later years, but for the sake of the current example lets accept this simplification. In real situations, the yearly probabilities can be estimated and used instead of equal yearly probabilities. Under assumption of equal yearly probability, the probability of it occurring in any one year is 1 divided by 5 or 0.20. Given this distribution, we want to understand what is the probability that if the prosthesis has not failed for 2 years that it will fail in the 3rd year, or more broadly what is the hazard function associated with each of the years.
The next step is to calculate the cumulative distribution function. We need to calculate a sum of the probability for the current and the prior years. Note that this is the cumulative probability in the immediate prior year plus the probability distribution in the current year. These numbers suggest that the probability of the prosthesis failing either in year one or in year 2 is 40%.
The survival function is one minus the prior years cumulative distribution. Probability of surviving the start of year one is 1. The probability of surviving until the start of year two, in this example, is 0.8. The probability of surviving till the start of year 5 is 0.20. Note that the survival probability or the chances that the prosthesis will not fail decreases with time.
|
Year
|
Probability Distribution Function
|
Cumulative Distribution Function
|
Survival Function
|
Hazard Function
|
|
1
|
0.20
|
0.20
|
1.00
|
0.20
|
|
2
|
0.20
|
0.40
|
0.80
|
0.25
|
|
3
|
0.20
|
0.60
|
0.60
|
0.33
|
|
4
|
0.20
|
0.80
|
0.40
|
0.50
|
|
5
|
0.20
|
1.00
|
0.20
|
1.00
|
|
5+
|
0
|
|||
| Table 2: An Example for Calculating Hazard of Failure of Prosthesis in Hip Replacement Surgery | ||||
Now we can calculate the Hazard function. This is the function that calculates the probability of the event occurring this year if it has not occurred in the prior years. This is calculated as the ratio of the probability function and the survival function. Note the hazard function increases each year as the more the prosthesis does not fail the more likely that it will fail in the remaining years. So now we can answer the question asked earlier regarding what is the probability of the failure in year 3 if the hip replacement has been successful in years 1 and 2. This probability is 33%, significantly higher than the probability of failing in year 3 which was calculated to be 20%.
Processes with constant hazard rate have a Poisson distribution. Even when the hazard rate is not constant over time, it is a constant value at any particular time and therefore the Poisson distribution can be used to measure it. In these circumstance, any sentinel event can be thought of to have a Poisson distribution and arriving for the first time after t periods of time. In this sense, the Poisson distribution allows us to estimate the probability of a sentinel event occurring in the next time period given that it has not occurred so far.
The Poisson formula calculates the relationship between the probability of a sentinel event occurring in the next time period based on the events constant hazard rate h. Note that e to the power of the hazard rate is the probability that the event will not occur during the next time period. One minus this calculation is the probability that it will occur.
p(Surviving next period) = eh
p(Sentinal event next period)=1-eh
The Poisson formula can be turned around to calculate the hazard rate. Solving for h, we obtain a way of calculating the hazard function from the probability of occurrence of the sentinel event. This formula is commonly used in many risk analyses because the Poisson process allows the interpretation of hazard rate as the count of arrivals of a random event. Since the effect of multiple causes can be counted and added together, then this interpretation allows us to add and subtract hazard rates to estimate the effects of multiple causes and constraints.
h=-log(1-p)
Where p is the probability of sentinel event occurring the next time period.
If we can assume that various causes are independent of each other, then the hazard function from all sources can be calculated as the sum of hazard function from each source. This is a very helpful concept as it allows us to calculate the total impact of multiple causes. If h(t) shows the combined hazard function and hi(t) shows the hazard associated with cause "i", then the hazard function of the combination can be calculated as:
h=h1+h2+...+hn
For example, under assumption of independence, the hazard function for a smoker exposed to asbestos is said to be the sum of hazard due to smoking plus the hazard due to asbestos.
h=hSmoking+hasbestos
The combined hazard risk can also be used to calculate the attributable risk to a specific cause. The attributable risk due to a cause is calculated as the ratio of its source specific hazard rate divided by the hazard function from all sources.
ARi= hi/ h
An example can help demonstrate the concept of attributable risk. If the hazard function for medication error caused by a fatigued nurse is 1 in 1000 and the hazard function for medication error caused by illegible prescription order is 2 in 1000, what is the attributable risk to the nurse and to the physician? Using the formula for attributable risk we calculate this as the ratio of nurses hazard rate over the total hazard rate. In this case it is 33%. In contrast, 66% of the medication error risk is attributable to the physicians poor prescription writing skills. The most probable cause of the event is the physician's ineligible writing. In this fashion, relative attribution can be made to separate risks. Correct attribution, of course, is important to priorities one sets for remedies to the problem. If 66% of risk is due to ineligible prescriptions, then it might be best to resolve this problem first before addressing the problem of fatigued nurses. Of course, the best action is to address both causes but if we cannot and need to set a priority, the analysis shows how to focus on key causes that are more responsible for the observed sentinel events.
It is often not possible to collect sufficient data to estimate the possible combinatorial combination of impact of multiple causes of a sentinel event. For example, if there are 5 causes for a sentinel event and each cause maybe present or absent, then there are 2 to the power of 5 possible combination of causes some of which are quite rare and therefore unlikely to occur even in long streams of data. In these circumstances, the hazard associated with combination of causes can be estimated from the available hazard functions. The following provides a set of heuristic for how to estimate the hazard of missing combination of causes:
The point of this lecture has been that hazard functions are key to understanding the combination of various causes. We presented heuristics that can be used to estimate the hazards associated with missing combination of causes. More details on an algorithm to estimate missing combination of causes is available in the section titled: Analytical Relapse.
To assist you in reviewing the material in this lecture, please see the following resources:
See the slides for the lecture
Listen to
narrated lecture on calculating hazard functions and multiple causes.
See narrated lecture in three parts:
Part 1: Calculation of Hazard Function from Distributions
Part 2: Risk Attributed to Different Causes
Part 3: Risk Attributed to Different Cases (Continued)
Narrated lectures require use of Flash.
In this section, you will find answers to questions asked by you or others.
Database Results Wizard ErrorYou can suggests changes or see below suggestions made by others:
Database Results Wizard Error