|
|
||||||||
Statistical Concepts Series |
1 From the DATA Group, Department of Radiology, Massachusetts General Hospital, Zero Emerson Pl, Suite 2H, Boston, MA 02114. Received October 19, 2001; revision requested December 26; revision received March 27, 2002; accepted April 9. Address correspondence to E.F.H. (e-mail: elk@the-data-group.org).
| ABSTRACT |
|---|
|
|
|---|
© RSNA, 2002
Index terms: Statistical analysis
| INTRODUCTION |
|---|
|
|
|---|
Similarly, the prevalence of disease such as malignant liver cancer among patients with cirrhosis is a probability. It is the fraction of patients with a history of cirrhosis who have a malignant tumor. It can be determined simply from the number of patients with cirrhosis and the number of patients with both cirrhosis and malignant tumors of the liver or from the two prevalences.
The likelihood that a patient with a cirrhotic liver and positive CT findings has a malignant tumor (the positive predictive value [PPV] of CT in this setting) is another probability. It is determined by combining the prevalence of the disease among patients with cirrhosis with the sensitivity and specificity of CT.
In all its forms, probabilities obey a single set of rules that determine how they may be combined; this article presents an outline of these rules. The rules are also found and explained in greater detail in textbooks of biostatistics, such as that by Rosner (1).
| THE EARLIEST DEFINITION OF PROBABILITY |
|---|
|
|
|---|
In its earliest manifestation, probability was a tool for gamblers. The earliest expressions of probability dealt with simple gambling games such as flipping a coin, rolling a die, or dealing a single card from the top of a deck. The essential property assumed of such games was that the possible outcomes were all equally likely. The probability of some event was proportional to the number of individual outcomes that comprised the event.
In more complicated scenarios, the fundamental outcomes might not be equally likely or there might be an infinite number of them. In these situations, the definition of a probability of an event became the fraction of times that the event would occur by chance. That is, it is the fraction of times in a sufficiently long sequence of trials where the chance of the event was the same in all trials and was not affected by the results of previous or subsequent trials.
With either definition, counting possible outcomes or measuring the frequency of occurrence, probability was susceptible to the laws and rules of simple arithmetic. The simplest rule of all was the rule of addition. Addition of the fraction of patients who have a single liver tumor that is malignant to the fraction of patients who have a single tumor that is benign must yield the fraction of patients who have a single tumor, either malignant or benign.
Algebraically, we express this "additive" rule for two events, A and B, as the following: If Prob(A AND B) = 0, then Prob(A OR B) = Prob(A) + Prob(B), where we use Prob(A) to denote the probability of A. The condition "Prob(A AND B) = 0" is a way of saying that the simultaneous occurrence of both A and B is impossible.
One consequence of this rule is that the chance that an event would not happen is immediately determined by the chance that it would happen. As "A" and "NOT A" (the event that A would not happen) cannot occur simultaneously, yet one or the other is certain, Prob(A) + Prob(NOT A) = 1 or Prob(NOT A) = 1 - Prob(A). Thus, the fraction of patients who do not have a single tumor is just the complement to the fraction of patients who do.
The limitation "Prob(A AND B) = 0" is necessary, as the rule does not apply without modification when it is possible for both events to be true. For instance, addition of the fraction of patients who have at least one malignant tumor to the fraction of patients who have at least one benign tumor may cause overestimation of the fraction of patients with tumors, either malignant or benign. The extent of the overestimation is given by the fraction of patients who have both benign and malignant tumors. These patients were included in the counts for both tumor-type specific fractions but should be counted only once when patients who have at least one tumor are counted. This may be best seen in Figure 1, where the added shaded areas of A and B yield a sum greater than the total shaded area. The area representing the overlap of A and B must be accounted for.
|
| SUBJECTIVE VERSUS OBJECTIVE PROBABILITIES |
|---|
|
|
|---|
In radiology, before an examination, the probability that a patient will have an abnormality detected is a fraction between 0 and 1. After the procedure and after the image has been viewed, the probability is either 1 or 0, depending on whether an abnormality has been detected. In the interim, after the procedure but before the image has been viewed, the subjective probability is the same as the objective probability was before the procedure.
| CONDITIONAL PROBABILITIES AND INDEPENDENCE VERSUS DEPENDENCE |
|---|
|
|
|---|
This forms the basis of the definition of sensitivity and specificity. If we let A stand for a positive diagnostic examination result and B to represent the actual presence of the disease, then Prob(A|B) is the sensitivity, the chance of a positive examination result among individuals with disease or of a true-positive result. The definition can also be used to calculate the chances derived from a succession of diagnostic tests. For instance, if confirmation of any positive test result, D1+, is required by means of a second positive test result, D2+, then the chance that we will obtain two positive test results is given by the "multiplicative" law of probabilities: Prob(D1+ AND D2+) = Prob(D2+|D1+) x Prob(D1+).
That is, the chance that both tests have positive results is a fraction of all second tests with positive results, once the first test had a positive result times the chance that the first test had a positive result. The rule can be similarly used to calculate the chance of two successive negative findings or any other combination.
Occasionally, the chances for the second examination, D2, are unaltered by the results of the initial examination, D1. For instance, after the diagnostic image has been obtained, the interpretation by a blinded reader, D2, should be unaffected by the prior interpretation by another blinded reader, D1, of the same image. In these situations, the two interpretations are said to be independent (2).
If A and B are independent, then Prob(A) = Prob(A|B) = Prob(A|NOT B); that is, the (conditional) chance of A given that B occurs is the same as the (conditional) chance of A given that B does not occur and, as a result, also equals the (unconditional) chance of A.
A consequence is the multiplicative law of probabilities for independent events; namely, if A and B are independent, then Prob(A AND B) = Prob(A) x Prob(B).
In practice, the diagnostic tests of radiology are rarely truly independent as CT, magnetic resonance imaging, and ultrasonography all rely similarly on the lesion size and differences in tissue characteristics. Yet, quite often, the multiplicative law is used as an approximation because the exact conditional probability has never been accurately determined in clinical trials of both diagnostic modalities.
The same rules may be used to calculate the risk of disease in the presence of multiple predictive characteristics (or cofactors). If the effects of the cofactors are synergistic, the more general multiplicative rule must be used. But if the effects of the cofactors are unrelated or independent, the multiplicative rule for independent events may be used. Similarly, the rules may be used to compute the overall chance of a successful treatment of disease with a succession of treatments based on the (conditional) chance of success of each treatment.
| BAYES RULE AND POSITIVE AND NEGATIVE PREDICTIVE VALUES |
|---|
|
|
|---|
Both sensitivity and specificity are considered to be inherent invariant test characteristics. In contrast, the PPV and the negative predictive value depend not only on the sensitivity and specificity but also on the prevalence of the disease, Prob(D). They may be combined by using Bayes rule (3), which relates the PPV, Prob(D|Dx), to the sensitivity, Prob(Dx|D), the specificity, 1 - Prob(Dx|NOT D), and the prevalence, Prob(D).
|
In this equation, the denominator is the total number of expected positive findings in the population, while the numerator is the number of positive findings that accompany the actual disease.
Suppose that we had a diagnostic test used for screening that had both 95% sensitivity (positive in 95% of all cases where the disease is present) and 95% specificity (negative in 95% of all cases where the disease is absent). When the prevalence of the disease is 10%, out of every 10,000 screening examinations, we can expect to see 1,400 cases that result in a positive finding with the diagnostic test (Table). Rather than go through the laborious exercise of constructing tables, Bayes rule gives us the PPV directly. For 10% prevalence, Prob(D) = 0.10, we have PPV = (0.95 x 0.10)/(0.95 x 0.10) + (0.05 x 0.90) = 0.095/(0.095 + 0.045) = 0.679.
|
| P VALUES, POWER, AND BAYESIAN STATISTICS |
|---|
|
|
|---|
In a conventional analysis, the probability of significant study results, S, conditional on B being false, Prob(S|NOT B), is known as the power. It is not a factor in the conclusion drawn from the study. Rather, it is the major factor in the design before the study is conducted. It determines the number of patients in the study. The study sample size is chosen to provide the desired chance of successfully showing that B is not true.
Bayesian statistics differs from conventional statistics insofar as it depends on the probability that the hypothesis holds given the observed results of the study or studies. This probability is calculated by means of the Bayes rule. In order to be able to calculate it, the (subjective) probability reflecting the prior (before the study) chance or belief that the hypothesis was true is required. Much of the dispute regarding the use of Bayesian analysis centers around the possibility of conclusions that might be largely dictated by "opinion" before hard data are obtained.
P values, power, and Bayesian analysis will be presented in later articles in this series. Software for Bayesian probabilities may be found at the University of Sheffield Web site at www.shef.ac.uk/~st1ao/1b.html.
| PROBABILITY FOR CONTINUOUS OUTCOMES |
|---|
|
|
|---|
cm to the last decimal place (or any other exact value, even, say, 3 cm to the last decimal place) has to differ in meaning. For continuous measures such as the diameter or volume of a tumor or time of an occurrence, the probability that it exactly equals a single value is analogous to the distance traveled or the radiation received in an instant. Instead, one can express the rate at each value and calculate the probability of any interval just as one calculates the distance traveled over any time interval from the instantaneous speed or the total radiation exposure from the instantaneous rate of exposure.
Happily, all of the rules discussed for discrete outcomes apply equally well to continuous ones, both for intervals and for the rates at specified values.
| DISTRIBUTIONS |
|---|
|
|
|---|
The binomial distribution, B(i|n,p), (4) describes the probability of an event occurring i times out of n tries, where the chance, p, of the event is the same for all tries and the occurrence of the event in one try is unrelated (independent) to its occurrence in any other try. The distribution then gives the probabilities of each possible number of occurrences of the event out of n cases. The specific formula for the probability of i events out of n cases, B(i|n,p), is B(i|n,p) = [n!/i!(n - i)!]pi(1 - p)n - i, where ! indicates factorial, as in n! = n x (n - 1) x (n - 2) x ...x 2 x 1.
In practice, this distribution is built into most statistical and spreadsheet software packages. For instance, by using Microsoft Excel software, B(i|n,p) is calculated by the function BINOMDIST.
A binomial distribution that a radiologist might encounter is the probability of detecting i cancers with screening. In order for the binomial distribution to apply, the sensitivity would have to be identical for all of the cancers. Additionally, the detection (or nondetection) of any one cancer could not influence the chance that another was detected. If both conditions applied and there were n actual cases of cancer among those screened, B(i|n,p) would be the chance that i cancers were detected if the sensitivity of the technique was given by p. Thus, if there were actually eight cancers and the sensitivity of screening was 70%, the chance that exactly six of those eight cancer were detected is B(6|8,0.7) = [8!/6!(8 - 6)!]0.76(1 - 0.7)8 - 6 = 0.296. The full distribution is depicted in Figure 2.
|
The distribution depends on two values or parameters: the mean, µ, and the SD,
. (See the preceding article [7] in this series.) The mean determines the location of the high point of the curve. The SD gives the scale. The height of the curve at any point, x, is determined by the "z score," the difference of x and µ in units of
. That is, the height depends only on z = (x - µ)/
.
Again, the height is found as a function in most spreadsheets and statistical software. With Excel software, it is given by NORMDIST. The distribution is depicted in Figure 3.
|
| FOOTNOTES |
|---|
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
B. Vander Cruyssen, I. E A Hoffman, I. Peene, A. Union, H. Mielants, L. Meheus, and F. De Keyser Prediction models for rheumatoid arthritis during diagnostic investigation: evaluation of combinations of rheumatoid factor, anti-citrullinated protein/peptide antibodies and the human leucocyte antigen-shared epitope Ann Rheum Dis, March 1, 2007; 66(3): 364 - 369. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Hoppe, C. Quattropani, A. Spreng, J. Mattich, P. Netzer, and H.-P. Dinkel Virtual Colon Dissection with CT Colonography Compared with Axial Interpretation and Conventional Colonoscopy: Preliminary Results Am. J. Roentgenol., May 1, 2004; 182(5): 1151 - 1158. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Hoppe, H.-P. Dinkel, B. Walder, G. von Allmen, M. Gugger, and P. Vock Grading Airway Stenosis Down to the Segmental Level Using Virtual Bronchoscopy Chest, February 1, 2004; 125(2): 704 - 711. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Levatter Screening CT and Modern Economics Am. J. Roentgenol., November 1, 2003; 181(5): 1191 - 1193. [Full Text] [PDF] |
||||
![]() |
A. V. Proto Radiology 2003--The Constancy of Change Radiology, January 1, 2003; 226(1): 1 - 4. [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| RADIOLOGY | RADIOGRAPHICS | RSNA JOURNALS ONLINE |