Radiology
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Published online before print July 26, 2002, 10.1148/radiol.2243011626
This Article
Right arrow Abstract Freely available
Right arrow Figures Only
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
2243011626v1
224/3/871    most recent
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Berg, W. A.
Right arrow Articles by Crewson, P. E.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Berg, W. A.
Right arrow Articles by Crewson, P. E.
(Radiology 2002;224:871-880.)
© RSNA, 2002


Breast Imaging

Does Training in the Breast Imaging Reporting and Data System (BI-RADS) Improve Biopsy Recommendations or Feature Analysis Agreement with Experienced Breast Imagers at Mammography?1

Wendie A. Berg, MD, PhD, Carl J. D’Orsi, MD2, Valerie P. Jackson, MD, Lawrence W. Bassett, MD, Craig A. Beam, PhD3, Rebecca S. Lewis, MPH and Philip E. Crewson, PhD4

1 From the Dept of Radiology (W.A.B.) and Greenebaum Cancer Ctr (W.A.B.), Univ of Maryland, 419 W Redwood St, Ste 110, Baltimore, MD 21201; Dept of Radiology, Univ of Massachusetts, Worcester, Mass (C.J.D.); Dept of Radiology, Indiana Univ School of Medicine, Indianapolis (V.P.J.); Dept of Radiological Sciences, UCLA School of Medicine, Los Angeles, Calif (L.W.B.); Dept of Radiology, Medical College of Wisconsin, Milwaukee (C.A.B.); and American College of Radiology, Reston, Va (R.S.L., P.E.C.). From the 2000 RSNA scientific assembly. Received Oct 5, 2001; revision requested Nov 1; revision received Feb 6, 2002; accepted Feb 28. Supported by the Maryland Chapter of the Susan G. Komen Breast Cancer Foundation and the American College of Radiology Technology Assessment Studies Assistance Program. Address correspondence to W.A.B. (e-mail: waberg@umaryland.edu).


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
PURPOSE: To determine whether training in the Breast Imaging Reporting and Data System (BI-RADS) improves observer performance and agreement with the consensus of experienced breast imagers with regard to mammographic feature analysis and final assessment.

MATERIALS AND METHODS: A test set of mammograms was developed, with 54 proven lesions consisting of 28 masses (nine [32%] malignancies) and 26 microcalcifications (10 [38%] malignancies). Three experienced breast imagers reviewed cases independently and by means of consensus. Twenty-three practicing mammogram-interpreting physicians reviewed mammograms before and after a day’s lectures on BI-RADS. Observer performance before and after training was measured by means of agreement ({kappa}) with consensus description and assessments, rate of biopsy of malignant and benign lesions, and areas under receiver operating characteristic (ROC) curves. Performance was also measured for 11 participants 2–3 months after training.

RESULTS: Improved agreement with consensus feature analysis was found for mass margins and/or asymmetries, with a pretraining generalized {kappa} value of 0.36 and a posttraining generalized {kappa} value of 0.41. Similar improvement was seen for description of calcification morphology (pretraining {kappa} value of 0.36 improving to 0.44 after training). No improvement was seen in describing calcification distribution. Final assessments were more consistent after training, with a pretraining {kappa} value of 0.31, as compared with 0.45 after training. The mean biopsy rate for malignant lesions improved from 73% (range, 53%–89%) before training to 88% (range, 74%–100%) after training, with minimal increase in mean biopsy rate of benign lesions (43% [range, 26%–60%] before to 51% [range, 31%–63%] after training), and no net change in area under the ROC curve, as compared with histopathologic findings. For the subset of participants with delayed follow-up, no significant decline in posttraining results was seen.

CONCLUSION: BI-RADS training resulted in improved agreement with the consensus of experienced breast imagers for feature analysis and final assessment. It is important that trainees showed improved rates of recommending biopsy for malignant lesions. This effect was maintained over 2–3 months.

© RSNA, 2002

Index terms: Breast radiography • Diagnostic radiology, observer performance


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
Several studies have demonstrated a wide range in the performance of mammogram-interpreting physicians. In the 1995 study by Elmore et al (1), 10 readers evaluated 150 mammograms, including 27 showing cancer. Immediate work-up was recommended for 74%–96% of women with cancer and 11%–65% of women without cancer. In 1996, Beam et al (2) published results from a broader sample of 108 national radiologists interpreting 79 mammograms (45 showing malignancy) and found work-up rates of 47%–100% for women with cancer and 1%–64% for those without. In both studies, variability in reader performance was due to differences in both lesion detection and interpretation and/or management.

The Breast Imaging Reporting and Data System (BI-RADS) (3) was developed through the American College of Radiology, or ACR, to help standardize feature analysis and final management of mammographic findings. It was also hoped that a large national database of outcomes would ensue that would provide greater rationale and validation for management when certain features or combinations of features were present. Indeed, the National Mammography Database is currently set up through the ACR to receive such information (4).

The purpose of our study was to determine whether training in BI-RADS feature analysis would improve observer agreement with experienced breast imagers in mammographic lesion description and final assessments (management recommendations). We further sought to assess whether such training would improve the biopsy rate of malignant lesions, without loss of specificity.


    MATERIALS AND METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
Study Design
A set of 54 "quiz" cases was developed by one of the authors (W.A.B.) from prospectively nonpalpable lesions representative of the BI-RADS lexicon for mass margins and microcalcification morphology and distribution. The cases consisted of 28 masses or asymmetries, nine (32%) of which were malignant, and 26 lesions manifest as calcifications, 10 (38%) of which were malignant. "Histopathologic truth" was established as benign or malignant findings with biopsy (n = 43), ultrasonographically (US) guided cyst aspiration (n = 1), or at least 5 years of mammographic stability for two lymph nodes; one case each of skin calcifications, milk of calcium calcifications, and secretory calcifications; two scattered punctate calcifications; and three asymmetric densities. Malignancies included seven infiltrating ductal carcinomas, one infiltrating and intraductal carcinoma, one infiltrating lobular carcinoma, and 10 ductal carcinomas in situ.

Photographic slides were prepared that were centered on the area of interest (indicated with arrows when needed), with the entire mammographic image also provided when a broader view was needed (eg, asymmetric densities and diffuse and/or bilateral findings). The University of Maryland institutional review board, with which we consulted, did not require its approval or informed consent for this study, as all patient identifiers were removed from the images and database. Magnification views were provided for 35 cases. When magnification views were not available, collimated compression views were used (n = 11), or the lesion was marked on the slides after photographic magnification (n = 8). Lesions were selected to be pure examples of individual features of the BI-RADS lexicon in that the 28 masses and/or asymmetric densities did not have associated calcifications, and the 26 lesions manifest as calcifications were chosen to be of one main morphologic type, without associated density or mass. The case mix was designed to have at least three lesions from each of the major lesion descriptor categories.

For masses, lesion descriptors supplied included the following: lymph node, circumscribed round or oval, circumscribed lobulated, obscured, microlobulated, indistinct, or spiculated mass; or focal asymmetric density. Shape and margins were combined to simplify the number of variables evaluated. While circumscribed round or oval and circumscribed lobulated descriptors were initially separated, combining them (as "circumscribed") did not substantially alter any agreement or outcome analyses. To the greatest extent possible, descriptors were listed in order of increasing risk of malignancy (3,58). No logical ordering could be inferred for asymmetric density, and it was arbitrarily placed last on the list; participants were so instructed.

For calcification morphology, descriptors included the following: typically benign, punctate, amorphous and/or indistinct, pleomorphic, and fine linear and/or branching. Participants were instructed that the typically benign category included coarse, secretory, oil cyst, milk of calcium, popcorn, vascular, and dermal calcifications. For calcification distribution, descriptors available were the following: diffuse, clustered, regional, multiple groups, linear, and segmental. Participants were instructed that segmental meant one or multiple ducts and corresponding branches. Multiple groups described the distribution of several groupings of similar morphology (3,7). Final assessments were category 2, benign; 3, probably benign; 4, suspicious abnormality; and 5, highly suggestive of malignancy. While BI-RADS category 5 is intended for lesions that may proceed directly to definitive surgery without initial tissue diagnosis, both categories 4 and 5 indicate the need for biopsy, which is the end point of interest (9). Categories 4 and 5 were combined for evaluation of interrater agreement.

Three experienced breast imagers (C.J.D., V.P.J., L.W.B.) participated in the training. Each of these individuals interprets at least 4,000 mammograms per year and had at least 17 years of experience (range, 17–30 years) in mammography at the time this article was written. These experienced imagers were shown the quiz cases on slides the evening before the course (see below) and were asked to independently record their responses on worksheets. The cases were then reviewed and consensus opinion achieved as to both appropriate feature description and final assessment.

Twenty-seven radiologists participated in a day-long course, "Applying BI-RADS to Mammography Practice," in Baltimore, Md, in September 1999. The training and experience of the participants was recorded, including fellowship training in mammography, the number of mammograms reviewed per week, and the number of years in practice. Complete data sets for lesion description were obtained from 23 participants. Audience response keypads (Audience Response Systems, Evanston, Ill) allowed immediate data entry, with questions repeated when incomplete participation occurred. Individual participants were given unique identifiers required for keypad entry. Slides showing the previously mentioned quiz cases were presented, immediately followed by questions where participants were asked to provide a description and a BI-RADS final assessment with its linked recommendation (benign, routine screening; probably benign, 6-month follow-up; suspicious, biopsy; or highly suggestive of malignancy, biopsy) for each lesion. Quiz cases were shown at the start of the day and again after a day’s lectures in BI-RADS. The lectures consisted of 31/2 hours of didactic material, including mammographic and pathologic correlation of individual BI-RADS features for calcifications, masses, and asymmetric densities, and another half-hour of questions and answers. Another 2 hours were devoted to examples, with audience response for both feature analysis using BI-RADS terms and final assessments, with time for discussion. There was no overlap of case material presented in the didactic sessions with that in the quiz. For the quiz, participants were asked to assume that the findings were noted on the baseline screening mammogram and that the patient had no known risk factors for breast cancer. Participants were asked to use the single most worrisome applicable feature descriptor in each category. During the afternoon quiz, after participants recorded their description and final assessment for a particular case and prior to the next case, they were immediately given the consensus opinion of the experienced breast imagers with regard to both feature analysis and final assessment for that case. Twenty-three participants completed data sets on lesion description, and 21 participants completed data sets on management for all 54 lesions before and after training. A subgroup of 11 observers was shown the quiz cases again in random order 2–3 months after completing the training course. Immediate consensus feedback was given after each case in the delayed follow-up evaluation.

Control Group
A group of six senior radiology residents who had not participated in the course were shown the quiz slides on two occasions separated by 1 week, with the consensus of the experienced breast imagers supplied after each case after recording of their impressions on the second occasion. The residents were instructed to not read any mammography-related materials during the intervening week. All residents had completed at least 1 month of training in mammography.

Measures of Performance
Observer performance was considered in terms of agreement with the consensus opinion and, separately, agreement with histopathologic truth. A final assessment of BI-RADS category 4, suspicious abnormality, was considered as agreement with an assessment of category 5, highly suggestive of malignancy. {kappa} values were used as measures of agreement between each participant and the consensus beyond that expected by chance (10,11). Fleiss (11) has suggested that {kappa} > 0.75 indicates excellent agreement; 0.40 <= {kappa} <= 0.75, fair to good agreement; and {kappa} < 0.40, poor agreement. Multireader generalized {kappa} values (10,12) were used to describe agreement among participant and expert readers for each feature descriptor. This statistic measures the degree to which interpretation variability arises from differences between cases relative to differences among readers interpreting the same case. It is analogous to the intraclass correlation used in assessment of laboratory measurements. Positive values indicate that interpretation variability is governed more by case-related differences than by between-reader disagreements. Negative values indicate the opposite. A zero value occurs whenever the two sources of variability are equal. We report generalized {kappa} values from the individual consensus ratings as an agreement benchmark.

Accuracy was estimated by calculating sensitivity, false-positive rate (1 - specificity), and area under the receiver operating characteristic (ROC) curve (Az) (13). Sensitivity for malignancy was calculated as the proportion of malignancies given final assessments of BI-RADS category 4, suspicious abnormality, or category 5, highly suggestive of malignancy. Similarly, benign lesions so categorized were considered false-positive findings. A separate analysis considered the consensus assessment as the reference standard: In addition to known malignancies, some benign lesions are appropriately recommended for biopsy. Az in the latter analysis enabled measurement of concordance between reader classification and expert consensus.

Training effectiveness was measured by means of the paired difference between post- and pretraining {kappa} values (individual reader vs consensus), sensitivity, false-positive rates, and Az value. To identify statistically significant improvements in agreement ({kappa}), sensitivity, false-positive rate, and Az value, lower 95% CIs were used, analogous to a one-tailed hypothesis test with a 5% significance level. If the lower CI bounds contained zero, we concluded that there was no statistically significant improvement. If the lower CI bounds did not contain zero, we concluded that there was a statistically significant improvement. The latter is reported as a P value less than or equal to .05. Multiple regression analysis was used to model the effect of individual reader characteristics on (a) pretraining Az value and (b) relative change in Az value after training. For the regression model variables, a two-tailed 95% CI was used, since there was no a priori hypothesis regarding the direction of the parametric relationships. Performance measures were assessed as a function of experience in breast imaging.

Lower bounds and two-tailed CIs of paired differences (before vs after training) were estimated by using jackknifed standard errors to provide the appropriate inference given the correlated nature of the data. Caution is recommended in relying on estimates of statistical significance and use of CIs with these data. Our distribution of images is not representative of that in clinical practice. The lesions used in the current study include an enriched malignant case mix along with a deliberate selection of lesions close to the threshold for intervention. In addition to limited generalizability, this lesion selection can be expected to result in greater variance in assessments among both lesions and study participants. This is especially true for {kappa} value estimates, since both paired and generalized {kappa} values can vary dramatically, depending on the distribution of participant interpretations. All analyses were conducted by using SAS software (SAS Institute, Cary, NC).


    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
Feature Analysis
Three experienced readers (C.J.D., V.P.J., L.W.B.) evaluated the 54 quiz cases independently prior to arriving at the consensus descriptions summarized in Table 1. All three experienced imagers agreed on the margins of 14 (50%) of 28 masses, final assessment of 16 (57%) of 28 masses, 12 (46%) of 26 microcalcification morphologies, 15 (58%) of 26 microcalcification distributions, and 14 (54%) of 26 microcalcification final assessments. This resulted in a need for consensus overall for 63 (47%) of 134 descriptions or final assessments with initial disagreement.


View this table:
[in this window]
[in a new window]

 
TABLE 1. Lesion Descriptors, Malignancy Rates, and Prestudy Agreement among Experienced Breast Imagers for Lesion Classification and Assessment

 
Agreement among the experienced readers resulted in a generalized {kappa} value of 0.57 (range, 0.43–0.66) for mass margins and asymmetries. The least agreement was seen for use of the term obscured, although the {kappa} value was still 0.43, indicating fair to good agreement. For participants before training, the generalized {kappa} value for all mass descriptors was 0.36 (range, 0.07–0.61), with the least agreement seen for lymph node ({kappa} = 0.07; Table 2) and indistinct masses ({kappa} = 0.24), which were more likely to be considered focal asymmetric densities by another reader. After 1 day of training, the mean generalized {kappa} value improved to 0.41 (range, 0.06–0.59) (mean change, 0.05), with the least agreement seen for lymph node masses ({kappa} = 0.06). Focal asymmetric density showed the greatest improvement in agreement with training. Of the 23 participants evaluated, 12 (52%) showed improvement after training in agreement with consensus description of mass margins and asymmetries; six (26%) showed no change; and five (22%) showed less agreement with the consensus after training.


View this table:
[in this window]
[in a new window]

 
TABLE 2. Agreement (generalized {kappa} value) among 23 Participants before and after BI-RADS Instruction

 
For microcalcification morphology, the overall generalized {kappa} value among the experienced breast imagers was 0.44 (range, 0.06–0.75 for individual descriptors; Table 1). The greatest disagreement among experienced breast imagers was seen for punctate morphology ({kappa} = 0.06); such calcifications were more likely described as pleomorphic or amorphous by another experienced breast imager. The generalized {kappa} value for participants for description of microcalcification morphology prior to training was 0.36 (range, 0.17–0.65), with the greatest disagreement for pleomorphic calcifications ({kappa} = 0.17, Table 2). After training, the generalized {kappa} value for participants improved to 0.44 (range, 0.31–0.60; mean change, 0.08), and the term pleomorphic remained the most variably used ({kappa} = 0.31). With training, 11 (48%) of the 23 participants showed improved agreement with the consensus on description of calcification morphology, nine (39%) showed no change, and three (13%) showed less agreement.

The generalized {kappa} value for experienced breast imagers for all categories of microcalcification distribution was 0.44 (range, -0.03 to 0.62). Use of descriptors diffuse and/or scattered and multiple groups showed more disagreement than expected by chance (Table 1). Prior to training, the generalized {kappa} value for participants was 0.35 (range, 0.10–0.50), with segmental and regional showing poor agreement with the consensus ({kappa} = 0.10 and 0.11, respectively; Table 2). With training, the generalized {kappa} value improved to 0.36 (range, 0.17–0.52; change, 0.01), and descriptors segmental and regional remained the most problematic ({kappa} = 0.17 for each).

The generalized {kappa} value for the three experienced breast imagers for final assessment BI-RADS category 2 (benign) was 0.51; for category 3 (probably benign), 0.04; and for categories 4 and 5 combined (suspicious abnormality and highly suggestive of malignancy, respectively), 0.50. Participants showed improved agreement with the consensus after training in each of the final assessment groupings (Table 2). The overall generalized {kappa} value improved from 0.31 before training to 0.45 after training (change, 0.15). With training, 15 (65%) of the 23 (65%) participants showed improved agreement; five (22%), no change; and three (13%), decreased agreement with the consensus.

Accuracy Measures
While individual experienced imagers recommended biopsy for 15–17 (79%– 89%) of the 19 cancers in the quiz set, the group consensus was to perform biopsy on all malignancies (sensitivity, 100%). Similarly, individual experienced imagers recommended biopsy for 10–18 (29%–51%) of the 35 benign lesions, and the consensus was to perform biopsy in 15 (43%) of the 35.

When using the consensus assessments as the reference standard, we found significant improvement in the biopsy rate of malignant lesions after training (Table 3). Prior to training, for the group of 21 participants with complete data sets both before and after training, biopsy rates for malignant lesions were 50%–91% (mean, 71%; median, 71%). After training, biopsy rates for malignant lesions improved to 71%–97% (mean, 86%; median, 88%) (P < .05). Improvement was 3%–32% (mean, 16%; median, 14%). Prior to training, participants’ biopsy rates of lesions judged benign or probably benign (false-positive rate) by means of consensus of the experienced imagers were 10%–45% (mean, 25%; median, 25%). It is important to note that after training, no significant change was seen in biopsy rates for benign lesions, which were 10%–40% (mean, 26%; median, 25%) after training, with a mean change of 0.01. Az values showed significant improvement, with a mean change of 0.04 (P < .05) (Fig 1, Table 3), as compared with the consensus reference standard. Of the 21 participants evaluable, 17 (81%) showed improvement, and two (10%) showed no change. The two (10%) observers who showed a decrease in Az value were the two with the highest initial pretraining Az values. By using histopathologic truth as the reference standard, a significant increase was seen in both sensitivity and biopsy rate of benign lesions (false-positive rate), with no significant change in Az value (Table 3). In this analysis, participants remained clustered around an Az value of 0.71 after training, which is less than the prestudy experienced breast imager consensus Az value of 0.81.


View this table:
[in this window]
[in a new window]

 
TABLE 3. Mean Reader Accuracy before and after BI-RADS Training for 21 Participants Evaluating 54 Lesions (including 19 malignancies)

 


View larger version (37K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 1. Graph shows Az values for each participant ({bullet}), as compared with the consensus reference standard, before and after 1 day of BI-RADS training. The line is one of equivalence. Note that two participants performed worse after training, and two showed no change. The remaining 17 showed at least some improvement.

 
Multiple regression analysis was performed to evaluate the effect of participant characteristics on pretraining Az value and on relative change in Az value after training. Seven participants had received (n = 4) or were receiving (n = 3) fellowship training in mammography; one was in a body imaging fellowship, and 13 were practicing qualified interpreting physicians under the Mammography Quality Standards Act, with an average of 12 years of practice each (range, 1–33 years). Six participants had estimated cumulative experience in reviewing fewer than 10,000 mammograms (range, 1,800–104,000). Pretraining Az value did not correlate with fellowship training in mammography, professional status (resident, fellow, or practicing radiologist), total number of mammograms read, or number of mammographic examinations interpreted per week. Relative improvement in Az value after training negatively correlated with pretraining Az (P < .05); that is, those participants with better initial performance benefitted less from training.

Training Retention
When a subgroup of 11 participants was shown the quiz cases again in random order 2–3 months after training, improvement in {kappa} value, as compared with pretraining values, was retained for nine (82%) participants with regard to mass and/or asymmetry description. The {kappa} value was 0.53 (as compared with the subgroup’s pretraining {kappa} value of 0.35 and immediate posttraining {kappa} value of 0.43, P = not significant; Table 4). Improved {kappa} values were maintained for eight (73%) of the 11 participants with regard to calcification morphology, as compared with pretraining values. Less success was maintained for calcification distribution, with four (36%) of the 11 observers showing persistent improved agreement with the consensus. For final assessments, nine (82%) of the 11 participants retained significant improvement, as compared with pretraining values, although all but two participants showed slight decreases in {kappa} values, as compared with immediate posttraining values. For the subset of participants with delayed follow-up, the mean {kappa} value for final assessment was 0.37 before training, 0.53 immediately after training, and 0.49 at delayed follow-up (Table 4). The changes in mean {kappa} value at delayed follow-up, as compared with those immediately after training, were not significant for any category. These 11 readers showed increases in agreement immediately after training for mass margins and/or asymmetries, microcalcification morphology, and final assessment, and the results shown in Table 4 suggest that this improvement was retained.


View this table:
[in this window]
[in a new window]

 
TABLE 4. Mean Changes in Agreement ({kappa} values) between 11 Readers and Experienced Breast Imager Consensus

 
Improvements in accuracy observed immediately after training were generally maintained at delayed follow-up (Table 5). Of the 11 readers at delayed follow-up, four (36%) saw small declines in their sensitivity, as compared with either the consensus or the histopathologic reference standard, immediately after training, although for all participants, sensitivity remained above pretraining levels. For these same 11 readers, biopsy rates for lesions considered benign by means of consensus or histopathologic standards (false-positive rate) showed a trend toward increase at delayed follow-up, as compared with immediate posttraining levels.


View this table:
[in this window]
[in a new window]

 
TABLE 5. Mean Change in Accuracy for 11 Participants with Delayed Follow-up 2-3 Months after BI-RADS Training

 
Control Group
Agreement with the consensus of experienced imagers for the residents who did not participate in the training course showed no significant difference from the first to the second reading for lesion descriptions, including mass margins, calcification morphology and distribution, and final assessment. While mean sensitivity improved from 0.57 at the first reading to 0.76 at the second reading, the difference was not significant. All but one resident showed improvement, with sensitivity differences of -0.09, 0.12, 0.18, 0.20, 0.30, and 0.38 from the first to the second reading. The mean false-positive rate increased from 0.16 to 0.23. The trend toward improvement suggests that immediate feedback may be the source of improvement seen with course participants.

Cases with Greatest Variance
Calcifications proved more problematic than masses in achievement of consistency in management. The experienced breast imagers all recommended biopsy for all nine malignant masses, whereas at least one experienced imager rated four (40%) of the 10 malignant calcifications as benign or probably benign. When rank ordered according to variance across participants after training, of the 20 cases with the least variance, only six (30%) were calcifications, whereas 15 (75%) of the 20 cases with the greatest variance were calcifications. Three cancers were among the 20 with the greatest variance, including two clusters of pleomorphic calcifications (Figs 2, 3) and one obscured mass (Fig 4). There were two cases with no variance (spiculated masses) prior to training. After training, there were 13 cases with no variance (four spiculated masses, three indistinctly marginated masses, one obscured and one microlobulated mass each, two pleomorphic calcifications, and two fine linear and/or branching calcifications). Five of 10 cases with greatest variance after training were typically benign calcifications, according to the consensus of the experienced imagers. Participants had particular difficulty recognizing secretory calcifications as benign.



View larger version (158K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 2. Lateral true spot-magnification mammogram shows new pleomorphic calcifications in multiple groups (arrows) due to intermediate-grade ductal carcinoma in situ (micropapillary type) in a 50-year-old woman. The experienced breast imagers did not agree on description or management of this case, which was considered typically benign by two of them and pleomorphic by a third. After discussion, the consensus was that the calcifications were not uniformly round but rather pleomorphic and thus suggestive of malignancy. This case showed the highest variance in management of all malignancies after training, with 81% of participants describing these as typically benign or punctate and probably benign before training, and only 30% describing them as pleomorphic and suspicious even after 1 day of BI-RADS training.

 


View larger version (114K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 3a. Circumscribed mass due to fibroadenoma and pleomorphic calcifications due to ductal carcinoma in situ in a 71-year-old woman. (a) Craniocaudal mammogram shows multiple circumscribed masses, one of which has popcorn calcification (arrow) typical of fibroadenoma. The circumscribed mass (*) was consistently described as circumscribed and benign or as probably benign. (b) Four-fold photographic magnification of mass (*) and adjacent calcifications (arrow). A majority of participants and all experienced breast imagers recognized the mass as circumscribed or partially obscured and as benign or probably benign (55% before and 74% after training). The calcifications and mass were considered separately. The calcifications were variably described as punctate and probably benign or as pleomorphic and suggestive of malignancy, even with training.

 


View larger version (149K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 3b. Circumscribed mass due to fibroadenoma and pleomorphic calcifications due to ductal carcinoma in situ in a 71-year-old woman. (a) Craniocaudal mammogram shows multiple circumscribed masses, one of which has popcorn calcification (arrow) typical of fibroadenoma. The circumscribed mass (*) was consistently described as circumscribed and benign or as probably benign. (b) Four-fold photographic magnification of mass (*) and adjacent calcifications (arrow). A majority of participants and all experienced breast imagers recognized the mass as circumscribed or partially obscured and as benign or probably benign (55% before and 74% after training). The calcifications and mass were considered separately. The calcifications were variably described as punctate and probably benign or as pleomorphic and suggestive of malignancy, even with training.

 


View larger version (124K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 4a. Obscured mass due to infiltrating ductal carcinoma and branching and/or fine linear calcifications due to infiltrating ductal carcinoma and ductal carcinoma in situ in a 71-year-old woman. (a) Craniocaudal routine mammogram shows partially obscured mass (arrow) and several groupings of calcifications (arrowheads). (b) Mediolateral oblique spot-compression mammographic view of the mass (arrow) again shows partially obscured or indistinct margins. Since the participants were told to assume that all masses were solid, biopsy was the appropriate recommendation, made prospectively by two of the three experienced imagers. Initially, 68% of participants considered this benign or probably benign. With training, 64% recommended biopsy. (c) Craniocaudal spot-magnification mammogram of the outer right breast better demonstrates the cluster of branching and/or fine linear calcifications (arrow) suggestive of malignancy. The mass and calcifications were considered separate lesions, although participants were aware that the lesions were in the same breast. All participants initially recognized the morphology as either pleomorphic or branching and/or fine linear. In spite of this, 17% of participants initially classified the calcifications as benign or probably benign. After training, all participants recommended biopsy. The more posterior group of hazy calcifications (unlabeled) to the left of those marked with an arrow was due to ductal carcinoma in situ and was not part of the quiz.

 


View larger version (127K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 4b. Obscured mass due to infiltrating ductal carcinoma and branching and/or fine linear calcifications due to infiltrating ductal carcinoma and ductal carcinoma in situ in a 71-year-old woman. (a) Craniocaudal routine mammogram shows partially obscured mass (arrow) and several groupings of calcifications (arrowheads). (b) Mediolateral oblique spot-compression mammographic view of the mass (arrow) again shows partially obscured or indistinct margins. Since the participants were told to assume that all masses were solid, biopsy was the appropriate recommendation, made prospectively by two of the three experienced imagers. Initially, 68% of participants considered this benign or probably benign. With training, 64% recommended biopsy. (c) Craniocaudal spot-magnification mammogram of the outer right breast better demonstrates the cluster of branching and/or fine linear calcifications (arrow) suggestive of malignancy. The mass and calcifications were considered separate lesions, although participants were aware that the lesions were in the same breast. All participants initially recognized the morphology as either pleomorphic or branching and/or fine linear. In spite of this, 17% of participants initially classified the calcifications as benign or probably benign. After training, all participants recommended biopsy. The more posterior group of hazy calcifications (unlabeled) to the left of those marked with an arrow was due to ductal carcinoma in situ and was not part of the quiz.

 


View larger version (171K):
[in this window]
[in a new window]
[Download PPT slide]
 
Figure 4c. Obscured mass due to infiltrating ductal carcinoma and branching and/or fine linear calcifications due to infiltrating ductal carcinoma and ductal carcinoma in situ in a 71-year-old woman. (a) Craniocaudal routine mammogram shows partially obscured mass (arrow) and several groupings of calcifications (arrowheads). (b) Mediolateral oblique spot-compression mammographic view of the mass (arrow) again shows partially obscured or indistinct margins. Since the participants were told to assume that all masses were solid, biopsy was the appropriate recommendation, made prospectively by two of the three experienced imagers. Initially, 68% of participants considered this benign or probably benign. With training, 64% recommended biopsy. (c) Craniocaudal spot-magnification mammogram of the outer right breast better demonstrates the cluster of branching and/or fine linear calcifications (arrow) suggestive of malignancy. The mass and calcifications were considered separate lesions, although participants were aware that the lesions were in the same breast. All participants initially recognized the morphology as either pleomorphic or branching and/or fine linear. In spite of this, 17% of participants initially classified the calcifications as benign or probably benign. After training, all participants recommended biopsy. The more posterior group of hazy calcifications (unlabeled) to the left of those marked with an arrow was due to ductal carcinoma in situ and was not part of the quiz.

 

    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
The missed diagnosis of breast cancer at mammography is multifactorial. In practice, an average of 78% of breast cancers are detected at mammographic screening (14,15). Approximately half of missed breast cancers are mammographically visible in retrospect (1618). The reader may fail to detect an abnormality. Once one is seen, the reader may choose to dismiss it as a normal finding or may misclassify it as benign or probably benign. Double reading (19) and computer-assisted detection methods have been demonstrated to improve performance by improving cancer detection (20), although detecting subtle spiculated masses remains problematic (21), and normal variant asymmetric densities continue to be a source of false-positive results.

Linver et al (22) have shown that dedicated training in mammography through continuing medical education courses improves the performance of radiologists both by increasing sensitivity and by decreasing the size of cancers detected, without a loss of specificity. Pure experience in reviewing mammograms has been shown to contribute to improved performance, with the best accuracies seen after cumulative experience of reviewing 10,000 mammograms (23). Kan et al (24) have shown improved reader performance when a minimum of 2,500 mammograms are interpreted per year. Ciatto et al (25), in a study of 117 Italian radiologists, showed increasing accuracy in a mammographic proficiency test correlated with increasing number of years in practice, number of mammograms reviewed, and total number of mammograms reviewed per year. Indeed, many European countries require that 5,000 mammograms be interpreted annually to maintain qualifications as a screening reader (26). We did not demonstrate significant differences in Az values with varying experience levels, although we showed that readers performing worse initially were more likely to benefit from training.

Improvement of reader classification of mammographic findings has received relatively little attention. D’Orsi et al (27) have demonstrated that mammographic interpretation was more accurate when features were entered into a computer and the risk of malignancy was provided to the reader. Recently, Chan et al (28) showed improved Az values when lesion features were entered into artificial neural networks and compared with a database of known benign and malignant lesions, with an output risk of malignancy provided to the radiologist. Performance is further enhanced when patient risk factors are also considered (29,30).

Wherever possible, we listed descriptors in order of increasing risk of malignancy. This parallels the presentation of the terminology in the third edition of the BI-RADS lexicon (3). This may have introduced bias in linking final assessments with feature analysis. We were trying to teach the relationship between the two, and such bias, if any, was considered desirable. It is not clear that this actually biased the results. For example, we were surprised to note that 15% of calcifications described as "coarse, typically benign" were still assessed as suspicious or highly suggestive of malignancy, with biopsy recommended.

Use of the BI-RADS lexicon in practice has not yet, to our knowledge, been optimized. Until 1998, with publication of the Illustrated BI-RADS (3), little illustration was available for the average practitioner to master the use of standardized terms for feature analysis. Baker et al (31) had five radiologists, who came from the same practice and trained in the same program, apply the BI-RADS terminology to 60 proven cases. {kappa} values were used as measures of agreement, with {kappa} values of 0.63 for mass margins, 0.50 for microcalcification morphology, and 0.77 for microcalcification distribution (31). Berg et al (32) evaluated the use of BI-RADS by five separately trained radiologists reviewing 103 proven cases and determined {kappa} values of 0.40 for mass margins, 0.36 for microcalcification morphology, 0.47 for microcalcification distribution, and 0.37 for final assessment. Particular terms were more variably used, especially obscured ({kappa} = 0.1), amorphous ({kappa} = 0.25), and diffuse ({kappa} = 0.08) descriptors and the probably benign and suspicious assessment categories ({kappa} = 0.17 and 0.16, respectively) (32). This suggests that lesions near the threshold for biopsy were the most problematic and intuitively likely accounted for the greatest variability in actual practice. Indeed, D’Orsi and Swets (33), in analyzing the results of Elmore et al (1), have shown that the 10 radiologists were in fact operating on the same ROC curve but used different thresholds for intervention. On the basis of these data, our case mix deliberately included lesions close to the threshold for intervention (ie, punctate, amorphous, and pleomorphic calcifications, microlobulated and indistinct masses, and focal asymmetric densities), to attempt to clarify the areas of greatest confusion in practice.

As has been observed previously (32), we found lower pretraining {kappa} values for such intermediately suspicious features than for those more typically benign (ie, circumscribed) or highly suspicious (ie, spiculated masses and linear or branching calcifications). It is important to note that we were able to improve the consistency of lesion description for most of these intermediate-suspicion features, including microlobulated and indistinct mass margins, focal asymmetric densities, and amorphous and pleomorphic calcifications.

Punctate (round, <0.5-mm) calcifications remain problematic and were, indeed, problematic for our experienced breast imagers to consistently describe, with a generalized {kappa} value of only 0.06 for the experts. While punctate calcifications have been variably included in the probably benign classification (5), it is clear that even punctate calcifications in a linear or segmental distribution are suggestive of cancer. Indeed, in an abstract by Arnoldus and Berg (34), of 11 foci of punctate calcifications on which biopsy was to be performed because of suspicious distribution or increase or because of a strong family history of breast cancer, three (27%) proved malignant. Similarly, in the study by Liberman et al (8), one (11%) of nine clustered punctate calcifications on which biopsy was to be performed proved malignant. While training participants to consistently describe and manage punctate calcifications may remain problematic, a majority of isolated, scattered, and diffuse bilateral, punctate, and round calcifications are indeed benign and can be dismissed (35).

What may be more problematic is inconsistent recognition of a segmental (duct and its branches) distribution of calcifications. Liberman et al (8) found that 17 (74%) of 23 such calcifications on which biopsy was to be performed were malignant, similar to the three (75%) of four malignancy rate of these lesions in this study’s case mix. While our experienced breast imagers were consistent in their use of the term segmental ({kappa} = 0.62), participants were not, with a pretraining {kappa} value of 0.10 and a posttraining {kappa} value of 0.17. We found that participants tended to consider secretory calcifications to be linear and branching in a segmental distribution and to be suspicious, although they were instructed to consider them to be typically benign. Training in calcification distribution was less successful than in the other features analyzed.

We demonstrated an improvement in lesion management that was maintained at delayed follow-up. It is important to note that the sensitivity of participants improved without a significant increase in false-positive results, as measured against the consensus of the experienced breast imagers. Indeed, a majority of participants achieved performance similar to that of the experts after training.

Whether or not such a quiz set of cases can be used to accurately measure performance in practice has been questioned. Rutter and Taplin (36) recently found that actual clinical practice exceeded test performance. Inclusion of a large number of malignancies in the test set introduces "context bias" that tends to increase sensitivity and decrease specificity (37). Nevertheless, we were most concerned with demonstrating that we could achieve improved agreement with consensus of experienced breast imagers rather than with any absolute measure of performance. This was accomplished.

In summary, training in BI-RADS feature analysis and assessment resulted in improved consistency in lesion description, particularly for mass margins and calcification morphology, but in less improvement for calcification distribution; with this training, participants uniformly improved in their recognition of suspicious features and recommended biopsy on more cancers. The benefits of such training were retained after 2–3 months in a subgroup of participants.


    ACKNOWLEDGMENTS
 
The authors gratefully acknowledge the generous support of time and effort of Carla Morrisey of the American College of Radiology and Barbara Myers of the University of Maryland, University Imaging Center. Without the efforts of these individuals, this work could not have been performed.


    FOOTNOTES
 
2 Current address: Department of Radiology, Emory University School of Medicine, Atlanta, Ga. Back

3 Current address: Moffitt Cancer Center, University of South Florida, Tampa. Back

4 Current address: Health Services Research and Development Service, Department of Veterans Affairs, Washington, DC. Back

Abbreviations: Az = area under the ROC curve, BI-RADS = Breast Imaging Reporting and Data System, ROC = receiver operating characteristic

Author contributions: Guarantor of integrity of entire study, W.A.B.; study concepts, W.A.B., C.J.D., P.E.C.; study design, W.A.B., P.E.C.; literature research, W.A.B.; clinical studies, W.A.B., C.J.D., V.P.J., L.W.B.; data acquisition, W.A.B.; data analysis/interpretation, W.A.B., R.S.L., P.E.C., C.A.B.; statistical analysis, R.S.L., P.E.C., C.A.B.; manuscript preparation and definition of intellectual content, W.A.B.; manuscript editing, revision/review, and final version approval, all authors.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 

  1. Elmore JG, Wells CK, Lee CH, Howard DH, Feinstein AR. Variability in radiologists’ interpretations of mammograms. N Engl J Med 1995; 331:1493-1499.[Abstract/Free Full Text]
  2. Beam CA, Layde PM, Sullivan DC. Variability in the interpretation of screening mammograms by US radiologists: findings from a national sample. Arch Intern Med 1996; 156:209-213.[Abstract]
  3. American College of Radiology. Illustrated Breast Imaging Reporting and Data System (BI-RADS) 3rd ed. Reston, Va: American College of Radiology, 1998.
  4. Osuch JR, Anthony M, Bassett LW, et al. A proposal for a national mammography database: content, purpose, and value. AJR Am J Roentgenol 1995; 164:1329-1334.[Abstract/Free Full Text]
  5. Sickles EA. Periodic mammographic follow-up of probably benign lesions: results in 3184 consecutive cases. Radiology 1991; 179:463-468.[Abstract/Free Full Text]
  6. Sickles EA. Nonpalpable, circumscribed, noncalcified solid breast masses: likelihood of malignancy based on lesion size and age of patient. Radiology 1994; 192:439-442.[Abstract/Free Full Text]
  7. D’Orsi CJ, Kopans DB. Mammographic feature analysis. Semin Roentgenol 1993; 28:204-230.[CrossRef][Medline]
  8. Liberman L, Abramson AF, Squires CB, Glassman JR, Morris EA, Dershaw DD. The Breast Imaging Reporting and Data System: positive predictive value of mammographic features and final assessment categories. AJR Am J Roentgenol 1998; 171:35-40.[Abstract/Free Full Text]
  9. Kopans DB. The accuracy of mammographic interpretation. N Engl J Med 1994; 331:1521-1522.[Free Full Text]
  10. Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics 1977; 33:159-174.[CrossRef][Medline]
  11. Fleiss J. Statistical methods for rates and proportions New York, NY: Wiley, 1981.
  12. Fleiss J. Measuring the nominal scale agreement among many raters. Psych Bull 1971; 76:378-382.[CrossRef]
  13. Metz CE. ROC methodology in radiologic imaging. Invest Radiol 1986; 21:720-733.[Medline]
  14. Kerlikowske K, Carney PA, Geller B, et al. Performance of screening mammography among women with and without a first-degree relative with breast cancer. Ann Intern Med 2000; 133:855-863.[Abstract/Free Full Text]
  15. Kavanagh AM, Giles GG, Mitchell H, Cawson JN. The sensitivity, specificity, and positive predictive value of screening mammography and symptomatic status. J Med Screen 2000; 7:105-110.[Abstract/Free Full Text]
  16. Bird RE, Wallace TW, Yankaskas BC. Analysis of cancers missed at screening mammography. Radiology 1992; 184:613- 617.[Abstract/Free Full Text]
  17. Harvey JA, Fajardo LL, Innis CA. Previous mammograms in patients with impalpable breast carcinoma: retrospective vs blinded interpretation. 1993 ARRS President’s Award. AJR Am J Roentgenol 1993; 161:1167-1172.
  18. Ciatto S, Rosselli Del Turco M, Zappa M. The detectability of breast cancer by screening mammography. Br J Cancer 1995; 71:337-339.[Medline]
  19. Thurfjell E, Lernevall K, Taube A. Benefit of independent double reading in a population-based mammography screening program. Radiology 1994; 191:241-244.[Abstract/Free Full Text]
  20. Warren Burhenne LJ, Wood SA, D’Orsi CJ, et al. Potential contribution of computer-aided detection to the sensitivity of screening mammography. Radiology 2000; 215:554-562.[Abstract/Free Full Text]
  21. Birdwell RL, Ikeda DM, O’Shaughnessy KF, Sickles EA. Mammographic characteristics of 115 missed cancers later detected with screening mammography and the potential utility of computer-aided detection. Radiology 2001; 219:192-202.[Abstract/Free Full Text]
  22. Linver MN, Paster SB, Rosenberg RD, Key CR, Stidley CA, King WV. Improvement in mammography interpretation skills in a community radiology practice after dedicated teaching courses: 2-year medical audit of 38,633 cases. Radiology 1992; 184:39-43.[Abstract/Free Full Text]
  23. Nodine CF, Kundel HL, Mello-Thoms C, et al. How experience and training influence mammography expertise. Acad Radiol 1999; 6:575-585.[CrossRef][Medline]
  24. Kan L, Olivotto IA, Warren Burhenne LJ, Sickles EA, Coldman AJ. Standardized abnormal interpretation and cancer detection ratios to assess reading volume and reader performance in a breast screening program. Radiology 2000; 215:563-567.[Abstract/Free Full Text]
  25. Ciatto S, Ambrogetti D, Catarzi S, Morrone D, Rosselli Del Turco M. Proficiency test for screening mammography: results for 117 volunteer Italian radiologists. J Med Screen 1999; 6:149-151.[Abstract/Free Full Text]
  26. Burhenne L, Smith RA, Tabar L, Dean PB, Perry N, Sickles EA. Mammographic screening: international perspective. Semin Roentgenol 2001; 36:187-194.[CrossRef][Medline]
  27. D’Orsi CJ, Getty DJ, Swets JA, Pickett RM, Seltzer SE, McNeil BJ. Reading and decision aids for improved accuracy and standardization of mammographic diagnosis. Radiology 1992; 184:619-622.[Abstract/Free Full Text]
  28. Chan HP, Sahiner B, Helvie MA, et al. Improvement of radiologists’ characterization of mammographic masses by using computer-aided diagnosis: an ROC study. Radiology 1999; 212:817-827.[Abstract/Free Full Text]
  29. Baker JA, Kornguth PJ, Lo JY, Williford ME, Floyd CE, Jr. Breast cancer: prediction with artificial neural network based on BI-RADS standardized lexicon. Radiology 1995; 196:817-822.[Abstract/Free Full Text]
  30. Lo JY, Baker JA, Kornguth PJ, Floyd CE, Jr. Computer-aided diagnosis of breast cancer: artificial neural network approach for optimized merging of mammographic features. Acad Radiol 1995; 2:841-850.[CrossRef][Medline]
  31. Baker JA, Kornguth PJ, Floyd CE, Jr. Breast imaging Reporting and Data System standardized mammography lexicon: observer variability in lesion description. AJR Am J Roentgenol 1996; 166:773-778.[Abstract/Free Full Text]
  32. Berg WA, Campassi C, Langenberg P, Sexton MJ. Breast Imaging Reporting and Data System: inter- and intraobserver variability in feature analysis and final assessment. AJR Am J Roentgenol 2000; 174:1769-1777.[Abstract/Free Full Text]
  33. D’Orsi CJ, Swets JA. Variability in the interpretation of mammograms (letter). N Engl J Med 1995; 332:1172.
  34. Arnoldus C, Berg W. Biopsy of selected punctate calcifications: outcome analysis (abstr). AJR Am J Roentgenol 2000; 174(S):65.
  35. Sickles E. Breast calcifications: mammographic evaluation. Radiology 1986; 160:289-293.[Abstract/Free Full Text]
  36. Rutter C, Taplin S. Assessing mammographers’ accuracy: a comparison of clinical and test performance. J Clin Epidemiol 2000; 53:443-450.[CrossRef][Medline]
  37. Egglin T, Feinstein A. Context bias: a problem in diagnostic radiology. JAMA 1996; 276:1752-1755.[Abstract]



This article has been cited by other articles:


Home page
Am. J. Roentgenol.Home page
C. A. Coursey, C. L. Hollingsworth, A. M. Gaca, C. Maxfield, D. DeLong, and G. Bisset III
Radiologists' Agreement When Using a 10-Point Scale to Report Abdominal Radiographic Findings of Necrotizing Enterocolitis in Neonates and Infants
Am. J. Roentgenol., July 1, 2008; 191(1): 190 - 197.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Roentgenol.Home page
E.-K. Kim, K. H. Ko, K. K. Oh, J. Y. Kwak, J. K. You, M. J. Kim, and B.-W. Park
Clinical Application of the BI-RADS Final Assessment to Breast Sonography in Conjunction with Mammography
Am. J. Roentgenol., May 1, 2008; 190(5): 1209 - 1215.
[Abstract] [Full Text] [PDF]


Home page
Jpn J Clin OncolHome page
A. Suzuki, Y. Nakamoto, T. Terauchi, M. Kawamoto, Y. Okumura, Y. Suzuki, T. Sato, N. Takahashi, J. Lee, M. Senda, et al.
Inter-observer Variations in FDG-PET Interpretation for Cancer Screening
Jpn. J. Clin. Oncol., August 18, 2007; (2007) hym064v1.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Roentgenol.Home page
G. Schueller, E. Kaindl, W. K. Matzek, F. Semturs, C. Schueller-Weidekamm, and T. H. Helbich
Image Quality of a Wet Laser Printer Versus a Paper Printer for Full-Field Digital Mammograms
Am. J. Roentgenol., January 1, 2006; 186(1): 38 - 43.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Roentgenol.Home page
P. E. Crewson
Reader Agreement Studies
Am. J. Roentgenol., May 1, 2005; 184(5): 1391 - 1397.
[Full Text] [PDF]


Home page
RadiologyHome page
D. Gur, L. P. Wallace, A. H. Klym, L. A. Hardesty, G. S. Abrams, R. Shah, and J. H. Sumkin
Trends in Recall, Biopsy, and Positive Biopsy Rates for Screening Mammography in an Academic Practice
Radiology, May 1, 2005; 235(2): 396 - 401.
[Abstract] [Full Text] [PDF]


Home page
CA Cancer J ClinHome page
D. Saslow, J. Hannan, J. Osuch, M. H. Alciati, C. Baines, M. Barton, J. K. Bobo, C. Coleman, M. Dolan, G. Gaumer, et al.
Clinical Breast Examination: Practical Recommendations for Optimizing Performance and Reporting
CA Cancer J Clin, November 1, 2004; 54(6): 327 - 344.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Figures Only</