Radiology
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Published online before print July 26, 2002, 10.1148/radiol.2243011482
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
2243011482v1
224/3/861    most recent
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Sickles, E. A.
Right arrow Articles by Dee, K. E.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Sickles, E. A.
Right arrow Articles by Dee, K. E.
(Radiology 2002;224:861-869.)
© RSNA, 2002


Breast Imaging

Performance Parameters for Screening and Diagnostic Mammography: Specialist and General Radiologists1

Edward A. Sickles, MD, Dulcy E. Wolverton, MD and Katherine E. Dee, MD2

1 From the Department of Radiology, University of California (UCSF) Medical Center, Box 1667, San Francisco, CA 94143-1667. Received September 4, 2001; revision requested October 16; final revision received February 28, 2002; accepted March 20. Supported by Breast Cancer Surveillance Consortium cooperative agreement no. 1U01 CA 63740 from the National Cancer Institute. Address correspondence to E.A.S.


    ABSTRACT
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
PURPOSE: To evaluate performance parameters for radiologists in a practice of breast imaging specialists and general diagnostic radiologists who interpret a large series of consecutive screening and diagnostic mammographic studies.

MATERIALS AND METHODS: Data (ie, patient age; family history of breast cancer; availability of previous mammograms for comparison; and abnormal interpretation, cancer detection, and stage 0–I cancer detection rates) were derived from review of mammographic studies obtained from January 1997 through August 2001. The breast imaging specialists have substantially more initial training in mammography and at least six times more continuing education in mammography, and they interpret 10 times more mammographic studies per year than the general radiologists. Differences between specialist and general radiologist performances at both screening and diagnostic examinations were assessed for significance by using Student t and {chi}2 tests.

RESULTS: The study involved 47,798 screening and 13,286 diagnostic mammographic examinations. Abnormal interpretation rates for screening mammography (ie, recall rate) were 4.9% for specialists and 7.1% for generalists (P < .001); and for diagnostic mammography (ie, recommended biopsy rate), 15.8% and 9.9%, respectively (P < .001). Cancer detection rates at screening mammography were 6.0 cancer cases per 1,000 examinations for specialists and 3.4 per 1,000 for generalists (P = .007); and at diagnostic mammography, 59.0 per 1,000 and 36.6 per 1,000, respectively (P < .001). Stage 0–I cancer detection rates at screening mammography were 5.3 cancer cases per 1,000 examinations for specialists and 3.0 per 1,000 for generalists (P = .012); and at diagnostic mammography, 43.9 per 1,000 and 27.0 per 1,000, respectively (P < .001).

CONCLUSION: Specialist radiologists detect more cancers and more early-stage cancers, recommend more biopsies, and have lower recall rates than general radiologists.

© RSNA, 2002

Index terms: Breast neoplasms, diagnosis, 00.11, 00.30 • Breast neoplasms, radiography, 00.11, 00.30 • Breast radiography, quality assurance, 00.11 • Cancer screening, 00.11, 00.30 • Radiology and radiologists, observer performance


    INTRODUCTION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
Variability in radiologist performance at screening mammography has been demonstrated in several reports that involved both prospectively designed observer performance studies with standardized case material (13) and comprehensive audits of large-scale mammography practices and organized screening programs (410). The data from some of these reports appear to show a positive correlation between several parameters of screening performance and radiologist experience.

The purpose of our study was to evaluate various performance parameters for radiologists in a practice of breast imaging specialists and general diagnostic radiologists who interpret a large series of consecutive screening and diagnostic mammographic studies, with the aim of extending the scope of previously published observations concerning screening mammography and determining whether similar observations are valid for diagnostic mammography.


    MATERIALS AND METHODS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
We collected data for all screening and diagnostic mammographic examinations that were performed at fixed-site facilities in our institution from January 1997 through August 2001. Some of these data have been reported previously (11). In our practice, screening examinations are performed in asymptomatic women and consist of mediolateral oblique and craniocaudal projections of each breast; the studies are interpreted in batches twice daily and compared with two previous studies (when available). We perform diagnostic mammography as a problem-solving examination that involves a full spectrum of mammographic projections that are tailored to elucidate the unexplained features of each case. Diagnostic studies are interpreted immediately after imaging is completed. Data from both the screening and diagnostic examinations are stored in a computer database that is patterned according to previously described principles (12).

Our study was reviewed and approved by our institutional review board. Appropriate informed consent was obtained from all patients (to permit review of medical records) and from all interpreting radiologists (to permit analysis and reporting of radiologist-specific performance).

Interpreting Physicians
All of the interpreting physicians in our practice are American Board of Radiology–certified radiologists, but their levels of mammography experience and expertise vary. Most are general diagnostic radiologists who have been in practice for 3–32 years (mean, 19.6 years). These individuals, who have neither special training nor a special focus in mammography, interpret sufficient numbers of mammographic studies, both within and outside of our practice, to satisfy U.S. federal regulations for continuing experience (ie, 960 examinations per 2-year period) (13). They also acquire a sufficient level of postgraduate education in mammography to satisfy U.S. federal regulations for continuing education (15 hours of category 1 credit per 3-year period) (13).

Our practice also includes three radiologists (including E.A.S. and D.E.W.) who are breast imaging specialists. These individuals interpret more than 5,000 mammographic studies per year both within and outside of our practice, and they acquire at least 30 hours of category 1 continuing education credit in mammography per year. Our senior radiologist has been in full-time breast imaging practice for 25 years; the other two began practicing in this specialty 7 and 8 years ago, each after having completed a 1-year breast imaging fellowship taught by the senior radiologist.

Mammography Facilities
Our practice operates two fixed-site mammography facilities, one of which is an integral part of our institution’s Comprehensive Cancer Center. All of the specialist radiologists and several of the general diagnostic radiologists interpret mammographic studies at this facility, which is called the Breast Imaging Center (BIC). Radiology residents and breast imaging fellows are trained at the BIC on an ongoing basis. At least two residents and one fellow are always on service; these physicians routinely preview each screening and diagnostic mammographic study prior to its interpretation by a staff radiologist so that all BIC interpretations are rendered by using a form of quadruple reading. Another aspect of routine practice at the BIC is a radiologic-pathologic correlation conference, which is attended by all specialist radiologists, fellows, and residents. At this conference, the pathologic diagnosis for every interventional breast procedure is compared with the radiologic findings to formulate recommendations for additional imaging studies, if appropriate.

Our other fixed-site facility is part of the outpatient radiology department in our institution’s Ambulatory Care Center (ACC). Three of the general diagnostic radiologists interpret mammographic studies at this facility by using a single- rather than multiple-reading protocol, insofar as there are neither residents nor fellows on service at the ACC. The radiologic-pathologic correlation performed by each general diagnostic radiologist (ACC and BIC) is limited to a personal review of the pathologic diagnoses for all cases that are recommended for biopsy by that radiologist, as required by U.S. federal regulations (13).

All BIC diagnostic mammographic studies are interpreted at the BIC, and all ACC diagnostic mammographic studies are interpreted at the ACC. However, whereas all BIC screening studies are interpreted at the BIC, ACC screening studies also are interpreted at the BIC whenever it is practical (in almost 85% of cases) and with no purposeful selection of which ACC screening studies are interpreted at either facility. During this study, all radiologists at both facilities recommended routine screening mammography at an annual interval. Therefore, it is highly likely that there are no substantial differences between the actual frequencies of yearly versus longer interval screening at both facilities, although we cannot confirm this with direct measurement.

Our BIC-based technologists almost always perform mammography at the BIC, and our ACC-based technologists almost always perform mammography at the ACC, with only very occasional cross coverage between the two facilities. However, the levels of experience and expertise of BIC and ACC technologists are essentially identical: The two groups have the same amount of initial training, continuing education, and quality-assurance supervision by the same (BIC) mammography specialist radiologists, and they have similar amounts of continuing experience.

During this study, the mammography equipment at the BIC was very similar in quality to that at the ACC. The imaging protocols used at the two facilities were identical. Furthermore, periodic Mammography Quality Standards Act–mandated testing of BIC and ACC mammography equipment for quality assurance has produced identical results.

Data Collection and Analysis
Prior to image interpretation, under the supervision of a mammography technologist, the patient completes a questionnaire in which she provides data on her age, family history of breast cancer, and the reason(s) for undergoing mammography. These data subsequently are transferred into our computer database. At the time of image interpretation, a radiologist reviews the patient questionnaire and enters the following information into the database: the indication for the examination—that is, screening or diagnostic, and if diagnostic, whether the examination is being performed for evaluation of a palpable mass or another indication; whether previous mammograms are available for comparison; a Breast Imaging Reporting and Data System (BI-RADS) assessment category for each breast; and management recommendations.

Screening mammography findings are considered to be abnormal when either breast is assigned to BI-RADS category 0 (incomplete, needs additional imaging), 4 (suspicious), or 5 (highly suggestive of malignancy). Diagnostic mammography findings are considered to be abnormal when either breast is assigned to BI-RADS category 4 or 5. With fewer than one-fourth of diagnostic mammographic examinations, targeted breast ultrasonography (US) is performed concurrently. For such cases, the data collection procedures of this study were set up so that the presence or absence of US findings did not affect the concurrent mammographic assessment.

Subsequent to image interpretation, we record a benign or malignant diagnosis for all cases in which biopsy is performed. When biopsy reveals malignancy (defined as ductal carcinoma in situ or any invasive carcinoma), we also collect data on the histologic diagnosis, tumor size, and axillary nodal status, from which we determine the stage of the cancer on the basis of the American Joint Committee on Cancer staging system (14).

Because it has been shown that screening and diagnostic mammographic examinations are associated with substantial and sometimes significant differences in rates of abnormal interpretation, biopsy, and cancer detection (8,11), the results of these two types of examinations were analyzed separately. In this study, for both screening and diagnostic mammographic examinations, we calculated abnormal interpretation rates by dividing the number of abnormal interpretations by the number of studies interpreted, biopsy rates by dividing the number of abnormal examinations for which biopsy was performed by the number of studies interpreted, and cancer detection rates by dividing the number of abnormal examinations at which cancer was diagnosed by the number of studies interpreted.

Data tabulations and statistical computations were accomplished by using programming software (S-PLUS; Insightful, Seattle, Wash). The Student t test was performed for comparisons of data with approximately normal distributions, and the {chi}2 test was performed for comparisons of proportional data. A P value of less than .05 was considered to indicate statistical significance.


    RESULTS
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
Patient Population
This study involved 61,084 consecutive mammographic studies, 47,798 of which were classified as screening examinations and 13,286 of which were classified as diagnostic examinations. Table 1 shows the patient age–related breakdown of screening and diagnostic studies by interpreting radiologist. There were only minor, nonsignificant differences in mean patient age among individual radiologists and between the specialist and general radiologist groups. For screening examinations, the mean age of patients was 58.97 years for specialist radiologists and 58.95 years for general radiologists (P = .894); for diagnostic examinations, the corresponding mean ages were 55.82 and 55.46 years, respectively (P = .190).


View this table:
[in this window]
[in a new window]

 
TABLE 1. Patient Age Correlated with Number of Screening versus Diagnostic Mammographic Studies Interpreted by Each Radiologist

 
Similarly, among the examinations for which family history of breast cancer was recorded, there were only small, nonsignificant differences in family history between the studies read by specialist and general radiologists (Table 2). For screening mammography, 13.9% of patients whose studies were interpreted by specialist radiologists and 13.4% of patients whose studies were read by general radiologists had a strong or very strong family history of breast cancer (P = .244). For diagnostic mammography, 17.9% of patients whose studies were interpreted by specialist radiologists and 16.9% of patients whose studies were read by general radiologists had a strong or very strong family history of breast cancer (P = .223).


View this table:
[in this window]
[in a new window]

 
TABLE 2. Family History of Breast Cancer Correlated with Number of Screening versus Diagnostic Mammographic Studies Interpreted by Each Radiologist

 
There also were no significant differences in the availability of previous mammograms between the studies interpreted by specialists and those interpreted by general radiologists (Table 3). For screening mammography, previous mammograms were available for comparison for 54.6% of studies read by specialist radiologists and for 54.3% of studies read by general radiologists (P = .654). For diagnostic mammography, these data were 66.6% for specialist radiologists and 66.1% for general radiologists (P = .645).


View this table:
[in this window]
[in a new window]

 
TABLE 3. Availability of Previous Mammograms Correlated with Number of Screening versus Diagnostic Mammographic Studies Interpreted by Each Radiologist

 
In addition to the direct comparisons of patient populations examined by specialist versus general radiologists shown in Tables 13, we made similar comparisons of the patient populations examined at the BIC versus at the ACC. These latter comparisons also indicated only small, nonsignificant differences in patient age (P = .316), family history of breast cancer (P = .722), and availability of previous mammograms (P = .522) between examinations performed at our two mammography facilities.

Abnormal Interpretation Rates
The results of mammographic interpretations are shown in Table 4. For screening mammography, the abnormal interpretation rate (ie, recall rate) for specialist radiologists was 4.9%, which is approximately 30% lower than the rate of 7.1% for general radiologists (P < .001). The opposite was observed for diagnostic mammography in that the abnormal interpretation rate (ie, biopsy recommended rate) for specialist radiologists, 15.8%, was approximately 60% higher than that for general radiologists, 9.9% (P < .001). The group of general radiologists who work at the BIC had higher abnormal interpretation rates, for both screening and diagnostic mammography, than the group of general radiologists who work at the ACC. For screening mammography, these rates were 7.8% for BIC radiologists and 6.0% for ACC radiologists (P = .003). For diagnostic mammography, these rates were 11.8% for BIC radiologists and 8.8% for ACC radiologists (P = .010).


View this table:
[in this window]
[in a new window]

 
TABLE 4. Abnormal Interpretation Correlated with Number of Screening versus Diagnostic Mammographic Studies Interpreted by Each Radiologist

 
Biopsy and Cancer Detection Rates
As shown in Table 5, the rate at which biopsy was performed was higher among the specialist radiologists than the general radiologists. For screening mammography, the biopsy rate was 1.5% for specialist radiologists, which is approximately 30% higher than the rate of 1.1% for general radiologists (P = .001). For diagnostic mammography, the difference in biopsy rates was even greater: 12.3% for specialist radiologists versus 7.5% for general radiologists (P < .001). BIC general radiologists also had somewhat higher biopsy rates than ACC general radiologists: 1.2% versus 0.9% for screening mammography (P = .256, nonsignificant) and 9.7% versus 6.4% for diagnostic mammography (P = .001).


View this table:
[in this window]
[in a new window]

 
TABLE 5. Biopsy Performed Owing to Abnormal Mammogram Interpretation Correlated with Number of Screening versus Diagnostic Mammographic Studies Interpreted by Each Radiologist

 
The data in Table 6 show that the cancer detection rates also were higher among specialist radiologists than general radiologists. For screening mammography, the cancer detection rate was 6.0 cancer cases per 1,000 examinations for specialist radiologists, which is approximately 75% higher than the rate of 3.4 cases per 1,000 examinations for general radiologists (P = .007). For diagnostic mammography, these data were 59.0 cancer cases per 1,000 examinations for specialist radiologists and 36.6 cases per 1,000 examinations for general radiologists (P < .001). BIC general radiologists also had somewhat higher, although statistically nonsignificant, cancer detection rates than ACC general radiologists: 4.0 cases per 1,000 examinations versus 2.6 cases per 1,000 examinations for screening mammography (P = .412) and 45.6 cases per 1,000 examinations versus 31.9 cases per 1,000 examinations for diagnostic mammography (P = .074).


View this table:
[in this window]
[in a new window]

 
TABLE 6. Cancer Detection Correlated with Number of Screening versus Diagnostic Mammographic Studies Interpreted by Each Radiologist

 
Rates of Detecting Less Advanced Stage Cancer
The data on cancer stage summarized in Table 7 show that the rates of detecting less advanced (ie, stage 0 or I) cancers were higher among specialist radiologists than general radiologists. For screening mammography, the stage 0–I cancer detection rate was 5.3 cancer cases per 1,000 examinations for specialist radiologists, which is approximately 75% higher than the rate of 3.0 cases per 1,000 examinations for general radiologists (P = .012). For diagnostic mammography, these data were 43.9 cases per 1,000 examinations for specialist radiologists and 27.0 cases per 1,000 examinations for general radiologists (P < .001). Again, BIC general radiologists were observed to have somewhat higher stage 0–I cancer detection rates than ACC general radiologists: 3.5 cases per 1,000 examinations versus 2.3 cases per 1,000 examinations for screening mammography (P = .435, nonsignificant) and 35.9 cases per 1,000 examinations versus 22.3 cases per 1,000 examinations for diagnostic mammography (P = .039).


View this table:
[in this window]
[in a new window]

 
TABLE 7. Stage 0 and I Cancer Detection Correlated with Number of Screening versus Diagnostic Mammographic Studies Interpreted by Each Radiologist

 
Indications for Diagnostic Mammography
The diagnostic mammography data on abnormal interpretations, biopsies performed, cancers detected, and stage 0–I cancers detected are categorized according to indication for examination in Tables 811. From the data in each of these tables, it is apparent that there were only small and nonsignificant differences between the interpretations made by specialist radiologists and those made by general radiologists at examinations performed for work-up of palpable masses. All of the differences observed for our entire diagnostic mammography population (Tables 47) were observed to be limited to those examinations performed for indications other than a palpable mass. These differences between specialist and general radiologists, and between BIC and ACC general radiologists, all were statistically significant.


View this table:
[in this window]
[in a new window]

 
TABLE 8. Abnormal Interpretation Correlated with Indication for Diagnostic Mammography

 

View this table:
[in this window]
[in a new window]

 
TABLE 9. Biopsy Performed Owing to Abnormal Mammogram Interpretation Correlated with Indication for Diagnostic Mammography

 

View this table:
[in this window]
[in a new window]

 
TABLE 10. Cancer Detection Correlated with Indication for Diagnostic Mammography

 

View this table:
[in this window]
[in a new window]

 
TABLE 11. Stage 0 and I Cancer Detection Correlated with Indication for Diagnostic Mammography

 

    DISCUSSION
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 
It has been demonstrated in several previous studies (310) involving screening mammography that interpretive performance may vary with radiologist experience. The results of two of these studies show that the performance of groups of radiologists improves over time as a result of either on-the-job experience (6) or attendance at mammography teaching courses (5). In other studies, the superior performance of individual radiologists has been observed to be related to increased amounts of previous screening mammography experience, and, most frequently, to the number of studies interpreted during the past 1 or more years (3,4,710). In the current study, in which we analyzed a large prospective series of consecutive mammographic studies, we extended these observations to a wider range of performance parameters—namely, abnormal interpretation rate, biopsy performed rate, cancer detection rate, and stage 0–I cancer detection rate—and to diagnostic, as well as screening, examinations.

To put the differences in interpretive performance observed among individual radiologists into proper perspective, it is important to determine the extent to which these differences might be due to underlying differences in the studied patient populations. This is pertinent for screening mammography because advancing patient age is directly proportional to cancer detection rate (1518) and because comparison with previous mammograms is associated with a decreased abnormal interpretation (ie, recall) rate (1823). Therefore, our finding that there were only small, nonsignificant differences in age, family history of breast cancer, and previous mammogram availability between studies interpreted by specialist versus general radiologists, and between those interpreted by BIC versus ACC general radiologists, strongly suggests that the several, much more substantial significant differences in performance parameters between these radiologist groups truly represented differences in interpretive performance rather than effects of selection bias due to differences in the patient populations studied by specialist versus general radiologists.

In addition, there was no indication of selection bias with regard to the patients examined at the BIC versus at the ACC or with regard to the quality of mammographic examinations performed at the BIC versus at the ACC, which have similarly skilled technologists and mammography equipment of similar quality. These observations further support our conclusion that the substantial statistically significant differences in performance parameters between specialist and general radiologists, and between BIC and ACC general radiologists, truly represented differences in interpretive performance.

The most striking differences in radiologist performance were observed between our three specialist radiologists and the seven general radiologists, who interpret mammograms less frequently. At analysis of all reported parameters of interpretive performance, each of these differences was statistically significant. For screening mammography, the abnormal interpretation (ie, recall) rate for the specialist radiologists was significantly lower than that for the general radiologists. For diagnostic mammography, however, the abnormal interpretation (ie, biopsy recommended) rate for the specialist radiologists was higher than that for the general radiologists.

The results were uniformly consistent for the remaining performance parameters at both screening and diagnostic mammographic examinations. The biopsy performed, cancer detection, and stage 0–I cancer detection rates all were significantly higher for the specialist radiologists. Thus, the specialist radiologists make more true-positive and fewer false-positive interpretations at screening mammography, and this is clearly a preferable level of performance. At diagnostic mammography, the specialist radiologists make more true-positive and more false-positive interpretations, and we believe these results also represent a preferable level of performance because the principal goal of mammography is cancer detection rather than reduction of benign biopsy results.

Several differences in the experience and expertise of these two groups of radiologists may explain the observed differences in their performances. First, the specialist radiologists have considerably more training and education in mammography. The general radiologists received the basic amount of initial training in mammography during their diagnostic radiology residencies and have received close to the minimum amount of continuing education in mammography that is required by federal regulations. On the other hand, two of the three specialist radiologists have completed 1-year breast imaging fellowships, the other has taught breast imaging fellows for more than 10 years, and all of the specialist radiologists earn at least six times more hours of category 1 continuing education credits in mammography than are required by federal regulations.

Our specialist radiologists also have considerably more continuing experience in mammographic interpretation. As breast imaging specialists, they read at least 5,000 studies per year, which is 10 times greater than the basic caseload that is required by federal regulations and handled by most of our general radiologists. However, to the extent that increased experience correlates with improved performance, the important factor appears to be caseload per year rather than number of years of experience, because, on average, the general radiologists have been in practice many more years than the specialist radiologists: 19.6 versus 13.3 years.

For both screening and diagnostic mammographic examinations, the BIC general radiologists have somewhat higher abnormal interpretation, biopsy performed, cancer detection, and stage 0–I cancer detection rates than the ACC general radiologists. Therefore, the BIC radiologists make more positive interpretations, with increases in both true-positive and false-positive cases. As stated previously, such results are considered a preferable level of performance.

We cannot explain these observed differences in performance among the general radiologists on the basis of either experience or expertise, because there are no substantial differences in the amounts of initial or continuing education in mammography received by our ACC versus BIC general radiologists, nor are there substantial differences in the years of practice experience or mammography caseload per year between the two groups. The only difference between these groups of radiologists involves the environment within which they interpret mammograms. ACC radiologists single read mammographic studies because neither residents nor breast imaging fellows have clinical service rotations at the ACC. On the other hand, BIC radiologists perform interpretations by using a multiple-reading system: At least two residents and one breast imaging fellow preview both screening and diagnostic mammographic studies and then present their findings to the staff radiologist. The actual interpretation is then made by the staff radiologist, who must decide whether to accept or reject the assessments proposed by the residents and fellows. This multiple-reading system is much more similar to the second-look type of double reading that is performed by computer-assisted detection software programs than it is to the independent or consensus double reading that is performed by two human observers, because computer-assisted detection second reading also involves a single interpreting radiologist who must decide, in addition to acting on his or her own personal assessments, whether or not to act on the prompts presented by the computer-assisted detection system.

We conclude that the observed differences in performance between BIC and ACC general radiologists in our practice are a result of multiple- versus single-reading interpretation, not only because there are no differences in experience or expertise between these groups of radiologists, but also because prospective computer-assisted detection second reading has been shown to produce similar improvements in performance (ie, an increase in both true-positive and false-positive interpretations) (24).

Finally, further assessment of the outcomes of the described diagnostic mammography cases at subset analysis based on the indications for the examinations revealed that the preferable performances of the specialist (vs general) radiologists and BIC (vs ACC) general radiologists were limited to their performances at examinations performed for indications other than the evaluation of a palpable mass. Because all groups and subgroups of radiologists had equivalent performance in the palpable mass cases, we conclude that for all of the radiologists in our practice, presence of a palpable mass, rather than radiologist experience and expertise, is the principal factor that governs the decision of whether or not to recommend biopsy.

In summary, our study results indicate that at both screening mammography and diagnostic mammography in our practice, specialist radiologists have a significantly more preferable interpretive performance compared with general radiologists, and we suspect that this difference is related to the increased amounts of initial training and continuing education in mammography and the increased continuing experience in mammographic interpretation (regarding caseload per year, not number of years in practice) of the specialist radiologists. In addition, among our general radiologists, we observed a smaller magnitude of but occasionally also significantly more preferable performance by those general radiologists who use the multiple- rather than single-reading system of interpretation.

We believe that the major strengths of our findings are the large number of screening and diagnostic mammograms studied, the statistically significant differences observed, and the fact that each analyzed subgroup consisted of several radiologists (although an even larger number of radiologists in each subgroup would facilitate a more accurate determination of the range of performance among all practicing radiologists). The major limitation of our study is that we reported no data on the performance of general radiologists who have an intermediate level of education and experience—that is, between that of our specialist and general radiologist groups. Throughout the United States and in other countries, there are many general radiologists who have a special interest in mammography and who have acquired much more than the required levels of education and experience in mammography. It will be important to determine the levels of mammographic performance for this subgroup of radiologists.


    FOOTNOTES
 
2 Current address: Department of Radiology, University of Washington Medical Center, Seattle. Back

Abbreviations: BIC = Breast Imaging Center, BI-RADS = Breast Imaging Reporting and Data System, ACC = Ambulatory Care Center

Author contributions: Guarantor of integrity of entire study, E.A.S.; study concepts, E.A.S., D.E.W.; study design, E.A.S.; literature research, E.A.S.; clinical studies, E.A.S., D.E.W., K.E.D.; data acquisition and analysis/interpretation, E.A.S., D.E.W., K.E.D.; statistical analysis, E.A.S., D.E.W., K.E.D.; manuscript preparation, E.A.S.; manuscript definition of intellectual content, editing, revision/review, and final version approval, E.A.S., D.E.W., K.E.D.


    REFERENCES
 TOP
 ABSTRACT
 INTRODUCTION
 MATERIALS AND METHODS
 RESULTS
 DISCUSSION
 REFERENCES
 

  1. Beam CA, Layde PM, Sullivan DC. Variability in the interpretation of screening mammograms by US radiologists: findings from a national sample. Arch Intern Med 1996; 156:209-213.[Abstract]
  2. Beam CA, Sullivan DC, Layde PM. Effect of human variability on independent double reading in screening mammography. Acad Radiol 1996; 3:891-897.[CrossRef][Medline]
  3. Elmore JG, Wells CK, Howard DH. Does diagnostic accuracy in mammography depend on radiologist experience? J Womens Health 1998; 7:443-449.[Medline]
  4. Sickles EA, Ominsky SH, Sollitto RA, Galvin HB, Monticciolo DL. Medical audit of a rapid throughput mammography screening practice: methodology and results of 27,114 examinations. Radiology 1990; 175:323-327.[Abstract/Free Full Text]
  5. Linver MN, Paster SB, Rosenberg RD, et al. Improvement in mammography interpretation skills in a community radiology practice after dedicated teaching courses: 2-year medical audit of 38,633 cases. Radiology 1992; 184:39-43.[Abstract/Free Full Text]
  6. Pamilo M, Anttinen I, Soiva M, Roiha M, Suramo I. Mammography screening: reasons for recall and the influence of experience on recall in the Finnish system. Clin Radiol 1990; 41:384-387.[CrossRef][Medline]
  7. Sickles EA. Quality assurance: how to audit your own mammography practice. Radiol Clin North Am 1992; 30:265-275.[Medline]
  8. Robertson CL. A private breast imaging practice: medical audit of 25,788 screening and 1,077 diagnostic examinations. Radiology 1993; 187:75-79.[Abstract/Free Full Text]
  9. Sickles EA. Auditing your practice. In: Kopans DB, Mendelson EB, eds. Syllabus: a categorical course in breast imaging. Oak Brook, Ill: Radiological Society of North America, 1995; 81-91.
  10. Kan L, Olivotto IA, Warren Burhenne LJ, Sickles EA, Coleman AJ. Standardized abnormal interpretation and cancer detection ratios to assess reading volume and reader performance in a breast screening program. Radiology 2000; 215:563-567.[Abstract/Free Full Text]
  11. Dee KE, Sickles EA. Medical audit of diagnostic mammography examinations: comparison with screening outcomes obtained concurrently. AJR Am J Roentgenol 2001; 176:729-733.[Abstract/Free Full Text]
  12. Sickles EA. The usefulness of computers in managing the operation of a mammography screening practice. AJR Am J Roentgenol 1990; 155:755-761.[Abstract/Free Full Text]
  13. United States Food and Drug Administration. Quality mammography standards: final rule—21 CFR parts 16 and 900 Washington, DC: United States Department of Health and Human Services, 1997.
  14. American Joint Committee on Cancer. Manual for staging of cancer 5th ed. Philadelphia, Pa: Lippincott, 1997.
  15. Warren Burhenne LJ, Hislop TG, Burhenne HJ. The British Columbia mammography screening program: evaluation of the first 15 months. AJR Am J Roentgenol 1992; 158:45-49.[Abstract/Free Full Text]
  16. Curpen BN, Sickles EA, Sollitto RA, Ominsky SH, Galvin HB, Frankel SD. The comparative value of mammographic screening for women 40–49 years old versus women 50–64 years old. AJR Am J Roentgenol 1995; 164:1099-1103.[Abstract/Free Full Text]
  17. Otten JDM, van Dijck JAAM, Peer PGM, et al. Long term breast cancer screening in Nijmegen, the Netherlands: the nine rounds from 1975–92. J Epidemiol Community Health 1996; 50:353-358.[Abstract]
  18. Libstug AR, Moravan V, Aitken SE. Results from the Ontario breast screening program, 1990–1995. J Med Screen 1998; 5:73-80.[Abstract/Free Full Text]
  19. Sickles EA, Ominsky SH, Sollitto RA, Galvin HB, Monticciolo DL. Medical audit of a rapid-throughput mammography screening practice: methodology and results of 27,114 examinations. Radiology 1990; 175:323-327.
  20. Tabár L, Fagerberg G, Duffy SW, Day NE, Gad A, Gröntoft O. Update of the Swedish two-county program of mammographic screening for breast cancer. Radiol Clin North Am 1992; 30:187-210.[Medline]
  21. Frankel SD, Sickles EA, Curpen BN, Sollitto RA, Ominsky SH, Galvin HB. Initial versus subsequent screening mammography: comparison of findings and their prognostic significance. AJR Am J Roentgenol 1995; 164:1107-1109.[Abstract/Free Full Text]
  22. Callaway MP, Boggis CRM, Astley SA, Hutt I. The influence of previous films on screening mammographic interpretation and detection of breast carcinoma. Clin Radiol 1997; 52:527-529.[CrossRef][Medline]
  23. Fracheboud J, de Koning HJ, Beemsterboer PMM, et al. Nationwide breast cancer screening in the Netherlands: results of initial and subsequent screening 1990–1995. Int J Cancer 1998; 75:694-698.[CrossRef][Medline]
  24. Freer TW, Ulissey MJ. Screening mammography with computer-aided detection: prospective study of 12,860 patients in a community breast center. Radiology 2001; 220:781-786.[Abstract/Free Full Text]



This article has been cited by other articles:


Home page
JNCI J Natl Cancer InstHome page
S. Hofvind, P. M. Vacek, J. Skelly, D. L. Weaver, and B. M. Geller
Comparing Screening Mammography for Early Breast Cancer Detection in Vermont and Norway
J Natl Cancer Inst, August 6, 2008; 100(15): 1082 - 1091.
[Abstract] [Full Text] [PDF]


Home page
JNCI J Natl Cancer InstHome page
S. Taplin, L. Abraham, W. E. Barlow, J. J. Fenton, E. A. Berns, P. A. Carney, G. R. Cutter, E. A. Sickles, D. Carl, and J. G. Elmore
Mammography Facility Characteristics Associated With Interpretive Accuracy of Screening Mammography
J Natl Cancer Inst, June 18, 2008; 100(12): 876 - 887.
[Abstract] [Full Text] [PDF]


Home page
JNCI J Natl Cancer InstHome page
D. L. Miglioretti, R. Smith-Bindman, L. Abraham, R. J. Brenner, P. A. Carney, E. J. A. Bowles, D. S. M. Buist, and J. G. Elmore
Radiologist Characteristics Associated With Interpretive Performance of Diagnostic Mammography
J Natl Cancer Inst, December 19, 2007; 99(24): 1854 - 1863.
[Abstract] [Full Text] [PDF]


Home page
JCOHome page
H. Singh, S. Sethi, M. Raber, and L. A. Petersen
Errors in Cancer Diagnosis: Current Understanding and Future Directions
J. Clin. Oncol., November 1, 2007; 25(31): 5009 - 5018.
[Abstract] [Full Text] [PDF]


Home page
RadiologyHome page
Y. Jiang, D. L. Miglioretti, C. E. Metz, and R. A. Schmidt
Breast Cancer Detection Rate: Designing Imaging Trials to Demonstrate Improvements
Radiology, May 1, 2007; 243(2): 360 - 367.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Roentgenol.Home page
J. W. T. Leung and E. A. Sickles
Developing Asymmetry Identified on Mammography: Correlation with Imaging Outcome and Pathologic Findings
Am. J. Roentgenol., March 1, 2007; 188(3): 667 - 675.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Roentgenol.Home page
P. A. Carney, L. A. Abraham, D. L. Miglioretti, K. R. Yabroff, E. A. Sickles, D. S. M. Buist, C. J. Kasales, B. M. Geller, R. D. Rosenberg, M. B. Dignan, et al.
Factors Associated with Imaging and Procedural Events Used to Detect Breast Cancer After Screening Mammography
Am. J. Roentgenol., February 1, 2007; 188(2): 385 - 392.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Roentgenol.Home page
J. W. T. Leung, F. R. Margolin, K. E. Dee, R. P. Jacobs, S. R. Denny, and John. D. Schrumpf
Performance Parameters for Screening and Diagnostic Mammography in a Community Practice: Are There Differences Between Specialists and General Radiologists?
Am. J. Roentgenol., January 1, 2007; 188(1): 236 - 241.
[Abstract] [Full Text] [PDF]


Home page
RadiologyHome page
European Society of Gastrointestinal and Abdominal
Effect of Directed Training on Reader Performance for CT Colonography: Multicenter Study
Radiology, January 1, 2007; 242(1): 152 - 161.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Roentgenol.Home page
S. H. Taplin, C. M. Rutter, and C. D. Lehman
Testing the effect of computer-assisted detection on interpretive performance in screening mammography.
Am. J. Roentgenol., December 1, 2006; 187(6): 1475 - 1482.
[Abstract] [Full Text] [PDF]


Home page
RadiologyHome page
R. D. Rosenberg, B. C. Yankaskas, L. A. Abraham, E. A. Sickles, C. D. Lehman, B. M. Geller, P. A. Carney, K. Kerlikowske, D. S. M. Buist, D. L. Weaver, et al.
Performance Benchmarks for Screening Mammography
Radiology, October 1, 2006; 241(1): 55 - 66.
[Abstract] [Full Text] [PDF]


Home page
RadiologyHome page
E. S. Burnside, D. L. Rubin, J. P. Fine, R. D. Shachter, G. A. Sisney, and W. K. Leung
Bayesian Network to Predict Breast Cancer Risk of Mammographic Microcalcifications and Reduce Number of Benign Biopsy Results: Initial Experience
Radiology, September 1, 2006; 240(3): 666 - 673.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Roentgenol.Home page
J. C. Dean and C. C. Ilvento
Improved cancer detection using computer-aided detection with diagnostic and screening mammography: prospective study of 104 cancers.
Am. J. Roentgenol., July 1, 2006; 187(1): 20 - 28.
[Abstract] [Full Text] [PDF]


Home page
RadiologyHome page
W. A. Berg, J. D. Blume, J. B. Cormack, E. B. Mendelson, and E. L. Madsen
Lesion Detection and Characterization in a Breast US Phantom: Results of the ACRIN 6666 Investigators
Radiology, June 1, 2006; 239(3): 693 - 702.
[Abstract] [Full Text] [PDF]


Home page
RadiologyHome page
M. A. Ganott, J. H. Sumkin, J. L. King, A. H. Klym, V. J. Catullo, C. S. Cohen, and D. Gur
Screening Mammography: Do Women Prefer a Higher Recall Rate Given the Possibility of Earlier Detection of Cancer?
Radiology, March 1, 2006; 238(3): 793 - 800.
[Abstract] [Full Text] [PDF]


Home page
RadiologyHome page
A. J. Coldman, D. Major, G. P. Doyle, Y. D'yachkova, N. Phillips, J. Onysko, R. Shumak, N. E. Smith, and N. Wadden
Organized Breast Screening Programs in Canada: Effect of Radiologist Reading Volumes on Outcomes
Radiology, March 1, 2006; 238(3): 809 - 815.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Roentgenol.Home page
E. S. Burnside, J. M. Park, J. P. Fine, and G. A. Sisney
The Use of Batch Reading to Improve the Performance of Screening Mammography
Am. J. Roentgenol., September 1, 2005; 185(3): 790 - 796.
[Abstract] [Full Text] [PDF]


Home page
RadiologyHome page
E. A. Sickles, D. L. Miglioretti, R. Ballard-Barbash, B. M. Geller, J. W. T. Leung, R. D. Rosenberg, R. Smith-Bindman, and B. C. Yankaskas
Performance Benchmarks for Diagnostic Mammography
Radiology, June 1, 2005; 235(3): 775 - 790.
[Abstract] [Full Text] [PDF]


Home page
RadiologyHome page
D. Gur, L. P. Wallace, A. H. Klym, L. A. Hardesty, G. S. Abrams, R. Shah, and J. H. Sumkin
Trends in Recall, Biopsy, and Positive Biopsy Rates for Screening Mammography in an Academic Practice
Radiology, May 1, 2005; 235(2): 396 - 401.
[Abstract] [Full Text] [PDF]


Home page
JNCI J Natl Cancer InstHome page
R. Smith-Bindman, P. Chu, D. L. Miglioretti, C. Quale, R. D. Rosenberg, G. Cutter, B. Geller, P. Bacchetti, E. A. Sickles, and K. Kerlikowske
Physician Predictors of Mammographic Accuracy
J Natl Cancer Inst, March 2, 2005; 97(5): 358 - 367.
[Abstract] [Full Text] [PDF]


Home page
RadiologyHome page
D. Gur, J. S. Stalder, L. A. Hardesty, B. Zheng, J. H. Sumkin, D. M. Chough, B. E. Shindel, and H. E. Rockette
Computer-aided Detection Performance in Mammographic Examination of Masses: Assessment
Radiology, November 1, 2004; 233(2): 418 - 423.
[Abstract] [Full Text] [PDF]


Home page
RadiologyHome page
M. Mullerad, H. Hricak, L. Wang, H.-N. Chen, M. W. Kattan, and P. T. Scardino
Prostate Cancer: Detection of Extracapsular Extension by Genitourinary and General Body Radiologists at MR Imaging
Radiology, July 1, 2004; 232(1): 140 - 146.
[Abstract] [Full Text] [PDF]


Home page
RadiologyHome page
M. A. Helvie, L. Hadjiiski, E. Makariou, H.-P. Chan, N. Petrick, B. Sahiner, S.-C. B. Lo, M. Freedman, D. Adler, J. Bailey, et al.
Sensitivity of Noncommercial Computer-aided Detection System for Mammographic Breast Cancer Detection: Pilot Clinical Trial
Radiology, April 1, 2004; 231(1): 208 - 214.
[Abstract] [Full Text] [PDF]