- Original article
- Open Access
Breast MRI radiomics: comparison of computer- and human-extracted imaging phenotypes
European Radiology Experimental volume 1, Article number: 22 (2017)
In this study, we sought to investigate if computer-extracted magnetic resonance imaging (MRI) phenotypes of breast cancer could replicate human-extracted size and Breast Imaging-Reporting and Data System (BI-RADS) imaging phenotypes using MRI data from The Cancer Genome Atlas (TCGA) project of the National Cancer Institute.
Our retrospective interpretation study involved analysis of Health Insurance Portability and Accountability Act-compliant breast MRI data from The Cancer Imaging Archive, an open-source database from the TCGA project. This study was exempt from institutional review board approval at Memorial Sloan Kettering Cancer Center and the need for informed consent was waived. Ninety-one pre-operative breast MRIs with verified invasive breast cancers were analysed. Three fellowship-trained breast radiologists evaluated the index cancer in each case according to size and the BI-RADS lexicon for shape, margin, and enhancement (human-extracted image phenotypes [HEIP]). Human inter-observer agreement was analysed by the intra-class correlation coefficient (ICC) for size and Krippendorff’s α for other measurements. Quantitative MRI radiomics of computerised three-dimensional segmentations of each cancer generated computer-extracted image phenotypes (CEIP). Spearman’s rank correlation coefficients were used to compare HEIP and CEIP.
Inter-observer agreement for HEIP varied, with the highest agreement seen for size (ICC 0.679) and shape (ICC 0.527). The computer-extracted maximum linear size replicated the human measurement with p < 10−12. CEIP of shape, specifically sphericity and irregularity, replicated HEIP with both p values < 0.001. CEIP did not demonstrate agreement with HEIP of tumour margin or internal enhancement.
Quantitative radiomics of breast cancer may replicate human-extracted tumour size and BI-RADS imaging phenotypes, thus enabling precision medicine.
Computer-extracted breast cancer maximum linear size replicated the human measurement.
Computer-extracted breast cancer shape replicated the human-extracted imaging phenotypes.
Using both computer- and human-extracted imaging phenotypes may be clinically valuable.
Breast magnetic resonance imaging (MRI) is currently the most sensitive imaging modality to detect breast cancer . Radiologists are trained to visually identify pertinent imaging characteristics for cancer detection while taking into account clinical information and context. To standardise reporting and minimise inter-observer variability, the Breast Imaging-Reporting and Data System (BI-RADS), a quality assurance tool, was designed and disseminated into clinical practice . Nevertheless, substantial inter- and intra-observer agreement variability continue to exist among radiologists, though the degree of both is unclear because of a paucity of literature . Despite this recognised variability, the radiologist is the imaging reference standard for interpretation of diagnostic imaging studies, including breast MRI.
The goal of research focusing on integrating computer-aided diagnosis (CAD) and human MRI interpretation is to improve breast cancer detection, moving beyond determining if a lesion is benign or malignant [4,5,6,7], and additionally to use radiomics in assessing cancer subtypes. Prototype CAD systems have been investigated clinically and shown to significantly improve the average diagnostic performance of radiologists . Now algorithms are tasked with improving specificity, decreasing observer variability, and identifying biomarkers of breast cancer genotype and/or outcome. Breast cancer radiomics, namely computer-assisted phenotype extraction, can quantify information from voxel-based MRI, providing reproducible information that may be imperceptible to the human eye. Quantitative radiomics can analyse texture features, providing further insight into inter- and intra-tumour heterogeneity and assess enhancement kinetics [3, 7, 9,10,11]. Similar to human assessments, there may be computer software and/or algorithm variability.
Diagnostic breast MRI is in the midst of a paradigm shift whereby the correlative and complementary roles of human-extracted imaging phenotypes (HEIP) and computer-extracted imaging phenotypes (CEIP) are being assessed for the diagnosis, treatment and management of breast cancer. There may soon be a clinical role for using both HEIP and CEIP wherein the strengths of each are leveraged to generate combined biomarkers that accurately predict important outcomes. One of the initial steps for CEIP to be clinically accepted is to compare them with the clinical gold standard (i.e., human radiologists) and to demonstrate that CEIP can reasonably replicate HEIP. The purpose of this study was to investigate if CEIP MRI phenotypes of breast cancer could replicate HEIP (i.e., human-extracted size and BI-RADS phenotypes) using MRIs from The Cancer Genome Atlas (TCGA) project of the National Cancer Institute.
Institutional review board approval
In this retrospective study, all patient data were Health Insurance Portability and Accountability Act-compliant and acquired under institutional review board approval with a waiver of the need for informed consent.
We retrieved data from an open-source de-identified database, The Cancer Imaging Archive (TCIA) , which is the imaging counterpart of TCGA. TCGA, in brief, is a coordinated effort led by the National Cancer Institute to accelerate the molecular and genomic understanding of cancer . The TCGA program performs genomic sequencing and characterisation of tissue from cancers diagnosed and treated at cancer centres around the United States.
Breast cancer (both invasive ductal and lobular) was one of the cancers selected for study, and by December 16, 2014, 1100 breast cancer cases had been collected by TCGA and 1098 also had available genomic and clinical data (available at: https://gdc.cancer.gov/; accessed February 17, 2016). To complement the genomic and clinical data, the National Cancer Institute enriched the TCGA open-source data portal by collecting MRI studies for storage and analysis in the TCIA. However, in 2014, only 108 breast cancers had TCIA MRI (pre-operative examinations with pathologically verified cancer) to correlate with available TCGA data. Clinical, pathologic and genomic data were extracted using the TCGA assembler, an open-source, publicly available, free tool .
Because our patients were drawn from the TCGA, an open-source database, some patients in our cohort were included in several previously published studies [9,10,11, 15, 16]. However, this paper has no scientific overlap with the other published studies. We report new findings from our study done to investigate if CEIP of breast cancer could replicate HEIP using MRI from TCGA. Thus, whereas we assessed the correlation between CEIP and HEIP, the other studies assessed the correlation between CEIP and clinical features.
All breast MRI data were downloaded from the Breast Cancer Risk Assessment collection within TCIA (http://www.cancerimagingarchive.net). The data had previously been generated under MRI studies originally performed between 1999 and 2004 at four institutions: the Mayo Clinic, Memorial Sloan Kettering Cancer Center, Roswell Park Cancer Institute and the University of Pittsburgh Medical Center. To avoid differences in image quality caused by different equipment vendors, only MR images obtained with 1.5-T systems were included in our study, which limited the number of patients considered for inclusion in our analysis to 93. Of these 93 patients, two were excluded because of missing genomic data (n = 1) and missing images (n = 1).
Therefore, in total, 91 female patients with pre-operative breast MRI studies underwent analysis in our study. All images were acquired with a 1.5-T system (Signa or Signa HDX; General Electric Medical Systems, Waukesha, WI, USA). In all patients, a dedicated surface breast coil was used. T1-weighted fat-suppressed images were acquired before and after intravenous administration of a gadolinium-based contrast agent (gadodiamide, Omniscan®; Nycomed-Amersham, Princeton, NJ, USA), with three to five post-contrast images because the protocol was institution-dependent. In-plane spatial resolution ranged from 0.53 to 0.86 mm, and slice spacing ranged from 2 to 3 mm. Only pre- and post-contrast T1-weighted fat-suppressed images were included in our study. Further information, including full clinical breast MRI protocols, can be accessed from the open-source TCIA.
To generate HEIP, a pool of eleven board-certified breast-imaging radiologists, with experience ranging from 4 to 29 years (ESB, 14 years; GJW, 25 years; EJS, 4 years; JMN, 5 years; MG, 29 years; and EAM, 25 years; the other five radiologists were non-authors), participated in the manual assessment of MRI data. The image location of the index breast cancer and maximal tumour size were identified on the first post-contrast image and annotated by each radiologist using an open-source and open-access software platform . For each patient, three radiologists from this pool were randomly assigned to review the imaging data. Visual assessments of tumour characteristics of the cancer were made according to BI-RADS 5 descriptors (lesion-shaped, internal enhancement and margin), yielding HEIP. In patients with multifocal or multicentric disease, the largest mass was used as the index lesion.
Four HEIP characteristics were assessed: lesion size (largest size as per Response Evaluation Criteria in Solid Tumours 1.1 recommendation ), lesion shape (whether the lesion was irregularly shaped or round/oval), internal enhancement (whether the enhancement was heterogeneous or homogeneous) and margin (whether the margin was circumscribed, irregular or spiculated). The radiologists performed their reviews independently and were blinded to all health information. Enhancement kinetics/curves were not generated, owing to the variable temporal resolution across sites and over time, which would have significantly impacted the accuracy of the results. HEIP for each feature from the three radiologists was summarised into a single representative consensus value for each patient; these summary values served as the case-based HEIP in the subsequent statistical analysis. The decision rule for consensus was simple majority. There were no cases where all readers disagreed.
CEIP (quantitative radiomics)
Given the approximate tumour centre location, each index breast tumour was automatically segmented in the three-dimensional (3D) space. The quantitative radiomics workstation used for this study was able to yield 38 CEIP from dynamic contrast-enhanced MRI scans to characterise tumour size, shape, margin, enhancement texture, kinetics, and variance kinetics [6, 7, 19,20,21]. However, because kinetic characteristics were not assessed by the radiologists (i.e., as HEIP) for this study, only 24 phenotypes (from four categories) were used to compare the HEIP and CEIP. The 14 kinetics-related CEIP were excluded. All measurements were extracted from the first post-contrast MR images.
The 24 CEIP were calculated on the basis of automatically derived 3D tumour segmentations . They were further separated into four phenotypic categories: (a) size-measuring tumour dimensions (4 CEIP); (b) shape, quantifying the 3D tumour geometry (3 CEIP); (c) morphology, combining tumour shape and margin characteristics such as margin sharpness (3 CEIP); and (d) enhancement texture, describing the texture of the contrast uptake (heterogeneity of the uptake) in the tumour on the first post-contrast MRI image (14 CEIP) (Fig. 1). Owing to our small sample size, we report only the CEIP extracted for each HEIP BI-RADS descriptor but do not provide the coefficients.
Inter-observer agreement in HEIP
Inter-observer agreement for human-extracted lesion size was based on inferences on the (−log 1.20, log 1.20) coverage probability of the natural logarithm of the size. This is the probability that the absolute difference between the natural logarithm of the size measurements from any two radiologists differs by less than log 1.20, or equivalently, that the size measurements from any two radiologists are within 20% of each other. Inter-observer agreement for human-extracted tumour shape, margin and heterogeneity was based on inferences on Krippendorff’s α . The 95% CIs for Krippendorff’s α and the coverage probability were constructed using the nonparametric bootstrap .
Associations between HEIP and CEIP
Associations for lesion size were assessed through inferences on the Kendall τ rank correlation coefficient ; p values were obtained through permutation tests. Associations for both shape and internal enhancement were assessed through the Mann-Whitney U test . Associations for margin were assessed through inferences on the Kendall τ rank correlation coefficient; p values were obtained through permutation tests . The Benjamini-Hochberg procedure was used to correct for multiple hypothesis testing. All tests of associations between HEIPs and individual CEIPs were performed at the α = 0.05 level .
Replicating HEIP using CEIP
We analysed the feasibility of replicating each of the four HEIP using the corresponding CEIP. These were treated as either prediction or classification problems that we evaluated in terms of accuracy. The replication of HEIP of tumour size given the CEIP of tumour size was treated as a prediction problem. A predictor for the HEIP of tumour size based on the size-related CEIP was constructed using multivariate linear regression subject to elastic net constraints . The elastic net constraints set some coefficients in the model exactly to zero because of the model’s geometry, thus performing variable selection, and also simultaneously stabilises the coefficient estimates in the presence of substantial correlation between the CEIP. Ten-fold cross-validation  was used to select the values of the tuning parameters controlling the severity of the elastic net constraints.
The replications of tumour shape, internal enhancement, and margin were treated as classification problems. Classifiers for tumour shape and internal enhancement were constructed using multiple logistic regression subject to elastic net constraints. Classifiers for tumour margin were constructed using ordinal logistic regression, also subject to elastic net constraints. For these classifiers, tuning parameters for the elastic net constraints were also selected using ten-fold cross-validation.
These predictors and classifiers were then assessed in terms of their prediction and classification accuracy through nested ten-fold by ten-fold cross-validation . For tumour size, performance was evaluated in terms of the mean squared deviation . For tumour shape and internal enhancement, performance was evaluated in terms of AUC in ROC analysis. For tumour margin, performance was evaluated in terms of the Kendall τ rank correlation coefficient between the classifier score and the actual value of the HEIP of tumour margin .
For each of these four tumour characteristics (size, shape, internal enhancement, and margin), the p values of predictive signal contained within CEIP were computed through permutation tests. The Benjamini-Hochberg procedure was applied to these p values to correct for multiple hypothesis testing .
Ninety-one patients with invasive breast cancer who underwent pre-operative breast MRI were included in the study sample. Their median age was 53.6 years (range 29–82) at first cancer diagnosis (Table 1). Breast cancers were predominantly oestrogen receptor- and progesterone receptor-positive. Human epidermal grow factor receptor 2 status was known only in 63 (69%) of 91 patients, of whom 49 (78%) of 63 were negative and 14 (22%) of 63 were positive. Additional information regarding the tumours can be found on the open-source website (available at: https://gdc.cancer.gov/; accessed February 17, 2016).
Inter-observer agreement in HEIP
There was variability in the inter-observer agreement of HEIP of tumour size, shape, margin and internal enhancement (Table 2). Tumour size, the only quantitative measurement, showed the strongest inter-observer agreement, with a coverage probability of π = 0.679 (95% CI 0.561–0.736). Tumour shape and margin showed moderate inter-observer agreement, with α = 0.527 (95% CI 0.380–0.654) and α = 0.561 (95% CI 0.426–0.674), respectively. Tumour internal enhancement showed the poorest inter-observer agreement, with α = 0.292 (95% CI 0.147–0.433).
Association between HEIP and CEIP
Strong evidence of associations between the HEIP and corresponding CEIP were observed in some cases, indicating that the CEIP may have the ability to replicate the HEIP (Table 3).
The p values of the associations between HEIP of tumour size and each of the size-related CEIPs were all < 10−12. Figure 2 shows a scatterplot of the human-extracted tumour size versus each of the four tumour size-related CEIP. The p values of the associations between the HEIP and corresponding CEIP of tumour shape, specifically sphericity, and irregularity, were 5.65 × 10−5 and 3.98 × 10−3, respectively. Figure 3 shows box plots of the CEIP of tumour sphericity and irregularity versus HEIP of tumour shape. Figures 4 and 5 demonstrate representative cases where CEIP and HEIP are concordant and discordant, respectively. These associations also remained significant even after adjustment for multiple comparisons using the Benjamini-Hochberg procedure .
Meanwhile, no evidence of association between the HEIP and the corresponding CEIP of tumour internal enhancement or tumour margin was observed.
Feasibility of replicating HEIP using combinations of HEIP
A summary of the results of the analysis of the ability of CEIP to replicate HEIP is given in Table 4.
HEIP of tumour size could be replicated using a weighted sum of CEIP of effective diameter, surface area and maximum linear size (mean squared deviation 56.56, p < 0.001). HEIP of tumour shape could also be replicated using CEIP of tumour sphericity (AUC 0.720, p = 0.004). Both the abilities of tumour size-related CEIP to replicate HEIP of tumour size and CEIP of tumour sphericity to replicate HEIP of tumour shape remained statistically significant even after correction for multiple testing.
The HEIP of tumour internal enhancement could not be replicated with a reasonable degree of accuracy using CEIP of tumour enhancement; the AUC associated with the best model as identified through ten-fold cross-validation, which involved energy, entropy, information measure of correlation, maximum correlation coefficient, sum of average, sum of entropy, and sum of squares, was 0.551 (p = 0.263).
The HEIP of tumour margin could not be replicated with a reasonable degree of accuracy using CEIP of tumour margin. The value of τ associated with the best classifier as identified through ten-fold cross-validation, which involved variance of the radial gradient histogram only, was 0.143 (p = 0.183). Because of the limited sample size of 91 and the preliminary nature of this analysis, only the CEIP involved in the predictor or classifier of the HEIP BI-RADS descriptors is reported.
Our results demonstrate that breast MRI CEIP can replicate some HEIP of breast cancer. In our study, we evaluated four HEIP characteristics: lesion size, lesion shape, internal enhancement, and margin. Specifically, CEIP in our study was able to replicate radiologists’ assessment of tumour size and shape. There was a correlation between the CEIP and radiologists’ measurement of tumour margins; however, it was not statistically significant.
For radiologists in the era of precision medicine, the quantification and precision of radiology using computerised MRI analysis (i.e., radiomics) are a rapidly evolving field of interest, paralleling the pace of technologic innovation outside healthcare. Computer machine-learning algorithms continue to be developed to facilitate tumour diagnosis (detection and localisation), segmentation (automated and semi-automated), and feature extraction on MRI. Radiomics of breast cancer on MRI, which involves the extraction of quantitative imaging features (also known as CEIP), have the potential to provide further insight into tumour imaging phenotypes that are imperceptible to the human eye [32, 33]. In contrast to the qualitative nature of HEIP, these quantitative characteristics of CEIP may improve the reproducibility, accuracy and predictive power of existing subjective BI-RADS terms. Furthermore, computers do not suffer from fatigue, distraction or hunger, which are known to affect the clinical performance of humans [34, 35]. In addition, to the advantages mentioned above, CEIP may be able to identify new image features that could become biomarkers of tumour biology, predictors of treatment response or surrogates for genetic testing [32, 33].
Some correlations between CEIP and breast cancer subtype and genotype have been reported in the literature. Bhooshan et al  developed CEIP as MRI-based prognostic markers, distinguishing between ductal carcinoma in situ and invasive breast cancer and between breast cancer with and without positive lymph nodes. They  also demonstrated CEIP in the classification of breast cancer tumour grades. Mazurowski et al  found that the Luminal B molecular subtype is associated with enhancement dynamics on MRI. Sutton et al  developed a predictive model using features extracted from MRI that could distinguish between invasive ductal carcinoma molecular subtypes. Agner et al  reported that CEIP could identify triple-negative breast cancers and could differentiate them from other molecular subtypes. Sutton et al  developed a model using CEIP that could predict the likelihood of recurrence and magnitude of chemotherapeutic benefit. The TCGA Breast Phenotype Group has related CEIP to clinical features including pathologic stage ; molecular classification of breast cancers (such as luminal A) ; risk of recurrence with research versions of Oncotype DX, MammaPrint and PAM50 multi-gene assays ; clinical phenotype ; and genetics of various pathways .
The results of this study suggest a potential role for CEIP tumour features; however, further research must demonstrate how CEIP can assist, complement, or benefit the human/radiologist. In our study, inter-observer variability in radiologists’ use of tumour BI-RADS features was significant even among experts in breast imaging. The highest correlation was seen with size and shape. These features were also those that the CEIP combinations could successfully replicate. The difficulty in our study for CEIP in replicating HEIP of tumour internal enhancement or margin may be a result of low radiologist inter-observer agreement; it is hard to predict features that radiologists do not agree on. The reason for low inter-observer agreement is unclear, and there is a paucity of literature exploring this . All readers in thus study were fellowship-trained breast-imaging radiologists. Future research could be focused on reproducible quantitative CEIP tumour features and their possible role as the imaging reference standard for BI-RADS HEIP with high inter-reader variability, such as tumour internal enhancement and margin in our study, also possibly concealing clinically important information. Before this can happen, however, the research community must decide on a standardised computer algorithm for tumour segmentation and quantitative image analysis that work across different image protocols, image quality and magnet strength. A CEIP lexicon of breast cancer image features analogous to BI-RADS is needed because features that can be extracted by computers are vast and somewhat abstract to the clinical radiologist who is used to the BI-RADS lexicon. Finally, there may be variability in CEIP tumour assessments analogous to the inter- and intra-observer variability seen with radiologists.
The present study had several limitations. First, this was a retrospective analysis of 91 patients who presented over 6 years across 4 different institutions. Image acquisition took place between 1999 and 2004, and technology has since improved significantly. Most patients had oestrogen receptor- and progesterone receptor-positive breast cancers; therefore, our evaluation was not representative of all immunohistochemical subtypes. In addition, all index lesions were masses, so it remains to be determined if the same results would be seen with cancers presenting as only non-mass enhancement. Second, the standard MRI acquisition parameters varied across institutions and possibly over time. Third, it is uncertain if our results can be extrapolated for different imaging equipment and protocols. Finally, this is a small dataset, and our classifier cannot be assumed to work on a larger dataset.
In conclusion, this study we found that, for some phenotypes, CEIP were able to successfully replicate breast MRI HEIP. Thus we propose that there may be a role in the future for using both whereby the strengths of each are leveraged to optimise diagnostic reporting, decrease inter-observer variability, and perhaps even improve our understanding of breast cancer. As computer algorithms continue to be developed, we foresee that radiology reports in the future will include quantitative metrics besides size, the result of CEIP from validated computer algorithms. This information will be compounded by the continuing improvements in MRI sequences, specifically the ability to perform simultaneously high temporal and spatial resolution imaging. Our study demonstrates the feasibility of a future method where radiomics are part of everyday radiology reporting. Future research will be needed on larger datasets to validate this observation. In addition, the observed trend that the correlation between CEIP and HEIP corresponded to the degree of inter-observer agreement requires further investigation. Large-scale datasets are needed to provide further evidence for the use of CEIP in clinical practice.
Breast Imaging-Reporting and Data System
Computer-extracted image phenotype
Human-extracted image phenotype
Intra-class correlation coefficient
Magnetic resonance imaging
Mean squared deviation
The Cancer Genome Atlas
The Cancer Imaging Archive
Morris EA (2010) Diagnostic breast MR imaging: current status and future directions. Magn Reson Imaging Clin N Am 18:57–74.
Morris E, Comstock C, Lee C et al (2013) ACR BI-RADS® magnetic resonance imaging. In: ACR BI-RADS® Atlas, Breast Imaging Reporting and Data System. American College of Radiology, Reston
Heye T, Merkle EM, Reiner CS et al (2013) Reproducibility of dynamic contrast-enhanced MR imaging. Part II. Comparison of intra- and interobserver variability with manual region of interest placement versus semiautomatic lesion segmentation and histogram analysis. Radiology 266:812–821.
Liu YH, Xu L, Liu LH et al (2014) 3.0T MR-CAD: clinical value in diagnosis of breast tumor compared with conventional MRI. J Cancer 5:585–589.
Kuhl CK, Mielcareck P, Klaschik S et al (1999) Dynamic breast MR imaging: are signal intensity time course data useful for differential diagnosis of enhancing lesions? Radiology 211:101–110.
Chen W, Giger ML, Newstead GM et al (2010) Computerized assessment of breast lesion malignancy using DCE-MRI robustness study on two independent clinical datasets from two manufacturers. Acad Radiol 17:822–829.
Chen W, Giger ML, Lan L, Bick U (2004) Computerized interpretation of breast MRI: investigation of enhancement-variance dynamics. Med Phys 31:1076–1082
Shimauchi A, Giger ML, Bhooshan N et al (2011) Evaluation of clinical breast MR imaging performed with prototype computer-aided diagnosis breast MR imaging workstation: reader study. Radiology 258:696–704.
Li H, Zhu Y, Burnside ES et al (2016) MR imaging radiomics signatures for predicting the risk of breast cancer recurrence as given by research versions of MammaPrint, Oncotype DX, and PAM50 gene assays. Radiology 281:382–391. doi:10.1148/radiol.2016152110
Guo W, Li H, Zhu Y et al, TCGA Breast Phenotype Research Group (2015) Prediction of clinical phenotypes in invasive breast carcinomas from the integration of radiomics and genomics data. J Med Imaging (Bellingham) 2:041007.
Burnside ES, Drukker K, Li H et al (2016) Using computer-extracted image phenotypes from tumors on breast magnetic resonance imaging to predict breast cancer pathologic stage. Cancer 122:748–757.
Clark K, Vendt B, Smith K et al (2013) The Cancer Imaging Archive (TCIA): maintaining and operating a public information repository. J Digit Imaging 26:1045–1057.
The Cancer Genome Atlas Network (2012) Comprehensive molecular portraits of human breast tumours. Nature 490:61–70.
Zhu Y, Qiu P, Ji Y (2014) TCGA-assembler: open-source software for retrieving and processing TCGA data. Nat Methods 11:599–600.
Li H, Zhu Y, Burnside ES et al (2016) Quantitative MRI radiomics in the prediction of molecular classifications of breast cancer subtypes in the TCGA/TCIA data set. NPJ Breast Cancer 2:16012.
Zhu Y, Li H, Guo W, Drukker K, Lan L, Giger ML, Ji Y (2015) Deciphering genomic underpinnings of quantitative MRI-based radiomic phenotypes of invasive breast carcinoma. Sci Rep 5:17787.
Mongkolwat P, Channin DS, Kleper V, Rubin DL (2012) Informatics in radiology: an open-source and open-access cancer biomedical informatics grid annotation and image markup template builder. Radiographics 32:1223–1232.
Eisenhauer EA, Therasse P, Bogaerts J et al (2009) New response evaluation criteria in solid tumours: revised RECIST guideline (version 1.1). Eur J Cancer 45:228–247.
Gilhuijs KG, Giger ML, Bick U (1998) Computerized analysis of breast lesions in three dimensions using dynamic magnetic-resonance imaging. Med Phys 25:1647–1654.
Chen W, Giger ML, Bick U, Newstead GM (2006) Automatic identification and classification of characteristic kinetic curves of breast lesions on DCE-MRI. Med Phys 33:2878–2887.
Chen W, Giger ML, Li H, Bick U, Newstead GM (2007) Volumetric texture analysis of breast lesions on contrast-enhanced magnetic resonance images. Magn Reson Med 58:562–571.
Krippendorff K (2004) Content analysis: an introduction to its methodology. Sage, Thousand Oaks
Efron B, Tibshirani RJ (1993) An introduction to the bootstrap. Chapman & Hall/CRC, Boca Raton
Kendall MG (1938) A new measure of rank correlation. Biometrika 30:81–93
Mann HB, Whitney DR (1947) On a test of whether one of two random variables is stochastically larger than the other. Ann Math Stat 18:50–60
Welch WJ (1990) Construction of permutation tests. J Am Stat Assoc 85:693–698.
Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc B Methodol 57:289–300
Zou H, Hastie T (2005) Regularization and variable selection via the elastic net. J R Stat Soc Series B Stat Methodology 67:301–320
Stone M (1974) Cross-validatory choice and assessment of statistical predictions. J R Stat Soc B Methodol 36:111–147
Statnikov A, Aliferis CF, Tsamardinos I, Hardin D, Levy S (2005) A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis. Bioinformatics 21:631–643
Lehmann EL, Casella G (1998) Theory of point estimation springer texts in statistics, 2nd edn. Springer-Verlag, New York
Mazurowski MA (2015) Radiogenomics: what it is and why it is important. J Am Coll Radiol 12:862–866.
Grimm LJ (2016) Breast MRI radiogenomics: current status and research implications. J Magn Reson Imaging 43:1269–1278. doi:10.1002/jmri.25116
Basavanhally A, Viswanath S, Madabhushi A (2015) Predicting classifier performance with limited training data: applications to computer-aided diagnosis in breast and prostate cancer. PLoS One 10:e0117900. doi:10.1371/journal.pone.0117900
Giger ML (2010) Update on the potential of computer-aided diagnosis for breast cancer. Future Oncol 6:1–4.
Bhooshan N, Giger ML, Jansen SA, Li H, Lan L, Newstead GM (2010) Cancerous breast lesions on dynamic contrast-enhanced MR images: computerized characterization for image-based prognostic markers. Radiology 254:680–690.
Bhooshan N, Giger M, Edwards D et al (2011) Computerized three-class classification of MRI-based prognostic markers for breast cancer. Phys Med Biol 56:5995–6008.
Mazurowski MA, Zhang J, Grimm LJ, Yoon SC, Silber JI (2014) Radiogenomic analysis of breast cancer: luminal B molecular subtype is associated with enhancement dynamics at MR imaging. Radiology 273:365–372.
Sutton EJ, Dashevsky BZ, Oh JH et al (2016) Breast cancer molecular subtype classifier that incorporates MRI features. J Magn Reson Imaging 44:122–129.
Agner SC, Rosen MA, Englander S et al (2014) Computerized image analysis for identifying triple-negative breast cancers and differentiating them from other molecular subtypes of breast cancer on dynamic contrast-enhanced MR images: a feasibility study. Radiology 272:91–99.
Sutton EJ, Oh JH, Dashevsky BZ et al (2015) Breast cancer subtype intertumor heterogeneity: MRI-based features predict results of a genomic assay. J Magn Reson Imaging 42:1398–1406.
EJS and EAM were supported in part through National Institutes of Health (NIH)/National Cancer Institute (NCI) Cancer Center Support grant P30 CA008748. KD, HL and MLG received support from NIH/NCI grant U01CA195564. The sponsor had no involvement in the design of the study; in the collection, analysis or interpretation of data; or in the writing of the manuscript.
Availability of data and materials
In our present study, we retrieved data from an open-source de-identified database, The Cancer Imaging Archive (TCIA), which is the imaging counterpart of The Cancer Genome Archive (TCGA). Clinical, pathologic and genomic data were extracted using TCGA assembler, an open-source, publicly available, free tool (http://www.nature.com/nmeth/journal/v11/n6/full/nmeth.2956.html). All breast MRI data were downloaded from the Breast Cancer Risk Assessment collection within TCIA (http://www.cancerimagingarchive.net/).
Ethics approval and consent to participate
In this retrospective study, all patient data were Health Insurance Portability and Accountability Act-compliant and acquired under institutional review board approval that waived the need for informed consent.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Sutton, E.J., Huang, E.P., Drukker, K. et al. Breast MRI radiomics: comparison of computer- and human-extracted imaging phenotypes. Eur Radiol Exp 1, 22 (2017). https://doi.org/10.1186/s41747-017-0025-2
- Magnetic resonance imaging
- Breast cancer
- Inter-observer variability
- Machine learning