- Open Access
Prediction of BRCA gene mutation status in epithelial ovarian cancer by radiomics models based on 2D and 3D CT images
BMC Medical Imaging volume 21, Article number: 180 (2021)
The objective of this study is to explore the value of two-dimensional (2D) and three-dimensional (3D) radiomics models based on enhanced computed tomography (CT) images in predicting BRCA gene mutations in patients with epithelial ovarian cancer.
The clinical and imaging data of 106 patients with ovarian cancer confirmed by surgery and pathology were retrospectively analyzed and genetic testing was performed. Radiomics features extracted from the 2D and 3D regions of interest of the patients’ primary tumor lesions were selected in the training set using the maximum correlation and minimum redundancy method. Then, the best features were selected through Lasso tenfold cross-validation. Feature subsets were employed to establish a radiomics model. The model’s performance was evaluated via area under the receiver operating characteristic curve analysis and its clinical validity was assessed by using the model’s decision curve.
On the validation set, the area under the curve values of the 2D, 3D, and 2D + 3D combined models were 0.78 (0.61–0.96), 0.75 (0.55–0.92), and 0.82 (0.61–0.96), respectively. However, the DeLong test P values between the three pairs of models were all > 0.05. The decision curve analysis showed that the radiomics model had a high net benefit across all high-risk threshold probabilities.
The three radiomics models can predict the BRCA gene mutation in ovarian cancer, and there were no statistically significant differences between the prediction performance of the three models.
According to data from the International Cancer Research Center, approximately 52,000 new cases of ovarian cancer are diagnosed in China each year, with a high tumor mortality rate [1, 2], and epithelial ovarian cancer accounts for 90% of all ovarian malignancies . Approximately 50% of epithelial ovarian cancers exhibit DNA repair defects through homologous recombination, and BRCA1/2 mutations in germline and somatic cells are the most common mechanism underlying homologous recombination deficiency . Patients with BRCA1/2 mutant ovarian cancer are sensitive to platinum drugs, have a higher progression-free survival period, and have better prognosis [5, 6]. The main method for clinical judgment of BRCA gene mutations is genetic testing at present, but it is costly and time-consuming, and unit sampling cannot cover the entire tumor . Nougaret et al.  predicted BRCA gene status by observing CT features and found that some radiomics features are related to gene mutation status, but the judgment of these imaging features was dependent on the observer’s subjective experience.
Radiomics can objectively quantify the relationship between the pixels and the spatial distributions of medical images and fully explore the hidden information in the images that cannot be observed by the naked eyes. In previous studies, radiomics has been used to capture tumors’ inherent heterogeneity and correlate it with potential gene expression types . Extracting radiomics features requires delineating the region of interest (ROI) of the lesion, which is currently done by two commonly used methods: two-dimensional (2D) and three-dimensional (3D) delineation. However, the advantages and disadvantages of these two delineation methods are still controversial [10, 11]. Therefore, in this study, we explored the predictive value of radiomics models based on different delineation methods for BRCA gene mutations in patients with epithelial ovarian cancer, and compared the prediction performance of each model, to improve the prediction of gene mutation status. In doing so, we strived to seek a model that could provide an ideal and convenient method to sketch the area of interest.
We performed a retrospective analysis of patients with epithelial ovarian cancer who received treatment at our institution from March 2017 to July 2020. We collected clinical pathological data, CT enhanced images before surgery, radiotherapy, and chemotherapy, and postoperative pathological diagnosis. The inclusion criteria were as follows: (1) epithelial ovarian cancer pathologically confirmed by biopsy; (2) abdominal CT scan performed before the operation, and the images including the arterial, venous, and delayed phases; (3) no radiotherapy or chemotherapy performed before the operation; (4) lesion size ≥ 10 mm. The exclusion criteria were as follows: (1) large image artifacts interfering with observation; (2) intolerance to enhanced CT examination; (3) preoperative radiotherapy and chemotherapy; (4) a history of other malignant tumors or pelvic metastases. In total, 106 patients were included in the study (Fig. 1). Among them, 63 cases, aged 35–75 years (average: 54.86 years), were in the BRCA gene non-mutation group; 43 cases aged 36–77 years (average: 53.93 years), were in the BRCA gene mutation group.
Collection of general characteristics
The clinical data of 106 patients, including patient’s age, maximum tumor diameter and tumor markers, were collected and analyzed,.
NGS genetic testing technology was applied on all patients, who had at least one of the following indications: (1) family history of pathogenic BRCA gene mutations; (2) family history of breast cancer before the age of 45 years or triple-negative breast cancer before the age of 60 years; (3) affordability of genetic testing or the doctor’s requirement of genetic testing. To ensure the consistency of the test results, all subjects were tested by the same testing agency (Huada BGI).
The patients fasted for more than 8 h and drank approximately 500–1000 mL of clean water orally 15–30 min before the examination. After the bladder was filled, the abdomen and pelvis were scanned using a GE Discovery CT 750HD (HDCT, USA) scanner in supine position. The scan ranged from the top of the diaphragm or the level of the iliac spine to the symphysis pubis. The tube voltage was 120 kV, the tube current was 280–300 mA, the layer thickness was 5 mm, the reconstruction dimension was 1.25 mm, the layer spacing was 5 mm, and the pitch was 1.375:1. The contrast agent for enhanced scanning was 1.5 mL/kg of iohexol (300 mgI/mL), which was injected through the median cubital vein with a flow rate of 2.5–3.0 mL/s. The contrast agent was injected twice: at 25–30 s and 60–70 s for arterial and venous scanning, respectively.
Image segmentation and feature extraction
The preoperative abdominal enhanced CT image data of all patients were collected from the ICPACS workstation of the CT room at the Imaging Department of our institution and exported in.DICOM format. For all the image modality, the slices were resampled to 1*1*1cm3, and the intensity range were normalized to a mean value of 0 and standard deviation of 1 (z score standardization). Doctor A, a senior diagnostic imaging doctor, used ITK-Snap software (https://www.itksnap.org) to delineate the (ROI) of the lesion in the third phase of enhanced CT. For 2D delineation, the scope included the layer with the largest lesion surface area, and for 3D delineation, the scope covered all possible areas of the target lesion. During the delineation, necrosis, blood vessels, and other structures were eliminated as completely as possible (Fig. 2). Two weeks later, doctor A randomly selected data of 30 patients to perform ROI delineation again to evaluate the intra-group consistency of ROI delineation (intra-ICC). To evaluate the between-group consistency of ROI delineation (inter-ICC), another radiologist, B, also drew the ROIs on the 30 patients’ images independently.
A.K. software (GE Healthcare, AnalysisKit; Version: 3.2.0.R) conforming to the IBSI standard was used to extract features from the ROIs outlined in each image. A total of 681 image biomarkers were obtained in each phase (non-mutation group labeled PA1-63, marked as 0; mutation group labeled PA64-106, marked as 1), including histogram, grayscale interconnected area matrix, grayscale co-occurrence matrix, morphology, run-length matrix, and wavelet change (wavelet) and Laplacian change (ln) characteristics.
Feature screening and model establishment
The features of the arterial phase, venous phase, and delay phase were integrated into 2043 features. The data were randomly divided into the training and test sets in a 7:3 ratio. In the training set, maximum correlation minimum redundancy (mRMR) and LASSO (least absolute shrinkage and selection operator) tenfold cross-validation, were used to select features. First, mRMR was performed to remove redundant and irrelevant features, incorporate the retained features into the LASSO regression, find the value of hyperparameter λ that minimized the binomial deviation, and then retain the subset of features whose coefficients were not 0 to construct the final model.
Statistical analysis and processing
The chi-square test or Mann–Whitney U test was used to check for significant differences in general features and clinical characteristics between the two groups. The area under curve (AUC), model accuracy, precision, sensitivity, specificity, negative predictive value, and positive predictive value were analyzed to evaluate the radiomics models’ predictive ability. Finally, decision curve analysis was used to evaluate the models’ clinical applicability. All statistical analyses were performed using R software (version 3.6.1, https://www.r-project.org).
For general clinical characteristics, we used the Chi-square test and the Kruskal–Wallis H test to analyze abnormally distributed continuous variables. P values < 0.05 were considered statistically significant. All statistical analyses were performed using R software (version 3.6.1) and the Python programming language (version 3.5.6).
The intraclass correlation coefficients (ICCs) of each texture feature were calculated to evaluate the within- and between-observer changes in the texture features extracted by the ROI segmentation and to explain each feature’s reproducibility, according to the following scale: ICC < 0.4, poor; 0.59 > ICC ≥ 0.4, fair; 0.75 > ICC ≥ 0.6, good; and ICC ≥ 0.75, excellent. Features with ICC ≥ 0.8 were considered stable and included in further analyses.
No statistically significant differences in age composition, maximum tumor diameter, or tumor markers were found among the three groups (P > 0.05; Table 1).
Construction and verification of prediction models
The ICC ranges within and between the 2D-outlined feature data groups were (− 0.18–1.00) and (− 0.08–1.00), respectively. In total, 531 features had ICC values of > 0.75 across the two groups. In the 3D-outlined feature data group, the ICC ranges within and between groups were (− 0.06–1.00) and (− 0.40–1.00), respectively. In total, 548 features had ICC values of > 0.75 across the two groups.
We then used the mRMR and LASSO regression models to perform feature screening on features with ICC values of > 0.75 in the 2D, 3D, and 2D + 3D models. For the 2D images, the binomial deviation was smallest at the optimal tuning parameter value of ln λ = 0.0118, and the four features whose coefficients were not 0 at that value were retained. For the 3D images, the binomial deviation was smallest at the optimal tuning parameter value of ln λ = 0.0279, and the 13 features whose coefficients were not 0 at that value were retained. For the 2D + 3D images, the binomial deviation was the smallest at the optimal tuning parameter value of ln λ = 0.0294 (Fig. 3a), and the 12 features whose coefficients were not 0 at that value (Fig. 3b) were retained; eight and four were retained from 2 and 3D images, respectively. The corresponding coefficients of the features are shown in Fig. 3c.
Diagnostic efficiency of the devised models
Logistic regression was used to establish radiomics models using the selected features, and the radscore of each case was calculated. Table 2 shows the radscore distribution in the gene mutation and wild-type groups generated by the 2D, 3D, and 2D + 3D models. We used the AUC of the receiver operating characteristic curve and the models’ accuracy, sensitivity, specificity, negative predictive value, and positive predictive value to evaluate their predictive ability.
In the training set, the 2D model’s image AUC was 0.81 (0.71–0.91), the 3D model’s image AUC was 0.80 (0.70–0.90), and the 2D + 3D model’s image AUC was 0.91 (0.84–0.97). In the test set, the 2D model’s image AUC was 0.78 (0.61–0.96), the 3D model’s image AUC was 0.75 (0.55–0.92), and the 2D + 3D model’s image AUC was 0.82 (0.67–0.98) (Table 3). The highest AUC value of the 2D + 3D model indicated its highest diagnostic efficacy (Fig. 4). However, the DeLong test results showed that the P values among the three pairs of models were all > 0.05, suggesting that there were no statistically significant differences in prediction efficiency among the three pairs of models.
Finally, a decision curve was used to evaluate the three models’ clinical effectiveness (Fig. 5). As the radiomics model’s standard net benefit within a range of the high-risk threshold increased, its clinical effectiveness increased. The decision curves show that the clinically effective performance of the 2D + 3D model was much higher than those of the other two models.
The above results indicate that the radiomics model based on 2D and 3D images can effectively predict BRCA gene mutations in patients with epithelial ovarian cancer, although the 2D + 3D model showed no significantly different predictive performance from that of the radiomic models based on 2D or 3D images alone.
Ovarian cancer has the highest mortality rate among gynecological malignancies. More than 70% of patients with ovarian cancer are already in the advanced stage when detected , and there is still a lack of accurate early diagnosis and prevention methods . The use of radiomics can avoid the disadvantages of conventional imaging examinations such as subjectivity, histopathology, and local sampling.
Guo Jianlin et al.  found that the combined radiomics model of arterial phase and venous phase had higher predictive performance than the separate model of each phase. Using the CT features of 539 cases of lung adenocarcinoma, Yooh et al.  analyzed the tumor size, location, volume, density, CT value, and the relationship between pixel-based texture features and gene expression patterns and found that these image features showed good discrimination of fusion-positive from fusion-negative lung adenocarcinomas. In this study, the quantitative features were selected and the prediction model was established using radiomics features extracted from the arterial phase, venous phase, and delay phase of enhanced CT images, and the resulting radiomics scores were strongly correlated with the BRCA gene mutation (r = 0.65, P < 0.001; Fig. 3c). Totally, 2,043 radiomics features were extracted from the CT images of 106 patients with epithelial ovarian cancer, Twelve characteristics were finally selected and the model was constructed via multivariate logistic regression. Among them, the P values of the 3D_Artery_wavelet.HLL_firstorder_Kurtosis, 3D-Delay-wavelet.LHH_IDMN, Venous_wavelet.LLL_firstorder_Range, Delay_wavelet.HHH_glszm_SmallAreaHighGrayLevelEmphasis, Venous_wavelet.HHH_firstorder_MeanAbsoluteDeviation, and 2D_Artery_wavelet.HHH_glszm_ZoneEntropy were less than 0.05, and can be used as independent risk predictors. The values of these characteristics in the mutation group were higher than those in the non-mutation group. For example, Artery_wavelet.HLL_firstorder_Kurtosis is a measure of the “peak” of the value distribution in the image ROI transferred by wavelet from the artery phase. Higher kurtosis means that the quality of the distribution is concentrated in the tail instead of the average, which can be used as a label to predict the mutation status of the BRCA gene. Other detailed explanation of the features are in the appendix (Table 4). The radscore obtained from the mutlvirate logistic regression model was also higher in the mutation group than the wild group in both the 2D and 3D and 2D + 3D model(p < 0.05). Therefore, the radiomics signature established by the selected radiomics feature radscore can be used as a biomarker for BRCA gene mutation prediction.
The study by Lei Xu et al. found that in both univariate and multivariate analyses, 3D image features showed better prediction performance than 2D image features . However, according to Shen et al. , 2D features had better performance. Yet, Lifeng Yang et al.  found that the features derived from a 2D + 3D model showed better prognostic performance than 2D or 3D features alone in predicting overall survival of non-small cell lung cancer. Therefore, we established prediction models using 2D, 3D, and 2D + 3D image features, and found that the AUC values of all three models were over 0.75, and all three models’ exhibited good prediction performance, indicating that the radiomics models can effectively predict BRCA gene mutation status in patients with epithelial ovarian cancer. The 2D + 3D model had the highest AUC value, but the DeLong test results showed no statistical differences in predictive performance between the three pairs of models. The results showed that 2D outlining of the layer with the largest lesion diameter could cover the central region of the tumor and achieve high predictive performance in predicting BRCA gene mutations in ovarian cancer. The 3D outlining images could provide information about tumors’ heterogeneity outside the central area, but this information could not contribute to the prediction of BRCA gene mutations in the present study. Therefore, we believe that when the image is delineated in 2D, with appropriate selection of lesion delineation level, the predictive performance the radiomics model is close to that of the 2D + 3D model. This insight may improve the practical efficiency of establishing radiomics model in clinical practice.
This study had some limitations. First, this study was a single-center retrospective study. The included cases were all from our hospital and therefore lacked external verification. Second, the sample size was small, which may lead to bias in the data. Third, all ROIs in this study were drawn manually, making the results easily affected by subjective factors. Lastly, in this study, there were 63 and 43 patients in the wild-type and mutant groups, respectively. The data of the two groups were unbalanced, and the model may therefore be biased.
In summary, the 2D, 3D, and 2D + 3D radiomics models based on enhanced CT images can effectively predict BRCA gene mutation status in patients with epithelial ovarian cancer and provide a new type of method for clinical evaluation of patients’ genetic mutation status.
Availability of data and materials
The datasets used and/or analyzed in the current study are available from the corresponding author on reasonable request.
Siegel RL, Miller KD, Jemal A. Cancer statistics, 2020. CA Cancer J Clin. 2020;70(1):7–30.
Bray F, Ferlay J, Soerjomataram I, Rebecca L, Lindsey A, Jemal A. Global cancer statistics 2018: CL OBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin. 2018;68(6):394–424.
Kim J, Coffey DM, Creighton CJ, Yu Z, Hawkins SM, Matzuk MM. High grade serous ovarian cancer arises from fallopian tube in a mouse model. Proc Natl Acad Sci U S A. 2012;109(10):3921–6.
Konstantinopoulos PA, Ceccaldi R, Shapiro GI, D’Andrea AD. Homologous recombination deficiency: exploiting the fundamental vulnerability of ovarian cancer. Cancer Discov. 2015;5(11):1137–54.
Byrski T, Huzarski T, Dent R, Marczyk E, Jasiowka M, Gronwald J, Jakubowicz J, Cybulski C, Wisniowski R, Godlewski D, Lubinski J, Narod SA. Pathologic complete response to neoadjuvant cisplatin in BRCA1-positive breast cancer patients. Breast Cancer Res Treat. 2014;147:401–5.
Tutt A, Tovey H, Cheang MCU, Kernaghan S, Kilburn L, Gazinska P, et al. Carboplatin in BRCA1/2-mutated and triple-negative breast cancer BRCAness subgroups: the TNT Trial. Nat Med. 2018;24:628–37.
Vargas HA, Huang EP, Lakhman Y, Ippolito JE, Bhosale P, Mellnick V, Shinagare AB, Anello M, Kirby J, Fevrier-Sullivan B, Freymann J, Jaffe CC, Sala E. Radiogenomics of high-grade serous ovarian cancer: multireader multi-institutional study from the cancer genome atlas ovarian cancer imaging research group. Radiology. 2017;285(2):482–92.
Nougaret S, Lakhman Y, Gönen M. High-grade serous ovarian cancer: associations between BRCA mutation status, CT imaging phenotypes, and clinical outcomes. Radiology. 2017;285(2):472–81.
Aerts HJ, Velazquez ER, Leijenaar RT, Parmar C, Grossmann P, Carvalho S, Bussink J, Monshouwer R, Haibe-Kains B, Rietveld D, Hoebers F, Rietbergen MM, Leemans CR, Dekker A, Quackenbush J, Gillies RJ, Lambin P. Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nat Commun. 2014;5:4006.
Zhao H. et al. TCGA-TCIA-based CT radiomics study for noninvasively predicting epstein-barr virus status in gastric cancer. AJR, 2021, 11.
Lei X, et al. A multi-organ cancer study of the classification performance using 2D and 3D image features in radiomics analysis. Phys Med Biol. 2019;64(21):215009.
Chien J, Poole EM. Ovarian cancer prevention, screening, and early detection: report from the 11th biennial ovarian cancer research symposium. Int J Gynecol Cancer, 2017, 27(9S Suppl 5): S20-S22.
Modugno F. Ovarian cancer and high-risk women symposium presenters. Ovarian cancer and high-risk women-implications for prevention, screening, and early detection. Gynecol Oncol. 2003;91(1):15–31.
Chen X, Yang Z, Yang J, Liao Y, Pang P, Fan W, Chen X. Radiomics analysis of contrast-enhanced CT predicts lymphovascular invasion and disease outcome in gastric cancer: a preliminary study. Cancer Imaging. 2020;20(1):24.
Hyun JY, et al. Decoding tumor phenotypes for ALK, ROS1, and RET fusions in lung adenocarcinoma using a radiomics approach. Medicine. 2015;94(41):e1753.
Xu L, Yang P, Yen EA, Wan Y, Jiang Y, Cao Z, Shen X, Wu Y, Wang J, Luo C, Niu T. A multi-organ cancer study of the classification performance using 2D and 3D image features in radiomics analysis. Phys Med Biol. 2019;64(21):215009.
Shen C, Liu Z, Guan M, Song J, Lian Y, Wang S, Tang Z, Dong D, Kong L, Wang M, Shi D, Tian J. 2D and 3D CT radiomics features prognostic performance comparison in non-small cell lung cancer. Transl Oncol. 2017;10(6):886–94.
Yang L, Yang J, Zhou X, Huang L, Zhao W, Wang T, Zhuang J, Tian J. Development of a radiomics nomogram based on the 2D and 3D CT features to predict the survival of non-small cell lung cancer patients. Eur Radiol. 2019;29(5):2196–206.
We thank Richard Lipkin, PhD, from Liwen Bianji, Edanz Group China (www.liwenbianji.cn/ac), for editing the English text of this manuscript.
This work is supported by the National Natural Science Youth Science Fund Project (No. 81501468), by the Fundamental Research Funds for the Central Universities (No. WK9110000002). Application and clinical promotion of pathological classification and FIGO staging of epithelial ovarian cancer based on CT imaging-genomics, supported by Chinese Red Cross Foundation: Leading Elite Scientific Research Project, (XM_HR_YXFN_2021_05_19, 2., June 2021 to June 2022).
Ethics approval and consent to participate
Our institutional review board (IRB) waived written informed consent for this retrospective study which evaluated de-identified data and involved no potential risk to patients. The method used in this study was in accordance with the medical Research Ethics Committee of the First Affiliated Hospital of University of Science and Technology of China (No. 2021-RE-078).
Consent for publication
All authors agree to publish.
All authors have completed the ICMJE uniform disclosure form. The authors have no conflicts of interest to declare. We declare that we do not have any commercial or associative interest that represents a conflict of interest in connection with the work submitted.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Mingzhu, L., Yaqiong, G., Mengru, L. et al. Prediction of BRCA gene mutation status in epithelial ovarian cancer by radiomics models based on 2D and 3D CT images. BMC Med Imaging 21, 180 (2021). https://doi.org/10.1186/s12880-021-00711-3
- Ovarian cancer
- BRCA gene