- Research
- Open access
- Published:
Development of a multi-phase CT-based radiomics model to differentiate heterotopic pancreas from gastrointestinal stromal tumor
BMC Medical Imaging volume 24, Article number: 44 (2024)
Abstract
Background
To investigate whether CT-based radiomics can effectively differentiate between heterotopic pancreas (HP) and gastrointestinal stromal tumor (GIST), and whether different resampling methods can affect the model’s performance.
Methods
Multi-phase CT radiological data were retrospectively collected from 94 patients. Of these, 40 with HP and 54 with GISTs were enrolled between April 2017 and November 2021. One experienced radiologist manually delineated the volume of interest and then resampled the voxel size of the images to 0.5 × 0.5 × 0.5 mm3, 1 × 1 × 1 mm3, and 2 × 2 × 2 mm3, respectively. Radiomics features were extracted using PyRadiomics, resulting in 1218 features from each phase image. The datasets were randomly divided into training set (n = 66) and validation set (n = 28) at a 7:3 ratio. After applying multiple feature selection methods, the optimal features were screened. Radial basis kernel function-based support vector machine (RBF-SVM) was used as the classifier, and model performance was evaluated using the area under the receiver operating curve (AUC) analysis, as well as accuracy, sensitivity, and specificity.
Results
The combined phase model performed better than the other phase models, and the resampling method of 0.5 × 0.5 × 0.5 mm3 achieved the highest performance with an AUC of 0.953 (0.881-1), accuracy of 0.929, sensitivity of 0.938, and specificity of 0.917 in the validation set. The Delong test showed no significant difference in AUCs among the three resampling methods, with p > 0.05.
Conclusions
Radiomics can effectively differentiate between HP and GISTs on CT images, and the diagnostic performance of radiomics is minimally affected by different resampling methods.
Introduction
Heterotopic pancreas (HP) is a congenital anomaly in which pancreatic tissue is separate from the main gland and lacks a continuous vascular or ductal connection [1]. HP is most commonly found in the upper gastrointestinal tract, such as the stomach, duodenum, and proximal jejunum, but is rare in other locations such as the esophagus, ileum, and biliary tree. The majority of affected patients are asymptomatic, making HP difficult to diagnose in clinical settings. The lesion is typically discovered incidentally during an unrelated surgery, imaging examination, or at autopsy. Autopsy studies [2–3] have reported an incidence of 0.5-13.7%, while the incidence during upper abdominal surgeries and gastrectomies is 0.2% and 0.9%, respectively.
HP often presents as a submucosal mass of the gastrointestinal tract and can be misdiagnosed as other submucosal tumors, particularly gastrointestinal stromal tumors (GISTs), in medical imaging or endoscopy. GISTs are specific mesenchymal tumors that can develop in different locations throughout the gastrointestinal tract, omentum, and mesentery [4–5]. GISTs are invasive and potentially malignant, with a 20-30% risk of malignancy and poor prognosis when they give rise to abdominal cavities or liver metastases [6–7]. Resection is the only curative treatment for GISTs. In contrast, HP generally does not require treatment unless complications occur.
Computed tomography (CT) is the ideal imaging modality for upper gastrointestinal examination and preoperative evaluation, but the similar appearance of HP and GISTs in CT scans poses a diagnostic challenge. The characteristic radiographic appearance of HP is a small broad-based submucosal mass in the antrum with ill-defined or microlobulated margins, resembling leiomyoma or other submucosal tumors such as GISTs [8,9,10]. HP generally shows obvious enhancement in the late arterial phase and a gradual decrease in the venous and delayed phases on enhanced CT. Lesions in small GISTs exhibits uniform enhancement on enhanced CT. The venous phase enhancement is slightly higher than that in the arterial phase, but detecting changes in GISTs is challenging with the naked eye. As a result, partial patients with HP have received unnecessary operative resection by being misdiagnosed as having GISTs.
Radiomics, first proposed by Lambin et al. [11], allows for in-depth tumor phenotyping and quantification of lesion heterogeneity, providing high-throughput quantitative data from radiological images that are invisible to the human eye. Several studies have explored the potential value of radiomics in abdominal oncology, with promising results for lesion characterization, assessment of therapeutic response, and patient survival [12,13,14]. The multi-phase enhanced CT images show similarities between HP and GISTs, making it difficult for radiologists to visually differentiate. Nevertheless, potential subtle differences in shape, intensity, and texture might be emerging, hinting at the suitability of radiomics approach.
Therefore, this study aims to develop and validate a CT-based radiomics model that can discriminate between HP and GISTs, and to investigate the effect of different resampling parameters on the model’s performance.
Materials and methods
Patients
A total of 94 patients who underwent abdominal CT examinations were included in this study. Among them, 40 with HP and 54 with GISTs were enrolled from April 2017 to October 2021 at single center. The inclusion criteria is as follow: (1) all patients were proved HP or GISTs by pathology after resection. (2) diameters of GISTs lesions were less than 3 cm. (3) all patients underwent abdominal multi-phase CT scan, including: CT plain, arterial, venous and delayed phases. The exclusion criteria is as follow: (1) patients with incomplete clinic-pathological data. (2) poor quality CT scan, including: blurred lesion boundaries and severe artifacts. (3) lack one of the multi-phase CT scan. The retrospective study was approved by our institutional review board, and the need to obtain informed consent was waived. The whole analysis workflow in this study is shown in Fig. 1.
CT protocol
The CT examinations were performed using Siemens Force dual-source 128-row CT scanner and Siemens Definition Flash dual-source 128-row CT scanner with the following parameters: 100mAs tube current, 120 kV tube voltage, 512 × 512 matrix, 1.5 pitch, 1.5 mm layer thickness, and 1 mm layer spacing. Intravenous group injection tracking method was used for contrast agent enhancement, with iodophoresis injection (Shanghai Stellite) selected as the contrast agent, which had an iodine concentration of 350 mg/mL, dose of 80mL, and flow rate of 23mL/s. The scan was delayed for 30, 60, 90, 120s for the arterial phase, venous phase, and delayed phase after the injection was completed.
CT radiomics analysis
One radiologist, Rz.J, with 13 years of experience in abdominal CT manually delineated the volume of interest (VOI) using TIK-SNAP software (version 3.8.0). The VOI covered the entire lesion and excluded the equivocal boundary of the mass. All segmentations were confirmed by another radiologist, Hy.W, with 25 years of experience in abdominal CT. Appropriate adjustments were made after a consensus-based discussion when disagreements arose.
Radiomics was implemented using PyRadiomics, an open-source Python tool package (3.0.1). Prior to this, the voxel size of the images was resampled to 0.5 × 0.5 × 0.5 mm3, 1 × 1 × 1 mm3, and 2 × 2 × 2 mm3, respectively. Gray-level was discretized using a fixed bin width value of 70 with absolute discretization from − 1000 to 3000 Hounsfield units. Radiomics feature extraction was performed separately for the four-phase images (plain, arterial, venous, and delayed phases). Features were extracted from the original image and filtered images, which included Laplacian of Gaussian (LoG) and wavelet filtering. Eventually, 1218 features, including geometry-based, grayscale-based, and texture-based were acquired from each phase image.
Radiomics feature selection and classifier building
The patients were randomly divided into a training set (n = 66) and a validation set (n = 28) at a 7:3 ratio. Prior to feature selection, z-score normalization was applied to the training set. Firstly, predictors with zero variance were removed. Secondly, high-correlational variables with rs > 0.8 were eliminated using the Spearman test. Thirdly, recursive feature elimination (RFE) with a treebagging algorithm was used to analyze the variables, and those with an ‘accuracy’ index were retained via five-fold cross-validation. Lastly, the optimal predictors with non-multicollinearity and anti-overfitting were selected using the least absolute shrinkage and selection operator (LASSO) regression model. These features were preserved via the one standard error of the minimum lambda (λ) of five-fold cross-validation.
Support vector machine (SVM) has several advantages such as good performance on small-scale datasets, rapid computational speed, multiple optional kernels, well generalization, and achieving an optimal global solution with convex optimization problems [15]. In this study, the radial basis kernel function-based SVM (RBF-SVM) was chosen as the main model for analysis. Each phase RBF-SVM model of different resampling methods was constructed to evaluate the diagnostic performance. Combined phase model was constructed by optimal features of each phase via final LASSO selection method.
Statistical analysis
The difference in categorical variables was calculated using the Chi-Square method, while the difference in continuous variables was calculated using t-test or Mann-Whitney U test and described as mean ± standard deviation or median (IQR). The diagnostic performance of radiomics features was evaluated using the area under the receiver operating curve (AUC) analysis, as well as accuracy, sensitivity, and specificity. The AUC values among different phase models were compared using the DeLong method, and a p-value of less than 0.05 was considered statistically significant.
Results
Baseline characteristic
Out of the 94 patients included in the study, 40 had HP and 54 had GIST. The average age of the HP patients was 47.8 ± 14.0 years (ranging from 20 to 71 years), with 23 males and 17 females. The average age of the GIST patients was 58.7 ± 8.3 years (ranging from 41 to 77 years), with 22 males and 32 females. The HP patients in this study tended to be younger than those with GIST. Further details regarding the baseline information can be found in Table 1.
Discrimination performance and feature contribution
All classifiers using different resampling methods showed favorable performance, with the combined phase model performing better than the other phase models. In the resampling of 0.5 × 0.5 × 0.5 mm3 radiomics, the combined phase model achieved an AUC of 0.997 (0.991-1), accuracy of 0.985, sensitivity of 1, and specificity of 0.964 in the training set (Fig. 2). It also achieved an AUC of 0.953 (0.881-1), accuracy of 0.929, sensitivity of 0.938, and specificity of 0.917 in the validation set. The combined phase RBF-SVM comprised of 4 original and 4 wavelet filter features. Out of these, 2 were shape statistics, 2 were first-order statistics, and 4 were textural statistics.
In resampling of 1 × 1 × 1 mm3 radiomics, the combined phase model was the best performer with an AUC of 0.996 (0.989-1), accuracy of 0.970, sensitivity of 0.947, and specificity of 1 in the training set. In the validation set, it achieved an AUC of 0.880 (0.748-1), accuracy of 0.857, sensitivity of 0.938, and specificity of 0.750. This model included 4 original, 1 LoG, and 5 wavelet filter features. Out of these, 2 were shape statistics, 1 was first-order statistics, and 7 were textural statistics.
In the resampling of 2 × 2 × 2 mm3 radiomics, the venous phase model was the best performer with an AUC of 0.943 (0.889–0.996), accuracy of 0.894, sensitivity of 0.947, and specificity of 0.821 in the training set. It achieved an AUC of 0.901 (0.754-1), accuracy of 0.893, sensitivity of 0.875, and specificity of 0.917 in the validation set (Fig. 3). The venous phase RBF-SVM contained 2 original, 1 LoG, and 4 wavelet filter features. Out of these, 1 was shape statistics, 2 were first-order statistics, and 4 were textural statistics. The performance of each phase model is shown in Table 2.
Based on these results, the globally optimal model was the combined phase model originating from the resampling of 0.5 × 0.5 × 0.5 mm3 radiomics, and the optimal hyperparameters are “C” of 0.4 and “gamma” of 1.
AUCs comparison among different resampling methods
Although the models showed similar diagnostic efficacy on different resampling methods, we used the DeLong method to assess whether there were any differences among the AUCs of the models on the validation set. The results showed no significant difference in AUCs, with p-values of 0.177, 0.275 and 0.951 in the combined phase, and 0.295, 1, and 0.276 in the plain phase, respectively. The comparison of the other phases is shown in Table 3. Figure 4 depicts the calibration curve for the best model in different resampling methods and heatmap of the AUCs performance of the different resampling methods and phases. The names of the selected features in each phase model are provided in the supplementary materials.
Overlap radiomics features
There were overlapping radiomics features across different resampling methods on the plain, arterial, and venous phases. In the plain phase, the overlapping feature was the first-order statistic ‘Median’. In the arterial phase, the overlapping feature was the shape statistic ‘Sphericity’. In the venous phase, the overlapping features were the first-order statistic ‘Mean’ and the shape statistic ‘Sphericity’. All these features were derived from the original filter.
Discussion
There are few studies that focus on radiomics to differentiate between HP and GISTs on CT scans. Our study shows that a CT-based radiomics model can be used to distinguish between HP and GISTs, and that different resampling methods do not affect the diagnostic performance of the models. There was no significant difference in the performance of each phase model (p > 0.05). In the resampling of 0.5 × 0.5 × 0.5 mm3, the best model is from combined phase, achieved the best performance with AUC of 0.953. In the resampling of 1 × 1 × 1 mm3, the best model (combined phase) achieved the best performance with AUC of 0.880. In the resampling of 2 × 2 × 2 mm3, the best model (venous phase) achieved the best performance with AUC of 0.901.
The contrast-enhanced CT is the most applicable imaging modality in such diseases, and the imaging presentation of HP correlates with its histological components. HP is often presented as a well-defined intramural ovoid mass on CT images, with a diameter of less than 3 cm. Lesions consisting mainly of acinar tissues tend to have a higher or equal degree of enhancement than the pancreas in situ, whereas lesions consisting mainly of ductal structures and hyperplastic muscular layers tend to have a lower degree of enhancement than the normal pancreas. However, both lesions of HP and GISTs are presented with homogeneous enhancement. In our study, except for the combination phase, the arterial phase model achieved the best performance compared to other phases. The mean AUC in different resampling methods was 0.901 on the validation set, and the remaining phases were 0.867, 0.859, and 0.772, respectively.
As a branch of artificial intelligence (AI), radiomics has become one of the most popular research fields in medical imaging. When radiomics is combined with machine learning, it can exert the best efficacy of a model. Although radiomics has a long way to go before it can be applied in reality, there are some studies that demonstrate its potential value [16,17,18,19]. The essence of radiomics is a large pile of radiomics features, and the feature values between different classes can be mined and analyzed to reflect the status of a disease.
There are several overlapping features across the different resampling methods, with the feature ‘Sphericity’ occupying 50% (2/4). ‘Sphericity’ is defined as a measure of the roundness of the shape of the tumor region relative to a sphere, where a value of 1 indicates a perfect sphere. In our analysis, the values of GISTs were found to be significantly higher than those of HP in the arterial phase (0.5 × 0.5 × 0.5 mm3: 0.730 (0.714–0.759) vs 0.696 (0.663–0.718) [p < 0.001]; 1 × 1 × 1 mm3: 0.778 (0.757–0.810) vs 0.741 (0.706–0.774) [p < 0.001]; 2 × 2 × 2 mm3: 0.815 (0.772–0.841) vs 0.776 (0.742–0.796) [p = 0.001]), as well as in the venous phase (0.5 × 0.5 × 0.5 mm3: 0.724 (0.703–0.750) vs 0.701 (0.660–0.731) [p = 0.005]; 1 × 1 × 1 mm3: 0.765 (0.745-0.800) vs 0.744 (0.703–0.770) [p = 0.008]; 2 × 2 × 2 mm3: 0.807 (0.768–0.847) vs 0.792 (0.737–0.820) [p = 0.024]). This study is consistent with Jang’s report [20]. GISTs are composed of spindle cells, epithelioid cells, or a mixture, and are more likely to be round in shape than HP, which commonly exhibits a slender appearance similar to that of a normal pancreas. According to the report by Li et al. [21], they found that GISTs (42%) were significantly rounder than HP masses (8%). However, this phenomenon disagrees with Yang’s research [22]. For the overlapping feature ‘Median’ in the plain phase, the value in the HP group is higher than that in the GISTs group in resampling of 0.5 × 0.5 × 0.5 mm3 (42.5 (39.5–48) vs 33 (28–38) [p < 0.001]), 1 × 1 × 1 mm3 (43 (39.25–48.25) vs 33 (28–39) [p < 0.001]), and 2 × 2 × 2 mm3 (43 (39–48) vs 33 (27.250-39.625) [p < 0.001]). For the overlapping feature ‘Mean’ in the venous phase, the value in the HP group is also higher than that in the GISTs group in resampling of 0.5 × 0.5 × 0.5 mm3 (86.141 (74.459–102.600) vs 68.975 (61.223–78.842) [p < 0.001]), 1 × 1 × 1 mm3 (85.901 (75.048-103.251) vs 70.538 (61.293–79.036) [p < 0.001]), and 2 × 2 × 2 mm3 (83.729 (73.346-103.957) vs 70.349 (59.756–77.962) [p < 0.001]).
Our study has several limitations. First, the use of multiple CT scanners in this retrospective study resulted in nonconformity of the scanning parameters and volumes of contrast media. To address this issue, we normalized the grey level and resampled the voxel spacing to eliminate the possible adverse effects. Second, the VOI was manually delineated by radiologists, which is time-consuming and subject to some degree of subjectivity. Third, the sample size is small, a prospective, multi-center, and large-scale population is still necessary for further validation., a prospective, multi-center, and large-scale population is still necessary for further validation.
Conclusions
Radiomics can serve as a noninvasive and quantitative imaging biomarker to differentiate between HP and GISTs on CT images, thereby providing clinical guidance to radiologists in decision-making. Furthermore, this study suggests that the diagnostic performance of radiomics is minimally affected by different resampling methods.
Data availability
The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.
Abbreviations
- HP:
-
Heterotopic pancreas
- GIST:
-
Gastrointestinal stromal tumor
- VOI:
-
Volume of interest
- LoG:
-
Laplacian of Gaussian
- RFE:
-
Recursive feature elimination
- LASSO:
-
The least absolute shrinkage and selection operator
- RBF-SVM:
-
Radial basis kernel function-based support vector machine
- AUC:
-
The area under the receiver operating characteristic curve
- AI:
-
Artificial intelligence
References
Rezvani M, Menias C, Sandrasegaran K, Olpin JD, Elsayes KM, Shaaban AM. Heterotopic pancreas: Histopathologic Features, imaging findings, and complications. Radiographics. 2017;37(2):484–99.
Kung JW, Brown A, Kruskal JB, Goldsmith JD, Pedrosa I. Heterotopic pancreas: typical and atypical imaging findings. Clin Radiol. 2010;65(5):403–7.
Wei R, Wang QB, Chen QH, Liu JS, Zhang B. Upper gastrointestinal tract heterotopic pancreas: findings from CT and endoscopic imaging with histopathologic correlation. Clin Imaging. 2011;35(5):353–9.
Fletcher CD, Berman JJ, Corless C, et al. Diagnosis of gastrointestinal stromal tumors: a consensus approach. Hum Pathol. 2002;10(2):81–9.
Rubin BP, Singer S, Tsao C, et al. KIT activation is an ubiquitous feature of gastrointestinal stromal tumors. Cancer Res. 2001;61(22):8118–21.
Fletcher CD, Berman JJ, Corless C, et al. Diagnosis of gastrointestinal stromal tumors: a consensus approach. Hum Pathol. 2002;33(5):459–65.
Koay MH, Goh YW, Iacopetta B, et al. Gastrointestinal stromal tumours (GISTs): a clinicopathological and molecular study of 66 cases. Pathology. 2005;37(1):22–31.
Rooney DR. Aberrant pancreatic tissue in the stomach. Radiology. 1959;73(2):241–4.
Besemann EF, Auerbach SH, Wolf WW. The importance of roentgenologic diagnosis of aberrant pancreatic tissue in the gastrointestinal tract. Am J Roentgenol Radium Ther Nucl Med. 1969;107(1):71–6.
Thoeni RF, Gedgaudas RK. Ectopic pancreas: usual and unusual features. Gastrointest Radiol. 1980;5:37–42.
Lambin P, Rios-Velazquez E, Leijenaar R, et al. Radiomics: extracting more information from medical images using advanced feature analysis. Eur J Cancer. 2012;48(4):441–6.
Lewis S, Hectors S, Taouli B. Radiomics of hepatocellular carcinoma. Abdom Radiol. 2021.
Miranda Magalhaes Santos JM, Clemente Oliveira B, Araujo-Filho JAB, et al. State-of-the-art in radiomics of hepatocellular carcinoma: a review of basic principles, applications, and limitations. Abdom Radiol. 2020;45:342–53.
Kocak B, Durmaz ES, Erdim C, Ates E, Kaya OK, Kilickesmez O. Radiomics of Renal masses: systematic review of reproducibility and validation strategies. AJR Am J Roentgenol. 2020;214(1):129–36.
Jonsson K, Kittler J, Li YP, et al. Support vector machines for face authentication. Image Vis Comput. 2002;20(5–6):369–75.
Hu P, Gao Y, Zhang Y, Sun K. Ultrasound image-based deep learning to differentiate tubal-ovarian abscess from ovarian endometriosis cyst. Front Physiol. 2023; 14.
Sun K, Shi L, Qiu J, Pan Y, Wang X, Wang H. Multi-phase contrast-enhanced magnetic resonance image-based radiomics-combined machine learning reveals microscopic ultra-early hepatocellular carcinoma lesions. Eur J Nucl Med Mol Imaging. 2022;49(8):2917–28.
Zhang L, Sun K, Shi L, Qiu J, Wang X, Wang S. Ultrasound Image-based deep features and Radiomics for the discrimination of small Fat-Poor Angiomyolipoma and Small Renal Cell Carcinoma. Ultrasound Med Biol. 2023;49(2):560–8.
Zhang K, Sun K, Zhang C et al. Using deep learning to predict survival outcome in non-surgical cervical cancer patients based on pathological images. J Cancer Res Clin Oncol. 2023; 1–9.
Jang KM, Kim SH, Park HJ, et al. Ectopic pancreas in upper gastrointestinal tract: MRI findings with emphasis on differentiation from submucosal tumor. Acta Radiol. 2013;54(10):1107–16.
Li LM, Feng LY, Chen XH, Liang P, Li J, Gao JB. Gastric heterotopic pancreas and stromal tumors smaller than 3 cm in diameter: clinical and computed tomography findings. Cancer Imaging. 2018;18:1–9.
Yang CW, Liu XJ, Wei Y, et al. Use of computed tomography for distinguishing heterotopic pancreas from gastrointestinal stromal tumor and leiomyoma. Abdom Radiol. 2021;46:168–78.
Funding
This study has received funding from Taishan Scholar Project of Shandong Province, NSFC (81871354 and 82271993), Academic Promotion Programme of Shandong First Medical University (2019QL023), the Cancer Prevention and Treatment fund of Shandong Province Natural Science Foundation (ZR2019LZL009) and the Key Research and Development Project of Shandong Province (2021SFGC0104).
Author information
Authors and Affiliations
Contributions
KS and YW contributed to study design and manuscript editing. SY contributed to data collection and analysis. RJ contributed to imaging evaluation. RS contributed to the study design, data analysis and manuscript editing. CL, XM and HW contributed to the study design and imaging evaluation. All authors reviewed the manuscript.
Corresponding authors
Ethics declarations
Ethics approval and consent to participate
The study complied with the Declaration of Helsinki guidelines and declaration. The study was approved by the Ethic Committee of Shandong Provincial Hospital Affiliated to Shandong First Medical University. Written inform consent was waived by the Ethic Committee of Shandong Provincial Hospital Affiliated to Shandong First Medical University due to the retrospective nature of the study.
Consent for publication
Not applicable.
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Sun, K., Yu, S., Wang, Y. et al. Development of a multi-phase CT-based radiomics model to differentiate heterotopic pancreas from gastrointestinal stromal tumor. BMC Med Imaging 24, 44 (2024). https://doi.org/10.1186/s12880-024-01219-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s12880-024-01219-2