Skip to main content

Machine learning for predicting the risk stratification of 1–5 cm gastric gastrointestinal stromal tumors based on CT

Abstract

Backgroud

To predict the malignancy of 1–5 cm gastric gastrointestinal stromal tumors (GISTs) by machine learning (ML) on CT images using three models - Logistic Regression (LR), Decision Tree (DT) and Gradient Boosting Decision Tree (GBDT).

Methods

231 patients from Center 1 were randomly assigned into the training cohort (n = 161) and the internal validation cohort (n = 70) in a 7:3 ratio. The other 78 patients from Center 2 served as the external test cohort. Scikit-learn software was used to build three classifiers. The performance of the three models were evaluated by sensitivity, specificity, accuracy, positive predictive value (PPV), negative predictive value (NPV) and area under the curve (AUC). Diagnostic differences between ML models and radiologists were compared in the external test cohort. Important features of LR and GBDT were analyzed and compared.

Results

GBDT outperformed LR and DT with the largest AUC values (0.981 and 0.815) in the training and internal validation cohorts and the greatest accuracy (0.923, 0.833 and 0.844) across all three cohorts. However, LR was found to have the largest AUC value (0.910) in the external test cohort. DT yielded the worst accuracy (0.790 and 0.727) and AUC values (0.803 and 0.700) in both the internal validation cohort and the external test cohort. GBDT and LR performed better than radiologists. Long diameter was demonstrated to be the same and most important CT feature for GBDT and LR.

Conclusions

ML classifiers, especially GBDT and LR with high accuracy and strong robustness, were considered to be promising in risk classification of 1–5 cm gastric GISTs based on CT. Long diameter was found the most important feature for risk stratification.

Peer Review reports

Backgroud

Gastrointestinal stromal tumors (GISTs) are neoplasms that arise from Cajal cells of the gastrointestinal tract mesenchyme [1]. Nonetheless, they were reported growing throughout the whole digestive tract; stomach was the site with the highest incidence (50–60%) [2]. As a potentially malignant tumor, although less than 2 cm, about 10–30% of GISTs will develop into malignancy, and the risk of recurrence and metastasis is significantly increased [3].

The National Institutes of Health (NIH) classification system has been proposed to stratify the risk of GISTs. Currently, the modified NIH risk stratification criteria and the latest Chinese consensus guidelines (2017 Edition) of the Chinese Society of Clinical Oncology (CSCO) Expert Committee on GIST divide GISTs into very low, low, intermediate, and high risk groups according to tumor size, location, mitotic index, and whether the tumor ruptures [4, 5]. Very low and low risk GISTs generally have slow growth and a low incidence of recurrence and metastasis, whereas GISTs in intermediate and high risk stages have more invasive behavior [6]. Risk classification also facilitates clinical treatment planning [7]. The 2–5 cm GISTs with very low or low risk can be completely resected with endoscopic technology; however, a surgical operation is also necessary in intermediate or high risk ones [8, 9]. Another report suggested periodic follow-up by endoscopic ultrasound (EUS) for GISTs smaller than 1 cm [10]. However, localized GISTs (larger than 1 cm) with intermediate to high risk warranted resection followed by adjuvant treatment of the lesion with imatinib [10,11,12]. Only 2.2% of gastric GISTs with diameters less than 1 cm have been reported to be considered high-risk, while 1–2 cm GISTs had a malignant risk rate of 10.1% [13]. Most grading systems indicate that GISTs larger than 5 cm have a great tendency for high risk. Therefore, it is clinically meaningful to preoperatively identify high-risk gastric GISTs of 1–5 cm in diameter.

Contrast-enhanced CT (CE-CT) scan can clearly show not only the anatomical structure of gastric mesenchymal tumors, but also the internal and peripheral information of the lesion, including tumor density, necrosis, ulceration, hemorrhage, blood vessels, as well as invasion of surrounding tissues, lymph node metastases, and distant metastases [14, 15].

Plenty of studies predicting the risk stratification of GISTs based on CT imaging have been reported [16,17,18,19,20,21,22]. Tumor size was found to be an independent risk factor, even the only one, for high-risk malignant GISTs [16, 21]. Besides, other features such as percentage of tumor necrosis, growth pattern, intratumor angiogenesis, margins, and enhancement pattern were also demonstrated to be contributive to high-grade GISTs on CT images [17, 19, 22].

Machine learning (ML) algorithm provides the possibility of mining valuable data that have significant and intricate connections among enormous data items. ML algorithms have been applied to disease identification, differential diagnosis and prognosis analysis with outstanding performance and promising prospect [23,24,25,26]. Most previous studies have used univariate or multivariate logistic regression analysis aiming to predict the malignant potential of GIST. To our knowledge, no research has attempted to classify the risk of GISTs using ML classifier. What’s more, our report focuses on tumors up to 5 cm in the gastric site, which is different from studies including large-sized GISTs in different parts of the gastrointestinal tract. In this study, 309 patients’ CT images of gastric GISTs less than 5 cm were collected to assess the malignancy risk using three models - Logistic Regression (LR), Decision Tree (DT) and Gradient Boosting Decision Tree (GBDT).

Methods

Patient selection

This retrospective study was approved by the ethics committee of Tongde Hospital of Zhejiang Province and the need for informed consent was waived (Approval No. 2021-040). Patients with gastric GISTs from two centers (Center 1: Tongde Hospital of Zhejiang Province, Center 2: Zhejiang Hospital) from January, 2012 to September, 2022 were enrolled in this research. The criteria for inclusion were as follows: (a) patients with complete CT images (including unenhanced, arterial and portal venous phase images) within 15 days before surgery; (b) solitary and primary lesions; (c) lesions without neoadjuvant treatment; (d) lesions larger than 1 cm and smaller than 5 cm in the long diameter. (e) patients with detailed clinical data (including age, gender, clinical symptoms and tumor markers). The inclusion and exclusion of patients are shown in Fig. 1. 231 patients from Center 1 were randomly assigned into the training cohort (n = 161) and the internal validation cohort (n = 70) according a 7:3 ratio. Another 78 patients from Center 2 were served as an external test cohort. Clinical characteristics including age, gender, symptoms and tumor markers were collected for each patient. All surgically resected lesions of GISTs were finally divided into a low-grade malignancy group and a high-grade malignancy group. The low-grade malignancy category consisted of GISTs with very low or low risk and the high-grade malignancy group included GISTs with intermediate or high risk. The NIH modified criteria in 2008 [4] were applied for risk stratification.

Fig. 1
figure 1

Flowchart shows inclusion and exclusion criteria for this study

CT examination

All patients underwent abdominal CE-CT examination using two 64-slice spiral CT scanners (Siemens, Forchheim, Germany or Philips Medical Systems, Cleveland, OH, USA). The parameters of CT imaging were set as follows: for Siemens, 120 kV tube voltage, 150–250 mA tube current, 0.5 s tube rotation time, 64 × 0.6 mm detector collimation, 350 × 350 mm field of view, 5 mm section thickness and 1-1.25 mm reconstruction interval; for Philips, 120 kV tube voltage, 200–250 mA tube current, 0.5 s tube rotation time, 64 × 0.625 mm detector collimation, 350 × 350 mm field of view, 5 mm section thickness and 1-1.5 mm reconstruction interval. Subsequently, arterial phase (delay 30–40 s) and the portal venous phase (delay 60–70 s) images were obtained after 2 mL/kg of iodinated contrast medium was injected intravenously at a rate of 3 ml/s.

Image analysis

Two radiologists (Reviewer 1 with 6 years and Reviewer 2 with 13 years of experience in abdominal imaging) independently reviewed CT images and reached final conclusions by consensus without knowledge of the surgical and pathological information of every patient. The determined CT imaging features included (a) the CT attenuation values (Hounsfield units, HU) in unenhancement phase (CTU), (b) in arterial phase (CTA) and (c) in venous phase (CTV) of the tumor, (d) degree of enhancement in arterial phase (DEAP) and (e) in portal venous phase (DEPP), (f) enhanced potentiality in arterial phase (EPa) and (g) in portal venous phase (EPv), (h) long diameter (LD), (i) short diameter (SD), (j) the ratio of long diameter to short diameter (LD/SD), (k) contour (round; oval; irregular), (l) necrosis (yes or no), (m) calcification (yes or no), (n) surface ulceration (yes or no), (o) intratumoral angiogenesis (yes or no) and (p) peripheral enlarged lymph node (LN) (yes or no). The CT attenuation value was measured by drawing the region of interest (ROI) on the same axial slice of the tumor avoiding vessels, calcification, and the necrotic regions. DEAP or DEPP was obtained by subtracting CTU from CTA or CTV respectively. EPa or EPv was equal to DEAP or DEPP divided by CTU. Enlarged lymph node was considered present if the shortest axis diameter of lymph node was more than 10 mm. Some of the CT features referred to our previous report [27]. LD, SD, contour, necrosis, surface ulceration and intratumoral angiogenesis of tumors were the main aspects considered by radiologists when classifying tumors into low- or high-risk GISTs.

Machine learning

Scikit-learn software was used to build three classifiers-DT, GBDT and LR. The detailed methodology is described on the website of official documentation (https://scikit-learn.org/), which has also been applied in our previous research [27]. The three datasets (training, internal validation and external test cohort) had no any intersection in our study. The training dataset aims to train the models, the internal validation cohort aims to adjust parameters and the external test cohort aims to evaluate the model performance. For each model, sensitivity, specificity, accuracy, positive predictive value (PPV), negative predictive value (NPV) and area under the curve (AUC) together with 95% confidence intervals (95% CI) were respectively calculated to assess the performance of each classifier. We observed a significant imbalance between the number of low-risk versus high-risk tumors in the three cohorts. The function of Class_Weight in scikit-learn soft was performed to solve the problem of unbalanced samples. In classification task, small sample categories (high-risk malignant) were assigned high weight and large sample categories (low-risk malignant) assigned low weight.

Grid search strategy for selecting optimal parameters

In order to find the optimal parameters of three models, the grid search strategy in scikit-learn software was used. In the grid search process, 5-fold Cross-Validation (CV) was used to evaluate model performance. Meanwhile, the accuracy was used as an evaluation metric to maximize model performance. The detail of grid search method is described in the model selection module on the website of official documentation (https://scikit-learn.org/stable/model_selection.html#model-selection).

Logistic regression (LR)

LR is the most conventional approach to measure the relationship between discrete response variables and several covariates by estimating probabilities. It can be written as: p = 1/(1 + e− z). z refers to logistic regression model. The response variable can take two values (0 as no and 1 as yes) according to p smaller than 0.5 or not.

The final optimal parameters of LR were set as follows: C = 100, random_state = 12, penalty = ’l1’, solver = ’liblinear’. Other parameter factors were set as default in sklearn software module.

Decision tree (DT)

DT, as a binary method, can be used to classify data by calculating their characteristics. Decision nodes, branches and leaves are the three main components of DT. DT starts with a node and extends to many branches and child nodes, finally to leaves. The criterion used in our model is the Gini’s Diversity Index, which is a measure of node impurity. The standard CART algorithm, implemented using sciki-learn library in Python, was applied to build decision tree.

The parameters set in the DT were: random_state = 0, max_features = 6, max_depth = 6, criterion = ’gini’. Other parameters were set as default in sklearn software module.

Gradient boosting decision tree (GBDT)

GBDT is an ensemble classifier based on bootstrap sampling, which aims to improve the generalization ability and robustness by combining the prediction results of multiple base learners (i.e. weak decision trees). The weight is adjusted with iteration, so that the higher weight will be assigned to the data poorly classified. Totally 15 weak decision trees were created in GBDT model in this study (e.g. a tree is showed in Fig S1).

The following parameter factors were used for GBDT: learning_rate = 0.1, max_depth = 8, random_state = 0, min_samples_leaf = 2. Other parameters were also set as default in the sklearn software.

Performance comparison between radiologists and models

Diagnostic performance differences between the three ML models and the two radiologists were compared in the external test cohort. Before performance comparisons, intra-class correlation coefficients (ICCs) were calculated to assess agreement between the two reviewers.

Feature variable analysis

GBDT and LR showed excellent diagnostic efficiency in predicting risk classification of gastric GISTs on account of the high accuracy and strong robustness. LR is well known for determining the beneficial features to support decision by linear analysis, since the result is easy to explain. Firstly, significant CT features were determined by univariate analysis. Secondly, variable with P less than 0.05 were as the input data to calculate the independent risk factors for high-risk malignant GISTs. In order to find out important features for high-grade malignant GISTs in GBDT, the function of Feature_Importance was performed. The description of feature importance is on the website: https://scikit-learn.org/stable/ modules/ensemble.html#gradient-tree-boosting. According to the official documentation description, individual decision trees in the GBDT model intrinsically performed feature selection by selecting appropriate split points. This information can be used to measure the importance of each feature. The basic idea is: the more often a feature is used in the split points of a tree, the more important that feature is. Subsequently, the feature variables of LR and GBDT were compared.

Statistical analysis

P-P plots and Q-Q plots were used to assess normal distribution of data. Continuous distributed data were showed as mean ± SD, and categorical variables were expressed as n (%). Univariate analysis using t test or Mann-Whitney U test for continuous variables and Fisher’s exact test for categorical variables were performed to compare CT features between the low-grade malignancy and high-grade malignancy groups. Variables with P < 0.05 were considered as significant features and were included in the LR multivariate analysis. The final features with P < 0.05 from multivariate logistic regression model indicated the significant predictors of high risk GISTs. Statistical analyses were performed using SPSS version 22.0 (SPSS Inc., Chicago, IL, USA). A statistically significant difference was defined as two - sided P value < 0.05.

Results

Clinical characteristics of patients

231 patients (109 men and 122 women; mean age, 59.47 ± 10.13 years) from Center 1 and 78 patients (41 men and 37 women; mean age, 62.69 ± 10.78 years) from Center 2 were included in our series. The training cohort enrolled 161 patients with gastric GISTs consisting of 47 high-risk tumors and 114 low-risk ones. 70 patients with GISTs (21 high-risk tumors and 49 low-risk ones) and 78 patients with GISTs (16 high-risk tumors and 62 low-risk ones) constituted the internal validation cohort and the external test cohort, respectively.

Details of the clinical characteristics of three cohorts are shown in Table S1. Results of the univariate analysis indicated that patients in three databases had no significant difference in variables of age, sex, clinical symptom and tumor marker between the low-grade malignancy and the high-grade malignancy groups (all P > 0.05). The clinical characteristics of patients had no contribution to the prediction of malignancy in gastric GISTs.

Univariate analysis of CT data

Results of univariate analysis of CT imaging features is exhibited in Table 1. LD, SD, contour, presence of necrosis and surface ulceration were showed to be significant features in distinguishing two groups in three cohorts. Size in high-grade malignancy GISTs was found larger than that of low-grade ones. Lesions with oval and irregular contours were seen more commonly in the high-grade malignancy group. Necrosis and surface ulceration were more likely to be found in the high-grade group. Intratumoral angiogenesis was significantly different between the two groups in the training and external test cohorts, but not in the internal validation cohort, as shown in Fig. 2. No difference of the remaining CT imaging variables was found in all three cohorts.

Table 1 Univariate analysis of CT features of GISTs in the training cohort, internal validation cohort and external test cohort
Fig. 2
figure 2

CT image of low-grade and high-grade malignant GISTs. (A-B). A low-grade malignant GIST in cardia in an elderly man. (A-B). Axial and sagittal CT scans in portal venous phase show a round mass with 2.5 cm of the long diameter (white arrows). The lesion has a homogeneous enhancement pattern without necrosis, calcification, and intratumoral angiogenesis in tumor. (C-D). A high-grade malignant GIST in gastric body in a middle-aged woman. Axial and coronal CT images in portal venous phase show an irregular neoplasm with 4.7 cm of the long diameter (white arrows). Intratumoral angiogenesis (black arrows) are seen in the lesion and the mass shows heterogeneous enhancement with necrotic portion (*) within the tumor

Model evaluation

Results of diagnostic performance of LR, DT and GBDT are described in Table 2; Fig. 3. GBDT gained the largest sensitivity (0.986), specificity (0.770), accuracy (0.923), PPV (0.639), NPV (0.993) and AUC (0.981) among the three models in the training cohort. Due to the compromise between sensitivity and specificity, accuracy and AUC are considered as better diagnostic indicators. GBDT achieved the largest AUC (0.981 and 0.815) among all three classifiers in the training and internal validation cohorts. Nevertheless, we found that LR had the largest AUC (0.910), followed by GBDT (0.819) and DT (0.700) in the external test cohort. The lowest accuracy (0.790 and 0.727) and AUC values (0.803 and 0.700) were gained in DT model both in the internal validation cohort and the external test cohort. GBDT and LR performed best among the three models and in three cohorts with high accuracy and strong robustness. Confusion matrixes of three models in the training cohort are showed in Fig. 4. Figure S2 and S3 exhibit confusion matrixes of three models in the internal validation cohort and external test cohort, respectively. We also tried to build the stepwise logistic regression models using all CT features or significant features by univariate analysis in the training cohort. The accuracy and AUC together with 95%CI of stepwise logistic regression model using all CT features were 0.789 and 0.770 (0.668–0.860), slightly less than those of LR model built with independent risk factor. In addition, stepwise logistic regression model leaving six significant CT features by univariate analysis yielded 0.727 of accuracy and 0.732 (0.648–0.846) of AUC together with 95%CI, less than those of LR model built in this study as well. Results are showed in Figure S4.

Table 2 Diagnostic performance analysis of LR, DT and GBDT models
Fig. 3
figure 3

Receiver operating characteristic (ROC) curves of three models to predict the risk stratification of gastric GISTs. (A). ROC curve of three models in the training cohort. The largest area under the curve (AUC) was GBDT (0.981), followed by DT (0.883) and LR (0.815). (B). ROC curve of three models in the internal validation cohort. GBDT and LR had the equal AUC (0.815), and DT gained the smallest AUC (0.803). (C). ROC curve of three models in the external test cohort. LR gained the best AUC (0.910), followed by GBDT (0.819) and DT (0.700)

Fig. 4
figure 4

Confusion matrixes of LR (A), DT (B) and GBDT (C) models in the training cohort

Performance comparison between radiologists and models

ICC of 0.83 indicated that the agreement between two radiologists was good. Table 3 displays the two reviewers’ diagnostic performance in the external test cohort. GBDT and LR showed more favorable performances than two radiologists.

Table 3 Results of radiologists’ diagnostic performance in the external test cohort

Feature variable analysis

LD, SD, contour, necrosis, surface ulceration and intratumoral angiogenesis were selected as significant features by univariate analysis to input into multivariate analysis. Table 4 shows that only LD is an independent risk factor for high-grade malignant GISTs (P < 0.001, OR = 1.066). Results of important characteristics rank in GBDT are reported in Fig. 5. LD ranked the most important feature among all CT features with importance score of 0.202, followed by SD (0.175), DEPP (0.115), CTU (0.088) and DEAP (0.064). The remaining features had low importance scores. LD was demonstrated as the only same and most important feature for LR and GBDT in terms of feature variable analysis.

Table 4 Results of feature variable analysis in LR model
Fig. 5
figure 5

Features importance rank in GBDT model. Five top important features were as follows: LD (importance score 0.202), SD (0.175), DEPP (0.115), CTU (0.088) and DEAP (0.064). The remaining features had low importance scores

Discussion

To our best knowledge, this is the first research on the prediction of malignancy in gastric GISTs by machine learning classifiers. In addition, our report focuses on GISTs tumors of 1–5 cm in the gastric, which is different from studies that include large-size GISTs located in various sites of the gastrointestinal tract. This study has the largest sample size among relevant studies, so the reliability of the results can be guaranteed. Various qualitative and quantitative variables extracted from CT signs were inputted into LR, DT and GBDT models. The results of model evaluation were different but not inconsistent among the three cohorts. For the training cohort, GBDT had the greatest sensitivity, specificity, accuracy, PPV, NPV and AUC among the three models. What’s more, GBDT gained the largest AUC in the training and internal validation sets and performed best in all three cohorts in terms of accuracy, although the AUC was not as good as LR in the external test cohort. Furthermore, GBDT and LR showed better performance than the two radiologists. However, the performance of DT was not as outstanding as GBDT and LR. Therefore, GBDT and LR were suggested to be promising ML models for CT-based risk classification prediction of gastric GISTs due to the high accuracy and strong robustness.

GBDT, an ensemble method based on bootstrap sampling, was demonstrated to be a favorable algorithm with high predictive efficiency, as reported in various previous researches [28,29,30,31]. The excellent performance of GBDT classifier is attributed to its ability to optimize the algorithm by increasing the weight of weak decision trees. As a classic ML algorithm for solving two-class classification problem, LR assumes that the data conform to the Bernoulli distribution, and then calculates the parameters through the maximum likelihood function method to achieve two classification. Additionally, probability prediction obtained from LR model can be preferentially utilized to better assist in decision making. However, LR has some limitations. First, logistic regression analysis, which is difficult to fit the true distribution of data, has only linear decision boundary. Second, it may weaken the performance of classification task due to the limitation of overfitting and multicollinearity. In terms of DT, a tree consisting of decision nodes, branches, and leaves, is generated using the training dataset, and the test dataset is classified or predicted. In this study, the ability of DT in predicting the risk grading of gastric GISTs may be weaker than LR and GBDT.

When it comes to feature variable selection, LD was found to be the only common CT feature between LR and GBDT that distinguished for high-grade malignant GISTs in this study. Several studies using multivariate logistic regression analysis revealed that the size of GISTs was the only independent risk factor for differentiation of the high-grade malignant GISTs [16, 21], so did the gastric GISTs with size shorter than 5 cm in our study. Kim et al. [21] reported that for GISTs ≤ 5 cm, it was not possible to identify malignant from benign by tumor size based on CT scan images, which contradicted with our results. It may be related to different grouping definitions and different tumor composition ratios. Mazzei et al. [32] found that the maximum diameter of GISTs with high mitotic index (> 5 mitoses) was larger than that of GISTs with low mitotic index (≤ 5 mitoses), suggesting that the larger the tumor was, the faster it grew and the the higher degree of malignancy it had. However, oval and irregular contours, the presence of necrosis, surface ulceration and intratumoral angiogenesis appeared more frequently in the high-grade malignancy group, similar to other studies [17, 19, 21, 22], however, these features were excluded from the selection of predictors in LR. GBDT algorithm could determine complicated and impalpable feature relationships to support decision-making that may not be detected in logistic regression analysis [33]. In this study, DEPP, CTU and DEAP were selected as important features despite not being significant different factors in univariate analysis. The important features that GBDT concentrated on appear to be unimportant features in traditional statistical method, from which, high predictive performance can be obtained.

Artificially determined CT imaging features were used as the input variables for the three ML models to predict the risk classification of gastric GISTs in this study, and great results were obtained, especially in GBDT and LR. Compared with the diagnostic ability of radiologists, ML achieved more promising results, which may have a guiding prospect for doctors in daily diagnostic work, especially for the junior ones. It may promisingly provide theoretical and practical support for texture analysis or deep learning since ML may play an important role in feature selection.

There are some limitations in our study. First, our sample size was small for ML. ML classifiers can highlight their advantages in the context of large data, amounts of predictor variables or complex relationship. Second, four risk grades were finally divided into two, so the results were unable to meet the requirement of each risk classification. Simple ML model cannot meet the needs of predicting four risk levels, but the convolutional neural network can, which puts the next step of research on the agenda. Third, only three simple ML models were implemented in our research, including the classic LR. We will try other more complex ML models to assess the risk stratification, such as random forest, support vector machine, k-nearest neighbors, etc. Fourth, radiomics, which transforms medical images into high-dimensional data by extracting tumor’s shape, intensity, and texture features, has recently shown great potential in aiding clinical decision making. Developing CT-based radiomics models for GIST risk stratifcation will be a future work.

Conclusions

In summary, GBDT and LR showed outstanding performance with high accuracy and strong robustness in the risk assessment of gastric GISTs less than 5 cm on CT imaging. The long diameter of lesion was found to be the most important feature for risk stratification.

Data Availability

All datasets presented and analyzed in this study were interpreted and provided by the corresponding author.

Abbreviations

AUC:

Area under the curve

CE-CT:

Contrast-enhanced CT

CSCO:

Chinese Society of Clinical Oncology

CTU/CTA/CTV:

CT attenuation value in unenhancement phase/arterial phase /venous phase

CV:

Cross-Validation

DEAP/ DEPP:

Degree of enhancement in arterial phase/venous phase

DT:

Decision tree

EPa/EPv:

Enhanced potentiality in arterial phase/venous phase

GBDT:

Gradient boosting decision tree

GISTs:

Gastric gastrointestinal stromal tumors

HU:

Hounsfield unit

ICC:

Intra-class correlation coefficient

LD:

Long diameter

LN:

Peripheral enlarged lymph nodes

LR:

Logistic regression

ML:

Machine learning

NCCN:

National Comprehensive Cancer Network

NPV:

Negative predictive value

PPV:

Positive predictive value

SD:

Short diameter

References

  1. Blay JY, Kang YK, Nishida T, et al. Gastrointestinal stromal tumours. Nat Rev Dis Primers. 2021; 7(1): 22.

  2. Aghdassi A, Christoph A, Dombrowski F, et al. Gastrointestinal stromal tumors: clinical symptoms, location, metastasis formation, and associated malignancies in a single center retrospective study. Dig Dis. 2018;36(5):337–45.

    Article  PubMed  Google Scholar 

  3. Søreide K. Cancer biology of small gastrointestinal stromal tumors (< 2 cm): what is the risk of malignancy? Eur J Surg Oncol. 2017;43(7):1344–9.

    Article  PubMed  Google Scholar 

  4. Joensuu H. Risk stratification of patients diagnosed with gastrointestinal stromal tumor. Hum Pathol. 2008;39:1411–9.

    Article  PubMed  Google Scholar 

  5. Li J, Ye Y, Wang J, Zhang B, et al. Chinese society of clinical oncology csco expert committee on gastrointestinal stromal tumor. Chinese consensus guidelines for diagnosis and management of gastrointestinal stromal tumor. Chin J Cancer Res. 2017;29(4):281–93.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Nishida T, Blay JY, Hirota S, Nishida T, Blay JY, Hirota S, Kitagawa Y, Kang YK, et al. The standard diagnosis, treatment, and follow-up of gastrointestinal stromal tumors based on guidelines. Gastric Cancer. 2016;19(1):3–14.

    Article  CAS  PubMed  Google Scholar 

  7. Farag S, Smith MJ, Fotiadis N, et al. Revolutions in treatment options in gastrointestinal stromal tumours (GISTs): the latest updates. Curr Treat Options Oncol. 2020;21(7):55.

    Article  PubMed  PubMed Central  Google Scholar 

  8. Gao RP, Zhang YP, Li QM. How to Approach submucosal lesions in the gastrointestinal tract: different ideas between China and USA. Gastroenterol Res Pract. 2022;2022:8635387.

    Article  PubMed  PubMed Central  Google Scholar 

  9. Joo MK, Park JJ, Lee YH, et al. Clinical efficacy and safety of endoscopic treatment of gastrointestinal stromal tumors in the stomach. Gut Liver. 2023;17:217–25.

    Article  PubMed Central  Google Scholar 

  10. Caturegli I, Raut CP. Gastrointestinal stromal tumors and the General Surgeon. Surg Clin North Am. 2022;102(4):625–36.

    Article  PubMed  Google Scholar 

  11. Casali PG, Blay JY, Abecassis N, ESMO Guidelines Committee, et al. Gastrointestinal stromal tumours: ESMO-EURACAN-GENTURIS clinical practice guidelines for diagnosis, treatment and follow-up. Ann Oncol. 2022;33(1):20–33.

    Article  CAS  PubMed  Google Scholar 

  12. Blay JY, Hindi N, Bollard J, et al. SELNET clinical practice guidelines for soft tissue sarcoma and GIST. Cancer Treat Rev. 2022;102:102312.

    Article  CAS  PubMed  Google Scholar 

  13. Jia X, Wan L, Chen X, et al. Risk stratification for 1- to 2-cm gastric gastrointestinal stromal tumors: visual assessment of CT and EUS high-risk features versus CT radiomics analysis. Eur Radiol. 2022. https://doi.org/10.1007/s00330-022-09228-x. Online ahead of print.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Liu M, Liu L, Jin E. Gastric sub-epithelial tumors: identification of gastrointestinal stromal tumors using CT with a practical scoring method. Gastric Cancer. 2019;22(4):769–77.

    Article  PubMed  Google Scholar 

  15. Vernuccio F, Taibbi A, Picone D, et al. Imaging of gastrointestinal stromal tumors: from diagnosis to evaluation of therapeutic response. Anticancer Res. 2016;36(6):2639–48.

    CAS  PubMed  Google Scholar 

  16. Yang TH, Hwang JI, Yang MS, et al. Gastrointestinal stromal tumors: computed tomographic features and prediction of malignant risk from computed tomographic imaging. J Chin Med Assoc. 2007;70(9):367–73.

    Article  PubMed  Google Scholar 

  17. Yang D, Ren H, Yang Y, et al. Risk stratification of 2- to 5-cm gastric stromal tumors based on clinical and computed tomography manifestations. Eur J Radiol. 2022;157:110590.

    Article  PubMed  Google Scholar 

  18. Chen Z, Yang J, Sun J, et al. Gastric gastrointestinal stromal tumours (2–5 cm): correlation of CT features with malignancy and differential diagnosis. Eur J Radiol. 2020;123:108783.

    Article  PubMed  Google Scholar 

  19. Iannicelli E, Carbonetti F, Federici GF, et al. Evaluation of the relationships between computed tomography features, pathological findings, and prognostic risk assessment in gastrointestinal stromal tumors. J Comput Assist Tomogr. 2017;41(2):271–8.

    Article  PubMed  Google Scholar 

  20. Zhang X, Bai L, Wang D, et al. Gastrointestinal stromal tumor risk classification: spectral CT quantitative parameters. Abdom Radiol (NY). 2019;44(7):2329–36.

    Article  PubMed  Google Scholar 

  21. Kim HC, Lee JM, Kim KW, et al. Gastrointestinal stromal tumors of the stomach: CT findings and prediction of malignancy. AJR Am J Roentgenol. 2004;183(4):893–8.

    Article  PubMed  Google Scholar 

  22. Xu J, Zhou J, Wang X, et al. A multi-class scoring system based on CT features for preoperative prediction in gastric gastrointestinal stromal tumors. Am J Cancer Res. 2020;10(11):3867–81.

    PubMed  PubMed Central  Google Scholar 

  23. Ouhmich F, Agnus V, Noblet V, et al. Liver tissue segmentation in multiphase CT scans using cascaded convolutional neural networks. Int J Comput Assist Radiol Surg. 2019;14(8):1275–84.

    Article  PubMed  Google Scholar 

  24. Kuo KM, Talley PC, Huang CH, et al. Predicting hospital-acquired pneumonia among schizophrenic patients: a machine learning approach. BMC Med Inform Decis Mak. 2019;19(1):42.

    Article  PubMed  PubMed Central  Google Scholar 

  25. Kamogashira T, Fujimoto C, Kinoshita M, et al. Prediction of vestibular dysfunction by applying machine learning algorithms to postural instability. Front Neurol. 2020;11:7.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Diller GP, Kempny A, Babu-Narayan SV, et al. Machine learning algorithms estimating prognosis and guiding therapy in adult congenital heart disease: data from a single tertiary centre including 10019 patients. Eur Heart J. 2019;40(13):1069–77.

    Article  PubMed  PubMed Central  Google Scholar 

  27. Wang J, Xie Z, Zhu X, et al. Differentiation of gastric schwannomas from gastrointestinal stromal tumors by CT using machine learning. Abdom Radiol (NY). 2021;46(5):1773–82.

    Article  PubMed  Google Scholar 

  28. Chiew CJ, Liu N, Tagami T, et al. Heart rate variability based machine learning models for risk prediction of suspected sepsis patients in the emergency department. Med (Baltim). 2019;98(6):e14197.

    Article  Google Scholar 

  29. Li W, Liu W, Hussain Memon F, et al. An external-validated prediction model to predict lung metastasis among osteosarcoma: a multicenter analysis based on machine learning. Comput Intell Neurosci. 2022; 2022:2220527.

  30. Mollalo A, Vahedi B, Bhattarai S, et al. Predicting the hotspots of age-adjusted mortality rates of lower respiratory infection across the continental United States: integration of GIS, spatial statistics and machine learning algorithms. Int J Med Inform. 2020;142:104248.

    Article  PubMed  PubMed Central  Google Scholar 

  31. Wang Y, Lei L, Ji M, et al. Predicting postoperative delirium after microvascular decompression surgery with machine learning. J Clin Anesth. 2020;66:109896.

    Article  PubMed  Google Scholar 

  32. Mazzei MA, Cioffi Squitieri N, Vindigni C, et al. Gastrointestinal stromal tumors (GIST): a proposal of a “CT-based predictive model of Miettinen index” in predicting the risk of malignancy. Abdom Radiol (NY). 2020;45(10):2989–96.

    Article  CAS  PubMed  Google Scholar 

  33. Saeys Y, Inza I, Larrañaga P. A review of feature selection techniques in bioinformatics. Bioinformatics. 2007;23(19):2507–17.

    Article  CAS  PubMed  Google Scholar 

Download references

Acknowledgements

Not applicable.

Funding

This research was supported by Zhejiang Provincial Natural Science Foundation of China under Grant No.LGF21H030004.

Author information

Authors and Affiliations

Authors

Contributions

CZ, JW, HJY designed the research. JW, HJY provided the administrative support for this research. CZ, YY, BLD, ZHX, FMZ were responsible for data collection. CZ, YY, BLD, ZHX, FMZ were responsible for data analysis and interpretation. CZ wrote the first draft. All authors reviewed the analyses and drafts of this manuscript and approved its final version.

Corresponding author

Correspondence to Huajun Yu.

Ethics declarations

Ethics approval and consent to participate

This retrospective study was approved by ethics committee of Tongde Hospital of Zhejiang Province (Approval No: 2021-040). Written informed consent was waived by the ethics committee of Tongde Hospital of Zhejiang Province. All methods were carried out in accordance with relevant guidelines and regulations.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Supplementary Material 2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhang, C., Wang, J., Yang, Y. et al. Machine learning for predicting the risk stratification of 1–5 cm gastric gastrointestinal stromal tumors based on CT. BMC Med Imaging 23, 90 (2023). https://doi.org/10.1186/s12880-023-01053-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12880-023-01053-y

Keywords