A deep learning-based radiomics model for predicting lymph node status from lung adenocarcinoma

Xie, Hui; Song, Chaoling; Jian, Lei; Guo, Yeang; Li, Mei; Luo, Jiang; Li, Qing; Tan, Tao

doi:10.1186/s12880-024-01300-w

Research
Open access
Published: 24 May 2024

A deep learning-based radiomics model for predicting lymph node status from lung adenocarcinoma

Hui Xie^1,2,
Chaoling Song³,
Lei Jian³,
Yeang Guo³,
Mei Li³,
Jiang Luo³,
Qing Li¹ &
…
Tao Tan^2,4

BMC Medical Imaging volume 24, Article number: 121 (2024) Cite this article

449 Accesses
1 Altmetric
Metrics details

Abstract

Objectives

At present, there are many limitations in the evaluation of lymph node metastasis of lung adenocarcinoma. Currently, there is a demand for a safe and accurate method to predict lymph node metastasis of lung cancer. In this study, radiomics was used to accurately predict the lymph node status of lung adenocarcinoma patients based on contrast-enhanced CT.

Methods

A total of 503 cases that fulfilled the analysis requirements were gathered from two distinct hospitals. Among these, 287 patients exhibited lymph node metastasis (LNM +) while 216 patients were confirmed to be without lymph node metastasis (LNM-). Using both traditional and deep learning methods, 22,318 features were extracted from the segmented images of each patient's enhanced CT. Then, the spearman test and the least absolute shrinkage and selection operator were used to effectively reduce the dimension of the feature data, enabling us to focus on the most pertinent features and enhance the overall analysis. Finally, the classification model of lung adenocarcinoma lymph node metastasis was constructed by machine learning algorithm. The Accuracy, AUC, Specificity, Precision, Recall and F1 were used to evaluate the efficiency of the model.

Results

By incorporating a comprehensively selected set of features, the extreme gradient boosting method (XGBoost) effectively distinguished the status of lymph nodes in patients with lung adenocarcinoma. The Accuracy, AUC, Specificity, Precision, Recall and F1 of the prediction model performance on the external test set were 0.765, 0.845, 0.705, 0.784, 0.811 and 0.797, respectively. Moreover, the decision curve analysis, calibration curve and confusion matrix of the model on the external test set all indicated the stability and accuracy of the model.

Conclusions

Leveraging enhanced CT images, our study introduces a noninvasive classification prediction model based on the extreme gradient boosting method. This approach exhibits remarkable precision in identifying the lymph node status of lung adenocarcinoma patients, offering a safe and accurate alternative to invasive procedures. By providing clinicians with a reliable tool for diagnosing and assessing disease progression, our method holds the potential to significantly improve patient outcomes and enhance the overall quality of clinical practice.

Peer Review reports

Introduction

Lung cancer is one of the leading causes of cancer-related death worldwide [1]. According to the American Cancer Society (https://www.cancer.org), 350 people die of lung cancer every day in the United States in 2022 [2]. Furthermore, lung cancer is also the malignant neoplasm with the highest incidence and mortality in China [3]. Lung adenocarcinoma comprises about 40% of all lung cancer cases [2]. Thanks to advanced computed tomography techniques of the chest and low-dose screening with computed tomography (CT), many lung cancer patients are detected at an early stage. The early diagnosis of lung cancer is of great significance to improve the treatment level and prognosis of patients. The National comprehensive cancer network (NCCN) guidelines state that surgery should be the first option for patients with early lung cancer [4]. However, lung tumors often involve mediastinal lymph nodes. Lymph node metastasis rates have been reported in 15% to 20% of patients with early-stage non-small cell lung cancer(NSCLC) whose lung tumors are 2 cm or less in diameter [5]. Lymphatic metastasis is the most common metastatic pathway in lung cancer. Clinically, the presence of lymph node metastasis (LNM) is a crucial factor in determining the TNM staging of lung cancer patients. The TNM staging system, which stands for Tumor, Node, and Metastasis, is a widely used method for classifying the severity and extent of lung cancer. Within this system, the status of the lymph nodes plays a pivotal role in assessing the overall clinical stage of the disease, thereby guiding treatment decisions and predicting patient outcomes [6]. Preoperative assessment of the presence of LNM in lung cancer patients can provide valuable information for determining the need for adjuvant therapy and surgery, thus helping clinicians make the right decision.

At present, there are many methods to evaluate LNM in lung cancer patients, including CT, Positron Emission Tomography-CT(PET-CT), ultrasound-guided biopsy, thoracoscopy, etc. [7]. Although biopsy and thoracoscopy can better evaluate lymph node staging, both are invasive. The radiologist determines the LNM status by the size of the lymph nodes on the CT, which obviously has great limitations [8]. PET-CT is a relatively accurate imaging technique with a high specificity for LNM in preoperative lung cancer patients. However, the misdiagnosis and false negative rates of LNM diagnosed by PET-CT are high [9]. In addition, PET-CT scans are often too expensive for most patients, particularly in less economically developed countries, which further restricts their widespread clinical application.

As a well-established field, radiomics can play an important role in the preoperative evaluation of LNM in lung cancer patients [10,11,12]. Radiomics extracts quantifiable features from medical images, such as intensity, texture, and shape descriptors. These features capture the heterogeneity within the tumor, which is often associated with its biological behavior and response to treatment. By analyzing these features, radiomics can assist in diagnosing the presence of malignancy, evaluating the prognosis of patients, and predicting the likely outcome of different treatment options. This non-invasive approach offers clinicians a powerful tool for personalized medicine, enabling more precise and targeted care for lung cancer patients. Yang et al. [10] collected 159 lung adenocarcinoma patients for radiomics analysis and established a prediction model of LNM with an AUC of 0.86. Zheng et al. [11] collected radiological characteristics and clinical parameters of 217 patients with stage I-IIIB NSCLC to predict LNM status, and the test set AUC was 0.71. Huang et al. [12] recruited 155 patients with NSCLC and established a PET-CT radiomics model to predict LNM, with an AUC of 0.847. But we found this kind of studies are based on small data set, and the number of positive cases of low, large sample distribution bias. There are also a large number of studies that lack external test sets when constructing models.

This study aims to achieve seamless integration of radiomics with deep learning(DL), leveraging the efficient feature extraction capabilities of DL to facilitate the development of accurate predictive models, in order to achieve new breakthroughs in the medical field. At the same time, we aim to explore the potentially optimal model through attempting various machine learning modeling methods, and to construct a lung adenocarcinoma lymph node metastasis prediction model by mining radiomics features from CT images. This model has been validated using external data and proven to have clinical auxiliary diagnostic value.

Patients and methods

This study has obtained formal approval from the Institutional Review Board (IRB) of two participating institutions. These institutions are the Medical Research Ethics Committee of Xiangnan University Affiliated Hospital (Clinical College) (Approval Number: AF/SC-07–4/05.0) and the Medical Research Ethics Committee of the People's Hospital of HeBi (Approval Number: 22–350-18018). Both institutions have waived the need for informed consent for this study. This study was conducted following ethical guidelines of World Medical Association (WMA) Declaration of Helsinki. The technology roadmap for the whole study is shown in Fig. 1. The details are as follows.

Data collection

A total of 621 patients with lung adenocarcinoma confirmed by pathological results who underwent contrast-enhanced CT (CECT) thin-layer scanning in the Affiliated Hospital (Clinical College) of Xiangnan University (N = 486) and the People's Hospital of HeBi (N = 135) from January, 2017 to May, 2022 were retrospectively collected. From this, 503 patients were selected. The flow chart of the screening datasets is presented in Fig. 2. All enrolled patients were divided into two groups: 287 patients with lymph node metastasis (LNM +) and 216 patients without lymph node metastasis (LNM-). The Affiliated Hospital of Xiangnan University cohort was selected as the training set and the People's Hospital of HeBi cohort was selected as the test set. Inclusion criteria were as follows: (1). Patients had pathologically confirmed lung adenocarcinoma. (2). Dissection of lymph nodes and pathology showed lymph node status. (3). All patients in our study underwent routine thin-slice CECT of the lungs, with a 5mm slice thickness acquisition that was later reconstructed to 1–1.5mm slices, within a two-week window before their surgical procedure. (4). NULL of patients had received chemotherapy, radiotherapy and other related anti-tumor therapies before surgery. (5). CECT images were complete, and 3D-slicer software could be used to delineate lung cancer lesions smoothly and accurately. The exclusion criteria are as follows: (1). History of chemoradiotherapy is unknown. (2). The patient had no thin-slice CECT images. (3). CT images of patients are missing or incomplete, and the quality of CT images is inadequate. (4). The patient developed distant metastases such as ribs. In this study, all interpretations of CT images were conducted by experienced radiologists specializing in diagnostic radiology, who possess at least 15 years of professional experience.

CT scan protocols

Thin-slice CECT scan was performed on all patients. The screening scanner are the second generation Siemens SOMATOM HD FlashCT scanner (The Affiliated Hospital of Xiangnan University, Somatom Definition, Siemens Medical Solutions, Forchheim, Germany) and the Lighspeed-16 row scanner of GE Company (The People's Hospital of HeBi, GE Healthcare, Waukesha, WI, USA). All patients were examined in the supine position, and the scan range was from the thorax entrance to the posterior costal angle. The single breath was continuously screened after end-inspiratory. Scan parameters and settings were as follows: pitch (1.0mm), tube voltage (120 KVp), and tube current (80–300 mA). All CT image display settings were as follows: Lung window (level of − 600HU and width of 1200HU), mediastinal window (level of 40 HU and width of 350 HU). For the CECT scans, 1.5–2 ml/kg of non-ionic contrast media iohexol (100 ml, Jiangsu Hengrui Medicine Co., Ltd., Nanjing, China) was injected through the elbow vein at a rate of 3.0–3.5ml/s. The scan phase imaging was venous (delay, 90 s). All CT images were exported in Digital Imaging and Communications in Medicine (DICOM) format for radiomics feature extraction.

Data acquisition

Segmentation of the lung tumor region of interest (ROI): Lung tumors were manually delineated on vein-phase CECT by two radiologists with at least 15 years of experience in a blinded manner using the "segmenteditor" in 3D-Slicer (version 4.6, https://www.slicer.org) [13]. Then, a radiologist with the title of the associate chief physician or above reviewed and revised the tumor contours on CECT images one by one as the final gold standard for delineation. The purpose of this was to reduce the variability of manual delineation. If a controversial delineation was encountered, the three people discussed to decide how to delineate the tumor contour. After the manual segmentation, all images were resampled voxel size to 3 × 3 × 3 mm³ for the next step.

The "Pyradiomics" package [14] was used to extract the traditional radiomics features of the lung 3D ROI in python environment. The “Pyradiomics” package incorporates various filters, including exponential, LBP, logarithm, square, and wavelet, to extract diverse quantitative features from medical images. The exponential filter emphasizes specific image regions, LBP captures local texture patterns, logarithm stabilizes data variance, square normalizes data, and wavelet analyzes information across multiple scales. These filters jointly contribute to accurate and efficient feature extraction, enhancing clinical decision-making and research outcomes [14]. The extracted the traditional radiomics features comprehensively characterize the lung ROI. The Gray-Level Co-occurrence Matrix (GLCM) captures spatial relationships between pixel values, effectively quantifying various texture properties. First-Order Features detail the distribution of pixel values, including mean, median, and standard deviation (SD), providing a comprehensive overview of intensity characteristics. The Gray-Level Run-Length Matrix (GLRLM) quantifies the length of consecutive pixels, revealing lung texture uniformity and complexity. The Gray-Level Size Zone Matrix (GLSZM) analyzes the size of connected regions, offering insights into the spatial distribution of lung tissues. The Gray-Level Dependence Matrix (GLDM) explores the dependency between gray levels and their spatial arrangement, capturing intricate patterns and structures. Finally, Shape Features describe the geometric properties of the lung ROI, such as compactness and elongation, providing a holistic representation of its morphological features.

The maximum cross-sectional slices were obtained from the 3D ROI of the lung, and then the averge pool layer of the four DL models [15, 16] (VGG19 [17], resnet101 [18], googlenet [19] and mobilenet_v3_large [20]) were used to extract DL radiomics features. Four machine learning models underwent preliminary training using the 'ImageNet' dataset [21], followed by further training with CT images to enhance their specialization for medical image analysis.

Model construction

The Affiliated Hospital of Xiangnan University cohort (N = 401) was selected as the training set and the People's Hospital of HeBi cohort (N = 102) was selected as the test set. The training set was used to build a prediction model and the test set was used to test this model. Data normalization and filtration were performed prior to statistical analysis. Then, the least absolute shrinkage and selection operator (LASSO) was adopted. L1 regularization shrinks coefficients of less important features to zero by adding the absolute value of magnitude of coefficients as a penalty term to the loss function. Finally, we used support vector machine (SVM), K-nearest neighbor (KNN), randomforest (RF), extreme gradient boosting (XGBoost), light gradient boosting machine (LightGBM), NaiveBayes (NB), adaptive boosting (AdaBoost), gradient boosting (GBDT), logistic regression (LR) and multilayer perceptron (MLP) 10 machine learning algorithms were used to establish classification prediction models by hyperparameter method. The Rad_score of the model can be calculated as follows:

$$Rad\_Score = \sum\limits_{i} {{\text{feature}}_{{\text{i}}} \times {\text{Coefficient}}} + b$$

(1)

Cofficient is obtained by an iterative of LASSO algorithm, and i represents the feature.

Statistical analysis

Python(version: 3.7, http://www.python.org [22]) was used for statistical analysis. Continuous variables were expressed as mean ± standard deviation ($\begin{gathered} \overline{x} \pm s \hfill \\ \hfill \\ \end{gathered}$), and categorical variables were expressed as counts. Differences between groups were analyzed using t-tests and Chi-squared tests. Consistency assessment of radiomics features: The CT images of 50 randomly selected patients were segmented and radiomics features were extracted by the same radiologist twice, with an interval of more than one month. The intra-group (ROI segmentation of 50 patients by two radiologists, respectively) and inter-group (ROI segmentation of 50 patients by two radiologists, respectively) were used. Intraclass Correlation Coefficient (ICC) method was used to evaluate the repeated consistency of features. Features are extracted consistently when ICC is higher than 0.75 [23, 24]. Omics features are standardized with Z-score, and the formula is as follows:

$${\mathrm X}_\_\mathrm{norm}\;=\frac{\left(\mathrm X\;-\;\mathrm{mean}\right)}{\mathrm{std}}$$

(2)

Spearman correlation method was used to preliminary screen the features. Travel through the LASSO algorithm L1 penalty function has been screened again, filtering features built into the model in the end. The classification prediction model was established by using 10 machine learning algorithms (SVM, KNN, RF, XGBoost, LightGBM, NB, AdaBoost, GBDT, LR and MLP). The diagnostic performance of the model was evaluated by Area under the curve (AUC), Accuracy, Specificity, Precision, Reall and F1 score.

$${\text{Accuracy}} = \frac{{{\text{(TP}} + {\text{TN)}}}}{{{\text{(TP}} + FN + FP + TN)}}$$

(3)

$$Specificity = \frac{TN}{{TN + FP}}$$

(4)

$${\text{Precision}} = \frac{{{\text{TP}}}}{{{\text{(TP}} + {\text{FP)}}}}$$

(5)

$${\text{Recall}} = \frac{{{\text{TP}}}}{{{\text{(TP}} + {\text{FN)}}}}$$

(6)

$${\text{F1}} = \frac{{{\text{2TP}}}}{{{\text{(2TP}} + {\text{FP}} + {\text{FN)}}}}$$

(7)

TP True Positive, FN False Negative, FP False Positive, TN True Negative

Decision curve analysis (DCA) was used to test whether the predictive model was clinically significant. In the training set, five-fold cross validation was used to determine the stability of the model, and then all the data from the test set were used to build a complete model to develop the final prediction model. The Hosmer–Lemeshow test was used to generate calibration curves to test whether the predicted results was consistent with the actual results. For each model, the AUC values were tested by Delong tests (p < 0.01 was considered statistically significant).

In the above analysis, there is no special explanation that P < 0.05 indicates statistical difference.

Results

Repeatability assessment of feature extraction

After manual segmentation by 3D-slice software. In the python environment, 1906 traditional radiomics features were extracted from each patient, and the ICC values of intra-group and inter-group were 0.786–0.962 and 0.783–0.833, respectively, which were greater than 0.75, indicating good consistency.

Traditional radiomics feature selection and model construction

There were 401 cases in the training set, including 172 cases of LNM(-) and 229 cases of LNM( +). In the test set of 102 cases, including 44 cases of LNM(-) and 58 cases of LNM( +). Supplementary Table 1 provides further clinical information on the enrolled patients, including age, gender, and details regarding the primary tumor site. Additionally, CT characteristics such as lobulation and Burr sign are also presented in this supplementary Table 1 for a more thorough understanding of the patients' condition. The training set included 1906 traditional radiomics features, and 364 features were obtained after spearman test (threshold was set to 0.9). LASSO was used to further reduce the dimension, when λ = 0.039 (Fig. 3A) the 11 best traditional radiomics features (Fig. 3B) were obtained. Using ten kinds of machine learning algorithms, constructing classification prediction model respectively. The performance evaluation of the ten classification prediction models across both the training and test sets is comprehensively presented in Table 1. Upon careful analysis of the ROC curve depicted in Fig. 3C, it becomes evident that the LR model exhibits superior prediction capabilities among the traditional radiomics-based prediction models. As highlighted in Fig. 3D and summarized in Table 1, LR stands out as the most effective in terms of overall performance. Unfortunately, we observed instances of overfitting in the XGBoost and RF models. This overfitting may have hindered their ability to generalize effectively to unseen data. Furthermore, several other models demonstrated suboptimal performance across various metrics, including Specificity, Precision, Recall, and F1. Therefore, based on the traditional radiomics features, LR algorithm was used to construct a prediction classification model (Rad-modle).

Table 1 The results of 10 traditional radiomics model analysis

Full size table

Deep learning radiomics feature selection and model construction

The maximum cross section of ROI of each patient was fed to four DL models (vgg19, resnet101, googlenet and mobilenet_v3_large) for DL radiomics feature extraction. Vgg19 extracted 16,383 features; resnet101 extracted 2,048 features; googlenet extracted 1,021 features; mobilenet_v3_large extracted 960 features. A total of 20,412 DL radiomics features were obtained. By spearman correlation (threshold was set to 0.9) and LASSO (λ = 0.013, Fig. 4A), finally got 17 DL radiomics features, used for model building (Fig. 4B). Table 2 shows the performance of the 10 machine learning classifiers in the training and test sets. The ROC curves of each classification prediction model in the test set are shown in Fig. 4C.

Table 2 The results of 10 deep learning radiomics model analysis

Full size table

Similar to the traditional radiomics model (Table 1), the RF and XGBoost models exhibited signs of overfitting. The NB, AdaBoost, GBDT, and MLP models did not perform as well as the KNN model. In the test set, the KNN model exhibited comparable performance to both LR and SVM. However, when it comes to metrics such as Accuracy, AUC, Specificity, Precision, and Recall, KNN emerged as the top performer. While KNN slightly trailed behind LR and SVM in terms of the F1, its overall excellence in the majority of evaluation metrics makes it the preferred choice as the final model. Through comprehensive analysis, KNN model based on DL radiomics features performs best. Therefore, based on these DL radiomics features, KNN algorithm was used to construct a prediction classification model (Fig. 4D, DL-modle).

Comprehensive predictive model construction

The extracted traditional radiomics and DL radiomics features were integrated. By spearman correlation (threshold was set to 0.9) and LASSO (λ = 0.039, Fig. 5A), ends up with 14 hub features (Fig. 5B). There were 7 traditional radiomics features: Exponential_firstorder_RobustMeanAbsoluteDeviation, lbp_3D_m1_glszm_GrayLevelNonUniformityNormalized, logarithm_firstorder_10Percentile, original_shape_Flatness, original_shape_Sphericity, Square_firstorder_RobustMeanAbsoluteDeviation and wavelet_HLL_firstorder_RobustMeanAbsoluteDeviation. And, there were 7 DL radiomics features: VGG_1, VGG_2, VGG_3, GOOG_8, GOOG_17, mobilenet_0 and mobilenet_4. Among the 7 DL radiomics features, VGG_1, VGG_2 and VGG_3 were derived from the DL model of VGG19. GOOG_8, GOOG_17 are derived from DL model of googlenet, mobilenet_0 and mobilenet_4 are derived from DL model of mobilenet_v3_large. However, features extracted from DL model of restnet101 were not significantly correlated with LNM of lung adenocarcinoma. These 7 traditional radiomic features are classified into first-order features, gray-level size zone matrix (GLSZM) and shape features. Using these 14 hub features, the classification prediction model were constructed by 10 machine learning algorithms. From Table 3, as well as in the ROC curve (Fig. 5C), we can find XGBoost performance is superior to other models of the model. Therefore, based on the traditional and DL radiomics features, XGBoost algorithm was used to construct a prediction classification model (Fusion-modle, Fig. 5D).

Table 3 The results of 10 prediction model analysis, basis on the traditional and deep learning radiomics

Full size table

Prediction model determination

Despite partial overlap in the confidence intervals depicted in Fig. 6A, upon comprehensive evaluation of various metrics such as prediction accuracy, AUC value, and model stability (Tables 1, 2, and 3), coupled with consideration of practical application needs, we have identified the XGBoost method, leveraging both traditional and DL radiomics features, as exhibiting superior performance across multiple dimensions. Notably, the XGBoost method demonstrates significant advantages over other methods in terms of Accuracy, AUC, Specificity, Precision, Recall, and F1. Consequently, within the context of this study, we can confidently assert that the XGBoost method possesses relatively superior predictive performance. The DCA curve (Fig. 6B) of the three models showed that the fusion model also had the greatest clinical benefit. Therefore, this study identified this model as the final predictive classification model. The calibration curve (Fig. 6C) of the model in the test set showed that the model has strong applicability and high prediction accuracy. As shown in Fig. 6D, the normalized confusion matrix further showed the model’s classification accuracy on the test set.

In addition, the Rad score of this model to predict the LNM of lung adenocarcinoma patients can be calculated as:

Rad_score = 0.5716628665724426.

-0.012359 × exponential_firstorder_RobustMeanAbsoluteDeviation.

-0.008611 × lbp_3D_m1_glszm_GrayLevelNonUniformityNormalized.

+ 0.016666 × logarithm_firstorder_10Percentile.

-0.006898 × original_shape_Flatness.

-0.116110 × original_shape_Sphericity.

-0.010594 × square_firstorder_RobustMeanAbsoluteDeviation.

-0.005452 × wavelet_HLL_firstorder_RobustMeanAbsoluteDeviation.

+ 0.031910 × VGG_1.

+ 0.010605 × VGG_2.

-0.011560 × VGG_3.

-0.005295 × GOOG_8.

+ 0.012453 × GOOG_17.

+ 0.143270 × mobilenet_0.

-0.018727 × mobilenet_4.

Although the results of the DeLong's test reveal no statistically significant difference between the ROC curves of the Fusion-model, Rad-model, and DL-model (Fusion-model VS Rad-model [p-value = 0.116], Fusion-model VS DL-model [p-value = 0.499], DL-model VS Rad-model [p-value = 0.609]), we maintain that the Fusion-model emerges as the superior model in this study due to its seamless integration of diverse techniques or methodologies. This integration potentially enhances generalization capabilities, stability, and reduces specific error types, albeit these benefits did not achieve statistical significance in the ROC curve analysis. Nonetheless, the Fusion-model remains a frontrunner worthy of further investigation and application within our research framework. Its superiority is further underscored by its performance across various evaluation metrics, including Accuracy, Specificity, Precision, Recall, F1, DCA, and calibration curves.

Discussion

LNM is a crucial factor for clinicians to determine the clinical staging of lung cancer, formulate treatment plans, and predict prognosis [6]. Although current medical imaging examination can detect LNM to some extent, an assessment solely on morphological changes is insufficient to provide accurate histopathological information. There is an urgent need for a non-invasive and effective method to evaluate the LNM status of patients. This study compares the traditional, DL, and DL-traditional radiomics models in predicting LNM based on preoperative CECT of lung adenocarcinoma. In cases with larger datasets, DL models have outperformed hand-crafted feature extraction [25]. However, access to large data in the field of medicine is relatively difficult and may be affected by disease prevalence rates, data acquisition, and other clinical factors [26]. For smaller datasets, studies have shown that feature engineering may be more suitable for machine learning strategies [27], and radiomics has advantages in medical imaging analysis. Currently, there are relatively few studies that directly compare the performance of radiomics and DL models [14, 28, 29]. In this study, we predicted the risk of LNM in lung adenocarcinoma patients. We not only verified that the DL-based prediction model (with accuracy, AUC, specificity, precision, recall and F1 in the test set are as follows: 0.755, 0.811, 0.773, 0.811, 0.741and 0.775) was superior to the traditional radiomics model (with accuracy, AUC, specificity, precision, recall and F1 in the test set are as follows: 0.696, 0.782, 0.614, 0.721, 0.759, 0.739), but also the Fusion model (Accuracy, AUC, Specificity, Precision, Recall and F1 in the test set are as follows:0.765, 0.844, 0.705, 0.783, 0.810 and 0.797) obtained by integrating DL and traditional radiomics had better prediction results and improved model interpretability to a certain extent. The DCA curve of external validation intuitively shows that the Fusion model has higher clinical benefit than the other two models. Moreover, the calibration curve verified by external verification proves that the Fusion model is in good agreement with the actual value.

Among the hub features for constructing Fusion model, 7 are traditional radiomics features. In radiomics RobustMeanAbsoluteDeviation is all intensity and gray level between or equal to the average between the 10th and the 90th percentile of the distance between the average, and negatively correlated with metastasis [30]. Our study also confirmed this theory. And in our study RobustMeanAbsoluteDeviation has carried on the wavelet, exponential and square transformation makes the futertes of the multiple correction, and stronger stability [31]. Shen et al. [28]. reported that the RobustMeanAbsoluteDeviation feature played a key role in distinguishing histological subtypes of NSCLC. In addition, Sphericity is also introduced into the modeling. Our study also suggests that Sphericity is negatively correlated with LNM of lung adenocarcinoma. In other words, the more regular and spherical the primary tumor shape, the less likely it is to cause LNM. This is consistent with the studies of others, where smaller Sphericity is more likely to induce LNM in breast cancer and esophageal cancer [29]. In clinical practice, an important feature for radiologists to read CT to judge lung tumors is the burr-like structure [31], which also represents the irregularity of the tumor. Burr-like structure of NSCLC is more aggressive and has poor prognosis [32]. In Supplementary Table 1, it can be observed that there is also a significant difference in Burr sign between the two groups, NLM( +) and NLM(-). It can be inferred that the smaller of Sphericity, the higher malignant degree of lung adenocarcinoma and the greater possibility of LNM. The 10Percentile is the set of intensity voxels in the region of interest, which represents less than 10% of the observations [33]. This research shows that, by the 10Percentile can predict the nature of the lesion, and positive correlation. This means that in the CT pulmonary primary lesion is on the gray scale difference exist in whether LNM. However, the gray difference in the specific area of the lesion needs further research. The study by Folhoffer et al. [34] pointed out that the 10Percentile and the 90Percentile are very useful for the classification of high and low grade fibrosis in the liver. Shen's study also showed that 10Percentile can be used to classify the subtypes of lung cancer [28]. GLSZM is the starting point of the Thibault matrix, which can effectively describe the texture uniformity, non-periodicity or similar texture [35]. It has been proved that the gray level quantization has an important effect on the texture classification performance [36]. The gray level non-uniformity is a radiological texture feature that indicates heterogeneity [37]. It is particularly important that many radiomics features are unstable among different reconstruction algorithms, and GLNU is one of the most reproducible radiomics features with good stability [38]. Recent studies have shown that the value of GLNU increases if the lesion is heterogeneous [35]. Heterogeneity is an important feature of malignant tumors [39], which is closely related to the malignant biological behavior, and can reflect the changes of related growth factors and the microenvironment of tumor growth [40]. The higher the malignant degree of tumors, the higher the heterogeneity [41]. In the study of Yang X et al. [10], GLNU was incorporated into the prediction of LNM status of lung adenocarcinoma, and the AUC in the training and test set were 0.854 and 0.803, respectively. In a previous study, GLNU was also identified as the most important radiomics features in hypertrophic cardiomyopathy [42].

However, in this study, we not only used traditional radiomics features, but also used DL methods. Four DL models (VGG19, resnet10, Googlenet and mobilenet_v3_large) were used to extract DL radiomics features. The integration of DL radiomics features into feature engineering has greatly increased the dimensionality of the data studied (from the original 1,907 dimensions to the later 22,319). Finally, 14 hub features (7 traditional radiomics features, 7 DL radiomics features) were entered into the construction of the model. This method can make the prediction results more reliable. Our study dataset is relatively large, with 503 patients from CECT images routinely acquired in clinical settings, which greatly improves the authenticity of the results. Unlike most current studies, our study used an external data test set. A total of 102 patients from the People's Hospital of HeBi were used as the test set to verify the model. The prediction AUC of the model reached 0.844 in the test set, which had strong robustness. While the Fusion model demonstrates significant superiority in various metrics such as Accuracy, Specificity, Precision, Recall, and F1 score, surpassing both the traditional radiomics model and the DL model, it is important to acknowledge that it does have some limitations. Notably, in the Delong test, the Fusion model did not achieve the desired level of performance. This could be attributed to various factors, including the complexity of the dataset, the specific nature of the test, or potential areas for improvement in the model's architecture or training process. Despite this shortcoming, the Fusion model's overall excellence in other metrics remains compelling and suggests that it holds great potential for further development and optimization.

Nonetheless, it is imperative to recognize that our research possesses inherent limitations that necessitate additional scrutiny and consideration. Firstly, given the strong association between the LNM of lung adenocarcinoma and genetic factors [43], we recognize the need to incorporate genomics data in future studies. The absence of such data in our current investigation limits our understanding of the underlying genetic mechanisms involved in LNM. Secondly, the incomplete follow-up data precluded us from conducting thorough investigations into patient outcomes, which is crucial for assessing the long-term impact of our findings. To address these limitations, we plan to explore various avenues in future research. Firstly, our primary objective is to refine the architecture and hyperparameters of the machine learning model in order to bolster its predictive capabilities. This might involve exploring various neural network structures and meticulously adjusting the learning rate to achieve optimal performance. Secondly, we intend to investigate the generalizability of our model by applying it to different cancer datasets, thereby extending its potential applications to other malignancies. Finally, we are interested in exploring the integration of our model with other clinical and genetic data to develop a more comprehensive and personalized approach to cancer diagnosis and treatment. By addressing these limitations and exploring these directions, we hope to contribute to the advancement of precision medicine in cancer treatment.

Conclusion

Leveraging enhanced CT images, our study introduces a noninvasive classification prediction model based on the extreme gradient boosting method. This approach exhibits remarkable precision in identifying the lymph node status of lung adenocarcinoma patients, offering a safe and accurate alternative to invasive procedures. By providing clinicians with a reliable tool for diagnosing and assessing disease progression, our method holds the potential to significantly improve patient outcomes and enhance the overall quality of clinical practice.

Availability of data and materials

The datasets used and/or analysed during the current study available from the corresponding author on reasonable request.

References

Szczepanski AP, Tsuboyama N, Watanabe J, Hashizume R, Zhao Z, Wang L. POU2AF2/C11orf53 functions as a coactivator of POU2F3 by maintaining chromatin accessibility and enhancer activity. Sci Adv. 2022;8(40):eabq2403. https://doi.org/10.1126/sciadv.abq2403. (Epub 2022 Oct 5. PMID: 36197978; PMCID: PMC9534498).
Article CAS PubMed PubMed Central Google Scholar
Siegel RL, Miller KD, Fuchs HE, Jemal A. Cancer statistics, 2022. CA Cancer J Clin. 2022;72(1):7–33. https://doi.org/10.3322/caac.21708. (Epub 2022 Jan 12 PMID: 35020204).
Article PubMed Google Scholar
Xie H, Chen Z, Deng J, Zhang J, Duan H, Li Q. Automatic segmentation of the gross target volume in radiotherapy for lung cancer using transresSEUnet 2.5D Network. J Transl Med. 2022;20(1):524. https://doi.org/10.1186/s12967-022-03732-w. (PMID: 36371220; PMCID: PMC9652981).
Article PubMed PubMed Central Google Scholar
National Comprehensive Care Network. Small cell lung cancer. Version 1. 2019. In: NCCN: Clinical practice guidelines in oncology. 2019. Available online: https://www.nccn.org/.Accessed 15 Jul 2019.
Montagne F, Chaari Z, Bottet B, Sarsam M, Mbadinga F, Selim J, Guisier F, Gillibert A, Baste JM. Long-term survival following minimally invasive lung cancer surgery: comparing robotic-assisted and video-assisted surgery. Cancers (Basel). 2022;14(11):2611. https://doi.org/10.3390/cancers14112611. (PMID: 35681593; PMCID: PMC9179652).
Article PubMed Google Scholar
Montagne F, Guisier F, Venissac N, Baste J-M. The role of surgery in lung cancer treatment: present indications and future perspectives—state of the art. Cancers. 2021;13:3711. https://doi.org/10.3390/cancers13153711.
Article PubMed PubMed Central Google Scholar
Ma YC, Tian PF, Chen ZP, Yue DS, Liu CC, Li CG, Chen C, Zhang H, Liu HL, Zhang ZF, Chen L, Zhang B, Wang CL. Urinary malate dehydrogenase 2 is a new biomarker for early detection of non-small-cell lung cancer. Cancer Sci. 2021;112(6):2349–60. https://doi.org/10.1111/cas.14845. (Epub 2021 May 1. PMID: 33565687; PMCID: PMC8177790).
Article CAS PubMed PubMed Central Google Scholar
Xu L, Yang P, Liang W, Liu W, Wang W, Luo C, Wang J, Peng Z, Xing L, Huang M, Zheng S, Niu T. A radiomics approach based on support vector machine using MR images for preoperative lymph node status evaluation in intrahepatic cholangiocarcinoma. Theranostics. 2019;9(18):5374–85. https://doi.org/10.7150/thno.34149. (PMID: 31410221; PMCID: PMC6691572).
Article PubMed PubMed Central Google Scholar
Maiga AW, Deppen SA, Mercaldo SF, Blume JD, Montgomery C, Vaszar LT, Williamson C, Isbell JM, Rickman OB, Pinkerman R, Lambright ES, Nesbitt JC, Grogan EL. Assessment of Fluorodeoxyglucose F18-Labeled Positron Emission Tomography for Diagnosis of High-Risk Lung Nodules. JAMA Surg. 2018;153(4):329–34. https://doi.org/10.1001/jamasurg.2017.4495. (PMID: 29117314; PMCID: PMC5910279).
Article PubMed Google Scholar
Yang X, Pan X, Liu H, et al. A new approach to predict lymph node metastasis in solid lung adenocarcinoma: a radiomics nomogram. J Thorac Dis. 2018;10(Suppl 7):S807–19.
Article PubMed PubMed Central Google Scholar
Zhou L, et al. A comprehensive nomogram combining CT Imaging with clinical features for prediction of lymph node metastasis in Stage I-IIIB non-small cell lung cancer. Ther Innov Regul Sci. 2022;56(1):155–67. https://doi.org/10.1007/s43441-021-00345-1.
Article PubMed Google Scholar
Huang Y, Jiang X, Xu H, et al. Preoperative prediction of mediastinal lymph node metastasis in non-small cell lung cancer based on 18F-FDG PET/CT radiomics. Clin Radiol. 2023;78(1):8–17. https://doi.org/10.1016/j.crad.2022.08.140.
Article CAS PubMed Google Scholar
Fedorov A, et al. 3D Slicer as an image computing platform for the quantitative imaging network. Magn Reson Imaging. 2012;30:1323–41. https://doi.org/10.1016/j.mri.2012.05.001.
Article PubMed PubMed Central Google Scholar
van Griethuysen JJM, Fedorov A, Parmar C, Hosny A, Aucoin N, Narayan V, Beets-Tan RGH, Fillion-Robin JC, Pieper S, Aerts HJWL. Computational radiomics system to decode the radiographic phenotype. Cancer Res. 2017;77(21):e104–7. https://doi.org/10.1158/0008-5472.CAN-17-0339. (PMID: 29092951; PMCID: PMC5672828).
Article CAS PubMed PubMed Central Google Scholar
Zhou Z, Shin JY, Gurudu SR, Gotway MB, Liang J. Active, continual fine tuning of convolutional neural networks for reducing annotation efforts. Med Image Anal. 2021;71:101997. https://doi.org/10.1016/j.media.2021.101997. (Epub 2021 Mar 24. PMID: 33853034; PMCID: PMC8483451).
Article PubMed PubMed Central Google Scholar
Xu Y, Vaziri-Pashkam M. Limits to visual representational correspondence between convolutional neural networks and the human brain. Nat Commun. 2021;12(1):2065. https://doi.org/10.1038/s41467-021-22244-7. (Erratum.In:NatCommun.2021May6;12(1):2740.PMID: 33824315; PMCID: PMC8024324).
Article CAS PubMed PubMed Central Google Scholar
Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition. 2014. arXiv preprint arXiv:1409.1556.
He K, Zhang X, Ren S, Sun J. Deep Residual Learning for Image Recognition. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016; 770–778. https://doi.org/10.1109/CVPR.2016.90
Szegedy C., Liu W., Jia Y., Sermanet P., Reed S., Anguelov D., Erhanet D., Vanhoucke V., Rabinovich A. Going Deeper with Convolutions. Retrieved December 17, 2019, from Google Research website:https://research.google/pubs/pub43022/
Howard A, Sandler M, Chu G, Chen LC, Chen B, Tan M, et al. Searching for mobilenetv3[C]//Proceedings of the IEEE/CVF international conference on computer vision. 2019. p. 1314–24.
Deng J., Dong W., Socher R., Li JL., LI FF. "ImageNet: a Large-Scale Hierarchical Image Database." 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 20–25 June 2009, Miami, Florida, USA IEEE, 2009.
http://www.python.org.
Hessam S, Scholl L, Sand M, Schmitz L, Reitenbach S, Bechara FG. A novel severity assessment scoring system for hidradenitis suppurativa. JAMA Dermatol. 2018;154(3):330–5. https://doi.org/10.1001/jamadermatol.2017.5890. (PMID: 29417136; PMCID: PMC5885841).
Article PubMed PubMed Central Google Scholar
Tan B, Chua J, Lin E, Cheng J, Gan A, Yao X, Wong DWK, Sabanayagam C, Wong D, Chan CM, Wong TY, Schmetterer L, Tan GS. Quantitative microvascular analysis with wide-field optical coherence tomography angiography in eyes with diabetic retinopathy. JAMA Netw Open. 2020;3(1):e1919469. https://doi.org/10.1001/jamanetworkopen.2019.19469. (Erratum.In:JAMANetwOpen.2020Jun1;3(6):e2010994.PMID: 31951275; PMCID: PMC6991275).
Article PubMed PubMed Central Google Scholar
Hosny A, Parmar C, Quackenbush J, Schwartz LH, Aerts H. Artificial intelligence in radiology. Nat Rev Cancer. 2018;18:500–10. https://doi.org/10.1038/s41568-018-0016-5.
Article CAS PubMed PubMed Central Google Scholar
Hewitt J, Carter B, Vilches-Moraga A, Quinn TJ, Braude P, Verduri A, Pearce L, Stechman M, Short R, Price A, Collins JT, Bruce E, Einarsson A, Rickard F, Mitchell E, Holloway M, Hesford J, Barlow-Pay F, Clini E, Myint PK, Moug SJ, McCarthy K, COPE Study Collaborators. The effect of frailty on survival in patients with COVID-19 (COPE): a multicentre, European, observational cohort study. Lancet Public Health. 2020;5(8):e444–51. https://doi.org/10.1016/S2468-2667(20)30146-8. (Epub 2020 Jun 30. PMID: 32619408; PMCID: PMC7326416).
Article PubMed PubMed Central Google Scholar
Hemani G, Zheng J, Elsworth B, Wade KH, Haberland V, Baird D, Laurin C, Burgess S, Bowden J, Langdon R, Tan VY, Yarmolinsky J, Shihab HA, Timpson NJ, Evans DM, Relton C, Martin RM, Davey Smith G, Gaunt TR, Haycock PC. The MR-Base platform supports systematic causal inference across the human phenome. Elife. 2018;30(7):e34408. https://doi.org/10.7554/eLife.34408. (PMID: 29846171; PMCID: PMC5976434).
Article Google Scholar
Shen H, Chen L, Liu K, Zhao K, Li J, Yu L, Ye H, Zhu W. A subregion-based positron emission tomography/computed tomography (PET/CT) radiomics model for the classification of non-small cell lung cancer histopathological subtypes. Quant Imaging Med Surg. 2021;11(7):2918–32. https://doi.org/10.21037/qims-20-1182. (PMID: 34249623; PMCID: PMC8250013).
Article PubMed PubMed Central Google Scholar
Tagliafico AS, Bignotti B, Rossi F, Matos J, Calabrese M, Valdora F, et al. Breast cancer Ki-67 expression prediction by digital breast tomosynthesis radiomics features. Eur Radiol Exp. 2019;3:36. https://doi.org/10.1186/s41747-019-0117-2.
Article PubMed PubMed Central Google Scholar
Feng L, Zhu S, Liu F, He Y, Bao Y, Zhang C. Hyperspectral imaging for seed quality and safety inspection: a review. Plant Methods. 2019;8(15):91. https://doi.org/10.1186/s13007-019-0476-y. (PMID: 31406499; PMCID: PMC6686453).
Article CAS Google Scholar
Ban X, Hu H, Li Y, Yang L, Wang Y, Zhang R, Xie C, Zhou C, Duan X. Morphologic CT and MRI features of primary parotid squamous cell carcinoma and its predictive factors for differential diagnosis with mucoepidermoid carcinoma. Insights Imaging. 2022;13(1):119. https://doi.org/10.1186/s13244-022-01256-x. (PMID: 35840821; PMCID: PMC9287497).
Article PubMed PubMed Central Google Scholar
Hida T, Hata A, Lu J, Valtchinov VI, Hino T, Nishino M, Honda H, Tomiyama N, Christiani DC, Hatabu H. Interstitial lung abnormalities in patients with stage I non-small cell lung cancer are associated with shorter overall survival: the Boston lung cancer study. Cancer Imaging. 2021;21(1):14. https://doi.org/10.1186/s40644-021-00383-w. (PMID: 33468255; PMCID: PMC7816399).
Article PubMed PubMed Central Google Scholar
Budai BK, Tóth A, Borsos P, Frank VG, Shariati S, Fejér B, Folhoffer A, Szalay F, Bérczi V, Kaposi PN. Three-dimensional CT texture analysis of anatomic liver segments can differentiate between low-grade and high-grade fibrosis. BMC Med Imaging. 2020;20(1):108. https://doi.org/10.1186/s12880-020-00508-w. (PMID: 32957949; PMCID: PMC7507285).
Article PubMed PubMed Central Google Scholar
Folhoffer A, Szalay F, Bérczi V, Kaposi PN. Three-dimensional CT texture analysis of anatomic liver segments can differentiate between low-grade and high-grade fibrosis. BMC Med Imaging. 2020;20(1):108. https://doi.org/10.1186/s12880-020-00508-w. (PMID: 32957949; PMCID: PMC7507285).
Article PubMed PubMed Central Google Scholar
Toyama Y, Hotta M, Motoi F, Takanami K, Minamimoto R, Takase K. Prognostic value of FDG-PET radiomics with machine learning in pancreatic cancer. Sci Rep. 2020;10(1):17024. https://doi.org/10.1038/s41598-020-73237-3. (PMID: 33046736; PMCID: PMC7550575).
Article CAS PubMed PubMed Central Google Scholar
Alksas A, Shehata M, Saleh GA, Shaffie A, Soliman A, Ghazal M, Khelifi A, Khalifeh HA, Razek AA, Giridharan GA, El-Baz A. A novel computer-aided diagnostic system for accurate detection and grading of liver tumors. Sci Rep. 2021;11(1):13148. https://doi.org/10.1038/s41598-021-91634-0. (PMID: 34162893; PMCID: PMC8222341).
Article CAS PubMed PubMed Central Google Scholar
Shao Y, Chen Z, Ming S, Ye Q, Shu Z, Gong C, Pang P, Gong X. Predicting the development of normal-appearing white matter with radiomics in the aging brain: a longitudinal clinical study. Front Aging Neurosci. 2018;28(10):393. https://doi.org/10.3389/fnagi.2018.00393. (PMID: 30546304; PMCID: PMC6279861).
Article CAS Google Scholar
Duron L, Balvay D, Vande Perre S, Bouchouicha A, Savatovsky J, Sadik JC, Thomassin-Naggara I, Fournier L, Lecler A. Gray-level discretization impacts reproducible MRI radiomics texture features. PLoS One. 2019;14(3):e0213459. https://doi.org/10.1371/journal.pone.0213459. (PMID: 30845221; PMCID: PMC6405136).
Article CAS PubMed PubMed Central Google Scholar
Rovira-Clavé X, Drainas AP, Jiang S, Bai Y, Baron M, Zhu B, Dallas AE, Lee MC, Chu TP, Holzem A, Ayyagari R, Bhattacharya D, McCaffrey EF, Greenwald NF, Markovic M, Coles GL, Angelo M, Bassik MC, Sage J, Nolan GP. Spatial epitope barcoding reveals clonal tumor patch behaviors. Cancer Cell. 2022;40(11):1423–1439.e11. https://doi.org/10.1016/j.ccell.2022.09.014. (Epub 2022 Oct 13. PMID: 36240778; PMCID: PMC9673683).
Article CAS PubMed PubMed Central Google Scholar
Kim JS, Jeong SK, Oh SJ, Lee CG, Kang YR, Jo WS, Jeong MH. The resveratrol analogue, HS-1793, enhances the effects of radiation therapy through the induction of anti-tumor immunity in mammary tumor growth. Int J Oncol. 2020;56(6):1405–16. https://doi.org/10.3892/ijo.2020.5017. (Epub 2020 Mar 19. PMID: 32236622; PMCID: PMC7170036).
Article CAS PubMed PubMed Central Google Scholar
Sheffield NC, Pierron G, Klughammer J, Datlinger P, Schönegger A, Schuster M, Hadler J, Surdez D, Guillemot D, Lapouble E, Freneaux P, Champigneulle J, Bouvier R, Walder D, Ambros IM, Hutter C, Sorz E, Amaral AT, de Álava E, Schallmoser K, Strunk D, Rinner B, Liegl-Atzwanger B, Huppertz B, Leithner A, de Pinieux G, Terrier P, Laurence V, Michon J, Ladenstein R, Holter W, Windhager R, Dirksen U, Ambros PF, Delattre O, Kovar H, Bock C, Tomazou EM. DNA methylation heterogeneity defines a disease spectrum in Ewing sarcoma. Nat Med. 2017;23(3):386–95. https://doi.org/10.1038/nm.4273. (Epub 2017 Jan 30. PMID: 28134926; PMCID: PMC5951283).
Article CAS PubMed PubMed Central Google Scholar
Baeßler B, Mannil M, Maintz D, Alkadhi H, Manka R. Texture analysis and machine learning of non-contrast T1-weighted MR images in patients with hypertrophic cardiomyopathy-Preliminary results. Eur J Radiol. 2018;102:61–7. https://doi.org/10.1016/j.ejrad.2018.03.013.
Article PubMed Google Scholar
Jiang F, Yu Q, Chu Y, Zhu X, Lu W, Liu Q, Wang Q. MicroRNA-98–5p inhibits proliferation and metastasis in non-small cell lung cancer by targeting TGFBR1. Int J Oncol. 2019;54(1):128–38. https://doi.org/10.3892/ijo.2018.4610. (Epub 2018 Oct 29. PMID: 30387848; PMCID: PMC6255066).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

Thanks for the support from the above funding.

Funding

The funding bodies played no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript. This study was supported by:

1 Key Laboratory of Tumor Precision Medicine, Hunan colleges and Universities Project (No.2019–379).

2 Science and Technology Funding Project of Hunan Province, China (No. 2021SK52205).

3 National innovation and entrepreneurship training program for college students, (No. 202210545021).

4 Macao Polytechnic University Grant (RP/FCA-05/2022).

5 Science and Technology Development Fund of Macao (0105/2022/A).

Author information

Authors and Affiliations

Department of Radiation Oncology, Affiliated Hospital (Clinical College) of Xiangnan University, Chenzhou, Hunan province, 423000, People’s Republic of China
Hui Xie & Qing Li
Faculty of Applied Sciences, Macao Polytechnic University, Macao, 999078, People’s Republic of China
Hui Xie & Tao Tan
School of Medical Imaging, Laboratory Science and Rehabilitation, Xiangnan University, Chenzhou, Hunan province, 423000, People’s Republic of China
Chaoling Song, Lei Jian, Yeang Guo, Mei Li & Jiang Luo
Department of Radiology and Nuclear Medicine, Radboud University Medical Centre, Nijmegen, Netherlands
Tao Tan

Authors

Hui Xie
View author publications
You can also search for this author in PubMed Google Scholar
Chaoling Song
View author publications
You can also search for this author in PubMed Google Scholar
Lei Jian
View author publications
You can also search for this author in PubMed Google Scholar
Yeang Guo
View author publications
You can also search for this author in PubMed Google Scholar
Mei Li
View author publications
You can also search for this author in PubMed Google Scholar
Jiang Luo
View author publications
You can also search for this author in PubMed Google Scholar
Qing Li
View author publications
You can also search for this author in PubMed Google Scholar
Tao Tan
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Tao Tan and Hui Xie designed the study, searched, analyzed and interpreted the literature and are the major contributors in writing the manuscript. Chaoling Song, Lei Jian, Yeang Guo, Mei Li, Jiang Luo and Qing Li collect the case data and Tao Tan revised the manuscript.

Corresponding author

Correspondence to Tao Tan.

Ethics declarations

Ethics approval and consent to participate

This study has obtained formal approval from the Institutional Review Board (IRB) of two participating institutions. These institutions are the Medical Research Ethics Committee of Xiangnan University Affiliated Hospital (Clinical College) (Approval Number: AF/SC-07–4/05.0) and the Medical Research Ethics Committee of Hebi People's Hospital (Approval Number: 22–350-18018). Both institutions have waived the need for informed consent for this study. This study was constructed following ethical guidelines of World Medical Association (WMA) Declaration of Helsinki.

Consent for publication

N/A.

Competing interests

The authors declare no competing interests..

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Material 1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Xie, H., Song, C., Jian, L. et al. A deep learning-based radiomics model for predicting lymph node status from lung adenocarcinoma. BMC Med Imaging 24, 121 (2024). https://doi.org/10.1186/s12880-024-01300-w

Download citation

Received: 06 March 2023
Accepted: 14 May 2024
Published: 24 May 2024
DOI: https://doi.org/10.1186/s12880-024-01300-w

A deep learning-based radiomics model for predicting lymph node status from lung adenocarcinoma

Abstract

Objectives

Methods

Results

Conclusions

Introduction

Patients and methods

Data collection

CT scan protocols

Data acquisition

Model construction

Statistical analysis

Results

Repeatability assessment of feature extraction

Traditional radiomics feature selection and model construction

Deep learning radiomics feature selection and model construction

Comprehensive predictive model construction

Prediction model determination

Discussion

Conclusion

Availability of data and materials

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Supplementary Information

Supplementary Material 1.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Medical Imaging

Contact us