Skip to main content

Table 4 Experiment 1- Diagnostic performance of the CNNs and radiologists

From: The efficacy of deep learning models in the diagnosis of endometrial cancer using MRI: a comparison with radiologists

Image set

Interpreter

Sensitivity

Specificity

Accuracy

AUC

P-value for AUC (vs. CNN)

Axial ADC map

CNN

0.94 (0.87–0.98)

0.87 (0.79–0.91)

0.91 (0.83–0.95)

0.95 (0.91–1.00)

 

Reader1

0.71 (0.56–0.83)

0.85 (0.71–0.94)

0.77 (0.68–0.85)

0.78 (0.70–0.86)

 < 0.001*

 

Reader2

0.67 (0.52–0.79)

0.87 (0.74–0.95)

0.76 (0.67–0.84)

0.77 (0.69–0.85)

 < 0.001*

 

Reader3

0.77 (0.63–0.87)

0.78 (0.63–0.87)

0.77 (0.68–0.85)

0.77 (0.69–0.86)

 < 0.001*

Axial T2WI

CNN

0.90 (0.83–0.95)

0.83 (0.74–0.88)

0.87 (0.79–0.92)

0.90 (0.84–0.96)

 

Reader1

0.73 (0.58–0.84)

0.96 (0.85–1.00)

0.84 (0.75–0.90)

0.84 (0.77–0.91)

0.220

 

Reader2

0.61 (0.46–0.74)

0.94 (0.82–0.99)

0.76 (0.67–0.84)

0.77 (0.70–0.85)

0.015*

 

Reader3

0.73 (0.58–0.84)

0.91 (0.79–0.98)

0.81 (0.72–0.89)

0.82 (0.75–0.89)

0.100

Sagittal T2WI

CNN

0.90 (0.82–0.95)

0.80 (0.72–0.86)

0.86 (0.77–0.91)

0.88 (0.81–0.95)

 

Reader1

0.69 (0.54–0.81)

1.00 (0.89–1.00)

0.84 (0.75–0.90)

0.84 (0.78–0.91)

0.457

 

Reader2

0.77 (0.63–0.87)

0.94 (0.82–0.99)

0.85 (0.76–0.91)

0.85 (0.78–0.92)

0.574

 

Reader3

0.75 (0.60–0.86)

0.87 (0.74–0.95)

0.80 (0.71–0.88)

0.81 (0.73–0.89)

0.167

Axial CE-T1WI

CNN

0.84 (0.71–0.93)

0.89 (0.76–0.96)

0.87 (0.78–0.93)

0.93 (0.87–0.98)

 

Reader1

0.75 (0.60–0.86)

0.94 (0.82–0.99)

0.84 (0.75–0.90)

0.84 (0.77–0.91)

0.006*

 

Reader2

0.77 (0.63–0.87)

0.91 (0.79–0.98)

0.84 (0.75–0.90)

0.84 (0.77–0.91)

0.002*

 

Reader3

0.77 (0.63–0.87)

0.91 (0.79–0.98)

0.84 (0.75–0.90)

0.84 (0.77–0.91)

0.014*

Sagittal CE-T1WI

CNN

0.90 (0.83–0.95)

0.83 (0.74–0.88)

0.87 (0.79–0.92)

0.90 (0.84–0.97)

 

Reader1

0.78 (0.65–0.89)

0.94 (0.82–0.99)

0.86 (0.77–0.92)

0.86 (0.79–0.93)

0.336

 

Reader2

0.73 (0.58–0.84)

0.96 (0.85–1.00)

0.84 (0.75–0.90)

0.84 (0.77–0.91)

0.173

 

Reader3

0.84 (0.71–0.93)

0.87 (0.74–0.95)

0.86 (0.77–0.92)

0.86 (0.79–0.93)

0.341

Combined axial T2WI + ADC map

CNN

0.82 (0.69–0.92)

0.87 (0.74–0.95)

0.85 (0.76–0.91)

0.93 (0.88–0.98)

 

Reader1

0.73 (0.58–0.84)

0.96 (0.85–1.00)

0.84 (0.75–0.90)

0.58 (0.48–0.68)

 < 0.001*

 

Reader2

0.84 (0.71–0.93)

0.98 (0.89–1.00)

0.91 (0.83–0.96)

0.91 (0.86–0.97)

0.598

 

Reader3

0.88 (0.76–0.96)

0.87 (0.74–0.95)

0.88 (0.79–0.93)

0.88 (0.81–0.94)

0.196

Combined axial T2WI + CE-T1WI

CNN

0.84 (0.71–0.93)

0.91 (0.79–0.98)

0.88 (0.79–0.93)

0.89 (0.83–0.96)

 

Reader1

0.80 (0.67–0.90)

0.98 (0.89–1.00)

0.89 (0.81–0.94)

0.89 (0.83–0.95)

0.943

 

Reader2

0.80 (0.67–0.90)

0.96 (0.85–1.00)

0.88 (0.79–0.93)

0.88 (0.82–0.94)

0.720

 

Reader3

0.92 (0.81–0.98)

0.85 (0.71–0.94)

0.89 (0.81–0.94)

0.89 (0.82–0.95)

0.839

Combined sagittal T2WI + CE-T1WI

CNN

0.94 (0.84–0.99)

0.74 (0.59–0.86)

0.85 (0.76–0.91)

0.89 (0.82–0.95)

 

Reader1

0.80 (0.67–0.90)

0.98 (0.89–1.00)

0.89 (0.81–0.94)

0.89 (0.83–0.95)

0.890

 

Reader2

0.69 (0.54–0.81)

1.00 (0.89–1.00)

0.84 (0.75–0.90)

0.84 (0.78–0.91)

0.375

 

Reader3

0.86 (0.74–0.94)

0.87 (0.74–0.95)

0.87 (0.78–0.93)

0.87 (0.80–0.94)

0.667

Combined axial T2WI + ADC map + CE-T1WI

CNN

0.80 (0.67–0.90)

0.80 (0.66–0.91)

0.80 (0.71–0.88)

0.87 (0.80–0.94)

 

Reader1

0.71 (0.56–0.83)

1.00 (0.89–1.00)

0.85 (0.76–0.91)

0.85 (0.79–0.92)

0.675

 

Reader2

0.67 (0.52–0.79)

1.00 (0.89–1.00)

0.83 (0.73–0.89)

0.83 (0.77–0.90)

0.406

 

Reader3

0.78 (0.65–0.89)

0.94 (0.82–0.99)

0.86 (0.77–0.92)

0.86 (0.79–0.93)

0.813

  1. Diagnostic performance of the CNNs and radiologists in the test using the single and combined image sets
  2. T2WI, T2 weighted image; ADC, Apparent Diffusion Coefficient; CE-T1WI, contrast-enhanced T1 weighted image, AUC, area under the receiver operating characteristic curve; Data in parentheses are 95% confidence interval. *P < 0.05