Skip to main content

The segmentation of bones in pelvic CT images based on extraction of key frames

Abstract

Background

Bone segmentation is important in computed tomography (CT) imaging of the pelvis, which assists physicians in the early diagnosis of pelvic injury, in planning operations, and in evaluating the effects of surgical treatment. This study developed a new algorithm for the accurate, fast, and efficient segmentation of the pelvis.

Methods

The proposed method consists of two main parts: the extraction of key frames and the segmentation of pelvic CT images. Key frames were extracted based on pixel difference, mutual information and normalized correlation coefficient. In the pelvis segmentation phase, skeleton extraction from CT images and a marker-based watershed algorithm were combined to segment the pelvis. To meet the requirements of clinical application, physician’s judgment is needed. Therefore the proposed methodology is semi-automated.

Results

In this paper, 5 sets of CT data were used to test the overlapping area, and 15 CT images were used to determine the average deviation distance. The average overlapping area of the 5 sets was greater than 94%, and the minimum average deviation distance was approximately 0.58 pixels. In addition, the key frame extraction efficiency and the running time of the proposed method were evaluated on 20 sets of CT data. For each set, approximately 13% of the images were selected as key frames, and the average processing time was approximately 2 min (the time for manual marking was not included).

Conclusions

The proposed method is able to achieve accurate, fast, and efficient segmentation of pelvic CT image sequences. Segmentation results not only provide an important reference for early diagnosis and decisions regarding surgical procedures, they also offer more accurate data for medical image registration, recognition and 3D reconstruction.

Peer Review reports

Background

Recently, traffic accidents, falls and other serious high-energy trauma have often led to pelvic fractures. Researches show that the incidence of heavy-vehicle injury has increased over the years [1] and patients with pelvic fractures who present in shock have a mortality of 30–50% [2]. Pelvic fracture is the third most common cause of death in traffic trauma [3]. Rapid and accurate diagnosis and treatment are not only important to reduce the mortality caused by pelvic fractures, but also helpful for the functional reconstruction and correction of deformities of the pelvis. Medical imaging and processing are crucial in the process of diagnosis and treatment. The computed tomography (CT) images and 3D reconstruction by CT are commonly used to display the anatomical structure of pelvis and characteristics of the lesions [4]. Analysis based on CT images is important for the description of pelvic anatomy, the planning of surgical procedures and the evaluation of the post-operative effects [5]. And the most important step of analysis is bone segmentation, which is crucial to quantitatively evaluate the degree of fracture, detect the location of bleeding, and judge the condition of injury [6]. Therefore, the accuracy of segmentation will affect the doctor’s judgment of the disease, the selection of the best surgical approach, etc.

At present, the pelvic region is manually marked for surgical procedures in the clinic. This process is time-consuming and error prone [7]. Therefore, it is necessary to propose a fast, accurate, and efficient method for pelvic segmentation.

Image segmentation is a hot topic in medical image processing. Recently, various segmentation methods have been proposed [8], some of which are based on thresholding [9,10,11,12]. The central idea of threshold segmentation is to transform the segmentation problem into pixel classification problem. Considering the fact that bone mineral density is heterogeneous, the connection between the femoral head and acetabulum is narrow, and a weak edge can be caused by disease, it is difficult to select a universal threshold. Other methods are based on the region growing technique, in which pixels with similar properties are set up as a region [13,14,15]. However, this method requires a long time and a large amount of space. In recent years, classification and clustering have also been used for medical image segmentation, and the relevant research focuses on the improvement of robustness [16,17,18]. Methods based on deformable models [19,20,21,22,23,24,25] and active shape models [26,27,28,29,30] have become a hot topic. For example, Calder et al. [20]proposed a segmentation method based on level set. Truc et al. [21] included density distance enhancement in a C-V model, and Martinez et al. [22] applied multi-scale edge detection to adjust a geometric model. Wu et al. [26] combined the active shape model with template matching, and Li et al. [27] combined the active shape model with a clustering model. However, most segmentation algorithms put more attention to the characteristics of a single CT image, so that the characteristics of CT sequences are usually ignored. Although there are many existing algorithms for medical image segmentation, an effective and accurate algorithm for the segmentation of pelvic CT image sequences has not been proposed. To assist doctors in diagnosis and treatment, an accurate segmentation of pelvic structure is essential. However, considering factors such as individual differences among different patients, bone mineral density unevenness, and narrowing of the hip joint space, it is difficult to rely only on gray scale information for accurate segmentation.

This paper proposes a novel segmentation method based on CT images of the pelvis. In the sequence of CT slices, because of the small shooting distance, the morphological characteristics between two adjacent CT images have high similarity. Thus, the key frames can be extracted. And the results of key frames are used to direct the segmentation of the remaining pelvic CT images according to the watershed algorithm based on skeleton markers. The proposed method greatly decreases the diagnostic time, and it’s helpful for the early diagnosis of patients, selection of surgical planning, etc.

Methods

In this algorithm, we will select some CT images as key frames by using the method of key frame extraction. Then, experts will manually mark key frames to appoint the bone topological structure of these CT images. The segmentation results of key frames will be applied to describe the bone topology of other images. Next, the bone topology of the CT images will be used as a marker, and the watershed algorithm based on skeleton markers will be used to realize the automatic segmentation of pelvic structure. The algorithm consists of four parts: pre-processing of the CT image, key frame extraction in the CT image sequence, interactive marking and the watershed algorithm based on skeleton markers. The pretreatment process is used to denoise, extract regions of interest, and resize the image, reducing the amount of data in a single CT image. The CT sequence key frame extraction process is divided into two steps, and this process is used to reduce the number of CT images required for manual segmentation. In addition, the aim of interactive marking is to specify the bone topology of key frames. By matching key frames, the bone topology of each CT image can be acquired so that the accuracy of pelvic segmentation can be guaranteed. Finally, based on the interactive marking of key frames, the watershed algorithm based on skeleton markers is used to automatically segment all the CT images. The overall schematic diagram of the algorithm is shown in Fig. 1. In the following sections, the method is explained in detail.

Fig. 1
figure 1

Schematic diagram of pelvic bone segmentation. The method consists of four parts: pre-processing of the CT image, key frame extraction, interactive marking and the watershed algorithm based on skeleton markers. After pre-processing, some CT images are selected as key frames. Experts manually add or remove marker points to realize the segmentation of key frames. The segmentation results of key frames will be used to segment other CT images

Dataset

The dataset has been obtained from Tianjin Medical University General Hospital and Tianjin Nankai Hospital. Data has been randomly collected from 20 patients with traumatic pelvic injuries. These patients are in the 35- to 60-year age range and both sexes account for half. For each CT image sequence, the size of the planning CT images in the axial plane is 512 × 512 pixels, with 1 mm image resolution and 2–7 mm slice thickness. Besides, a total of 245 images are collected from each patient.

Pre-processing

Pre-processing removes surrounding artifacts from the original image, such as the CT platform and cables. Moreover, the image size is also adjusted in this process. The pelvis area is segmented from the original image as follows:

  1. 1.

    Only extract the image part of each DICOM file. The image data is 16 bits and there are 65,536 Gy levels. Considering the limited resolution of the human eyes and the limitations of experimental facilities, convert the data in the image using (1) with data compression to 256 Gy levels.

$$ f(x)=\left\{\begin{array}{c}0\\ {}\left[x-\left(c-\frac{w}{2}\right)\right]\times \frac{255}{w}\\ {}255\end{array}\begin{array}{c}x\le c-\frac{w}{2}\\ {}c-\frac{w}{2}<x<c+\frac{w}{2}\\ {}x\ge c+\frac{w}{2}\end{array}\right. $$
(1)

Here, f is a bitmap gray value, x is image data, w is window width, and c is window level.

  1. 2.

    Create a binary version of the original image according to (2).

$$ f(x)=\left\{\begin{array}{c}0\\ {}1\end{array}\right.{\displaystyle \begin{array}{c}x=0\\ {} else\end{array}} $$
(2)
  1. 3.

    Apply morphological operations to the binary image so that the different objects can be separated and that the object which has the largest area is the pelvis. Next, all the pixels in this object will be set to 1, and others become 0. The binary image is called the “mask image”.

  2. 4.

    Conduct an image convolution with the mask image and the original image, removing the CT platform, artifacts and other interference. At this time, the image is reserved for only the pelvis region; the remainder has been set to 0.

  3. 5.

    Adjust the size of the image by cutting out the non-pelvis area as much as possible. From now on, the pre-processed image is called the “input image”.

The method of key frame extraction

After pre-processing, key frame extraction from the CT image sequence becomes the focus of the method. In this paper, key frames refer to the CT images of a CT sequence that have obvious changes in bone structure. As we can see from Fig. 2, there are four consecutive CT images. The first two CT images have similar bone structures. From the graph, we also find that the bone structure of (c) has obvious topological differences from the previous images. (b) has three bone topologies, while (c) has four obvious bone topologies. Therefore, (c) can be selected as a key frame. The method of key frame extraction consists of two steps. The first step is to obtain the candidate key frames. The last step is to obtain the final key frames.

Fig. 2
figure 2

Four consecutive CT images of Patient 1. (a) and (b) have similar bone topologies. We believe that each has three bone topologies, while (c) and (d) have four obvious bone topologies. The differences are shown in red

To obtain candidate key frames, we extract the approximate region of bone as a region of interest so that the computation can be reduced. The pixel difference of the adjacent images is used to determine whether a new candidate key frame is required. The pixel difference of the adjacent images is defined as (3).

$$ Dif={y}_{j+1}-{y}_j $$
(3)

The greater the value is, the lower the degree of similarity is. Therefore, when the pixel difference of the adjacent images is greater than the threshold, it is considered that the difference between the adjacent images is large and that the similarity is not obvious. Thus, a new candidate key frame is added to the candidate set.

To summarize, the extraction of candidate key frames is implemented as follows:

1. Use a Gaussian filter on the input image.

2. Perform wavelet analysis of the filtered image and conduct image reconstruction using the approximation matrix.

3. Calculate T1 based on (4).

4. Create a binary image based on (5).

5. Convolute the binary image with the filtered image and the resulting image is called the “interesting image”.

6. Sort the interesting images in accordance with the spatial order and import CT data{y1, y2, y3, , yL}.

7. For i = 1 to L ‐ 1

  Calculate the pixel difference of the adjacent images

  If the difference is greater than T2

   the i + 1image is added to the candidate set

  Endif

 Endfor

8. The candidate set of key frames {g1, g2, g3, , g l } is gained.

During the extraction of the candidate key frames, T1 is the mean value of the reconstructed image of the non-zero area. It is defined as (4) and used to create a binary image. (4) and (5) are combined to obtain the approximate region of bones. In particular, the key frames in the candidate set are the pre-processed images and not the reconstructed images.

$$ {T}_1=\sum \limits_{x=0}^{N-1}\sum \limits_{y=0}^{M-1}f\left(x,y\right)/\left[N\times M- Zeros(S)\right] $$
(4)

Where f(x, y) is a bitmap gray value, N is the width of image, and M is the height of image. S = {(x, y)|f(x, y) = 0} is the set of background pixels and zero pixels. Zeros(S) is the notation used for the cardinality of set S.

$$ Mask\left(x,y\right)=\left\{\begin{array}{c}1\\ {}0\end{array}\right.{\displaystyle \begin{array}{c}f\left(x,y\right)>{T}_1\\ {}f\left(x,y\right)\le {T}_1\end{array}} $$
(5)

Where Mask(x, y)is the binary image of reconstructed image.

To illustrate the key frame extraction process, we will use data of Patient 1 to explain the process in detail. After generating the interesting images, these images are numbered from 1 to 245 according to the anatomic structure. Figure 3 shows the pixel difference value of the example. To avoid missing key frames, T2 is defined as (6), and it is described in blue in Fig. 3. After the extraction of the candidate set, there are 196 CT images to be selected out.

$$ {T}_2=0.7\times mean(Dif) $$
(6)
Fig. 3
figure 3

Pixel difference value of images for Patient 1

To obtain the target key frames, mutual information [31] and the normalized correlation coefficient [32] are utilized. The mutual information is defined as (7), and the normalized correlation coefficient is defined as (8). The mutual information reflects the gray correlation of the image; a greater mutual information value indicates that the gray difference of the two images is smaller. However, there is a lack of spatial location information so that redundant key frames will be extracted only when using mutual information. Thus, the normalized correlation coefficient is proposed to reduce the redundant key frames; a greater normalized correlation coefficient value indicates that the two images have higher similarity. The experimental results show that using the normalized correlation coefficient can effectively reduce the number of redundant key frames.

$$ I\left(x,y\right)=\sum \limits_{a\in {f}_1,b\in {f}_2}{P}_{f_1{f}_2}\left(a,b\right)\log \left[{P}_{f_1{f}_2}\left(a,b\right)/{P}_{f_1}(a){P}_{f_2}(b)\right] $$
(7)

Where \( {P}_{f_1}(a) \) is the probability density of imagef1, and \( {P}_{f_2}(b) \) is the probability density of image f2. \( {P}_{f_1{f}_2}\left(a,b\right) \) is the joint probability density of image f1 and image f2.

$$ R\left(x,y\right)=\frac{\sum \limits_{i=0}^{M-1}\sum \limits_{j=0}^{N-1}\left[{f}_1\left(i,j\right)-\overline{f_1}\right]\left[{f}_2\left(i,j\right)-\overline{f_2}\right]}{\sqrt{\sum \limits_{i=0}^{M-1}\sum \limits_{j=0}^{N-1}{\left[{f}_1\left(i,j\right)-\overline{f_1}\right]}^2\sum \limits_{i=0}^{M-1}\sum \limits_{j=0}^{N-1}{\left[{f}_2\left(i,j\right)-\overline{f_2}\right]}^2}} $$
(8)

Here, f1(i, j) is the pixel value of image f1, f2(i, j) is the pixel value of imagef2, \( \overline{f_1} \) is the average gray value of image f1, and \( \overline{f_2} \) is the average gray value of imagef2 .

The extraction of the target set from the candidate set is illustrated as follows:

1. Import the candidate set{g1, g2, g3, , g l }.

2. For i = 1 to l ‐ 1

  Calculate the mutual information

  If (I(i + 1, i) ≤ T3)

   the i + 1image is added to the intermediate set

  Endif

 Endfor

3. For j = 1to k ‐ 1

  Calculate the normalized correlation coefficient

  If (R(j + 1, j) ≤ T4)

   the j + 1image is added to the target set

  Endif

 Endfor

4. The target set of key frames {k1, k2, k3, , k t } is gained.

The mutual information value of the candidate set is shown in Fig. 4. In this case, T3 is the mean mutual information value and is shown in blue. In addition, when the mutual information value of adjacent candidate key frames is less thanT3, the candidate key frames will be selected out to calculate the normalized correlation coefficient. As we can see from Fig. 5, there is no significant change in the bone topology of these images in the intermediate set, and these images can use the segmentation results of the same key frame as the reference. Therefore, it is necessary to take measures to remove redundant key frames in the intermediate set. Figure 6 shows the normalized correlation coefficient value of images in the intermediate set. T4 is the mean normalized correlation coefficient and is shown in blue. An image will be chosen as a key frame in the target set when the normalized correlation coefficient is less than T4. Finally, there are 31 images in the target set.

Fig. 4
figure 4

The mutual information value of candidate key frames for Patient 1

Fig. 5
figure 5

Example of some consecutive key frames in an intermediate set for Patient 1. After using the mutual information, some obtained key frames still have similar bone topologies. As seen from (a - f), all frames have three bone topologies

Fig. 6
figure 6

The normalized correlation coefficient of images in the intermediate set for Patient 1

To describe the distribution of the key frames in the CT image sequence, Fig. 7 shows the interval between key frames. And the interval refers to the difference of each key-frame slice from the previous key-frame slice. At the same time, we equidistantly extract 31 images from the CT image sequence as a reference. It is found that the key frames are not evenly distributed. As a result, we can draw a conclusion that the key frames can effectively reflect the changes in pelvic structure and that it is meaningful to extract key frames.

Fig. 7
figure 7

The interval between key frames in the target set for Patient 1

Based on the above procedure, the target set of key frames {k1, k2, k3, , k t } can be extracted. After the extraction of key frames, the number of CT images required for manual marking is greatly reduced.

Interactive marking

For each image in the target set, the first step is creating a binary version of the image. Next, marker points are obtained by extracting the skeleton from the binary image. The method of extracting the skeleton is introduced in the next section. Finally, the initial contours can be automatically generated through the watershed algorithm based on skeleton markers, which will also be introduced in next section. However, the results may be inconsistent with the topological structure of the human pelvis so that the segmentation results of this automatic algorithm cannot meet the requirements of clinical guidance. Therefore, the resulting image will be provided to experts, who will manually add or remove marker points to correct the initial contours. The segmentation results of key frames will be used to segment other CT images.

The watershed algorithm based on skeleton marking

The method is divided into two parts: skeleton extraction and the marker-based watershed algorithm. Before segmentation, it is first necessary to search for the key frame matching the CT image. When a CT image is processed, the corresponding key frame image is obtained by calculating the normalized correlation coefficient between the CT image and key frames in the target set. In the corresponding key frame image, the bone region segmentation results will be regarded as foreground, and the remaining region will be regarded as background. Next, the skeleton of foreground and background can be obtained via skeleton extraction. On the other hand, the gradient image of CT image should be gained. Then, using skeleton image mark the gradient image and shield the original minimum pixels in gradient image. Therefore, the new gradient image will be acquired. Finally, the segmentation results are obtained by using watershed transform for the new gradient image.

Skeleton extraction

The skeleton can provide information about the size and shape of targets in an image [33]. In this paper, we use the skeleton to describe location and topology information for pelvic bones. The method of thinning is used for skeleton extraction [34]. Whether or not a boundary point is converted into a background point is determined by the neighboring relations, and the iterative process is ended up with a pixel width. Supposing that the foreground pixel value is 1, the background pixel value is 0, n(p1) is the number of non-zeros in the neighborhood, and s(p1) is the total number of adjacent pixels from 0 to 1. Figure 8 shows the relationship between a pixel p1 and eight-neighboring pixels. Whether a point can be removed or not depends on the eight adjacent points. In this paper, we consider mainly the following three cases [35]: υ1 indicates whether the deletion will cause region splitting or not; υ2 indicates whether it is the east, south, or northwest border; υ3 indicates whether it is the north, west, or southeast border. These values can be described as in (9).

$$ {\displaystyle \begin{array}{l}{\upsilon}_1=\left\{{p}_1\left|2\le n\left({p}_1\right)\le 6,s\left({p}_1\right)\right.=1\right\}\\ {}{\upsilon}_2=\left\{{p}_1\left|{p}_2\times {p}_4\times {p}_6=0,{p}_8\times {p}_4\times {p}_6=0\right.\right\}\\ {}{\upsilon}_3=\left\{{p}_1\left|{p}_2\times {p}_4\times {p}_8=0,{p}_8\times {p}_2\times {p}_6=0\right.\right\}\end{array}} $$
(9)
$$ {\displaystyle \begin{array}{l}{D}_1=\left\{{p}_1\left|{p}_1=1,{p}_1\in {\upsilon}_1,{p}_1\in {\upsilon}_2\right.\right\}\\ {}{D}_2=\left\{{p}_1\left|{p}_1\in {\upsilon}_1,{p}_1\in {\upsilon}_3\right.\right\}\end{array}} $$
(10)
Fig. 8
figure 8

The relationship between neighboring pixels

The skeleton extraction algorithm is divided into two steps. First, the boundary pixels that satisfy the condition D1 in the whole image are sought. The pixel points that we find are the ones that can be deleted. The pixels are converted to the background points after traversing the whole image. Then, the boundary pixels that meet condition D2 in the transformed image are sought. Similarly, the pixels are converted after traversing the whole image. After this process is the next iteration, until there are no points that can be deleted between the two steps. In addition, the two conditions are presented as in (10).

As for key frames, the skeleton of the foreground (I F ) and background (I B ) can be generated by utilizing the skeleton extraction algorithm. The skeleton is shown as S H  = Skel(I F , I B ) and it will be the marker points for the marker-based watershed algorithm.

The marker-based watershed algorithm

The marker-based watershed algorithm [36] is based on a gradient image. Therefore, creating a gradient image is the first step in this process. In this method, the gradient image is obtained using the Sobel operator. The convolution templates (S x , S y ) can be described as follows:

$$ {S}_x=\left[\begin{array}{ccc}1& 2& 1\\ {}0& 0& 0\\ {}-1& -2& -1\end{array}\right]\kern1.68em {S}_y=\left[\begin{array}{ccc}1& 0& -1\\ {}2& 0& -2\\ {}1& 0& -1\end{array}\right] $$
(11)

The gradient of pixel p1 can be gained by calculating the convolution of the templates and the eight neighboring pixels in Fig. 8. The solution is as follows:

$$ {\displaystyle \begin{array}{l}{G}_x={p}_9+2\times {p}_2+{p}_3-\left({p}_7+2\times {p}_6+{p}_5\right)\\ {}{G}_y={p}_9+2\times {p}_8+{p}_7-\left({p}_3+2\times {p}_4+{p}_5\right)\end{array}} $$
(12)

The gradient image can be described as f(x, y), and f(x, y) = [G X , G Y ]T, where G x is the gradient along the X direction, and G y is the gradient along the Y direction.

The skeleton S H is used to mark the gradient image f(x, y) At the same time, the original minimum pixels in gradient image will be shielded. In other words, in the marked gradient image, the local minimal value corresponds to the pixel region that is the non-zero area in the skeleton image. Finally, segmentation of the pelvic structure is achieved by using the watershed transform for the marked gradient image.

Results

The proposed segmentation method is tested on 20 patients. Each patient has 245 pelvic CT images. Patient 1 is used as an example to display the results from different stages of the method and they are presented in this section as follows.

Figure 9 shows the pre-processing results for the example. The results show that the edges of bone tissue are clearly defined by the window technology. Therefore, adjusting the window is necessary. In addition, the pelvis area can be extracted completely from the CT image after the removal of artifacts, platforms, etc.

Fig. 9
figure 9

The pre-processing results of Patient 1. (a) is the original CT image whose pixel plane is 512 × 512. (b) is the image obtained by windowing, the window width is 600 Hu, and the window level is 900Hu. (c) is the pelvis area without the CT table, cables, etc. (d) is the pre-processed image. The image size is adjusted by cutting out the non-pelvis area as much as possible

Figure 10 shows the results obtained by the watershed algorithm based on skeleton markers. The results show that the distribution of pelvic structure can be well described by the skeleton and that the pelvic structure can be segmented accurately in a single CT image using this method.

Fig. 10
figure 10

Example results of Patient 1 obtained by the watershed algorithm based on skeleton markers. (a) is the input image as well as the pre-processed image. (b) is the skeleton image the for foreground and indicates the location of the pelvic structure. (c) is the skeleton image for the background and indicates the location of the other structures. (d) is the segmentation result and it is shown in the pre-processed image. The automatically segmented pelvic structure contours are displayed in red

Figure 11 shows the segmentation results of four CT images that are consecutive in the anatomic structure by the proposed method. The results show that the distribution of pelvic structure is well described by the marker points that were obtained from the corresponding key frame. Furthermore, the results also indicate that the proposed method can reliably segment the CT image sequence.

Fig. 11
figure 11

Example results of Patient 1 obtained by the proposed method. (a) is the input image as well as the pre-processed image. (b) is the manual segmentation results for the corresponding key frame, and the edges are shown in green. (c) is the skeleton images. The skeleton of foreground is shown in blue and the skeleton of background is displayed in yellow. These skeletons will be used as marker points for the marker-based watershed algorithm. (d) is the segmentation result of the proposed method and it is shown in the pre-processed image. The automatically segmented pelvic structure contours are displayed in red

Figure 12 shows the segmentation results using the proposed method in fracture areas. (a), (b), (c) and (d) are consecutive and these images are from Patient 3. These images show the segmentation results of CT images with sacrum bone fracture. (e), (f), (g) and (h) are also consecutive and they are from Patient 7. And they display the contours of bones in ischium bone fracture area. As can be seen from Fig. 12, the proposed method can accurately segment the fracture area.

Fig. 12
figure 12

Example of segmentation results using proposed method in the fracture area. (a), (b), (c) and (d) are from Patient 3. (e), (f), (g) and (h) are from Patient 7. The segmentation results of the proposed method are shown in the original image. The manual segmentation results by experts are displayed in blue while the automatically segmented pelvic structure contours are displayed in red

Table 1 shows the number of key frames in each CT sequence. The results show that the ratio between the number of key frames and the total CT sequence number is approximately 13%. Thus, the number of images that need to be manually marked will be greatly reduced by extracting key frames.

Table 1 Shows the number of key frames in each CT sequence

Discussion

In this section, some measures are taken to evaluate the performance of the proposed algorithm. A quantitative comparison is performed between the manual segmentation and the computed segmentation. In addition, a three-dimensional display of the segmented bone structure is rendered. Besides, the performance of the watershed algorithm based on skeleton markers and gradient vector flow (GVF) is compared. And the running time of the proposed algorithm is also an important index.

Evaluation measures

The overlapping area and the mean deviation distance are used to quantify the accuracy [37]. In this section, 245 CT images will be manually segmented by experts for each patient and the results will be the ground truth.

S1 represents surface obtained by the proposed method and S2 represents surfaces obtained by experts manually. Besides, L1 represents a set of contour points obtained by the proposed method andL2 represents a set of contour points manually obtained by experts. As for two surfaces, we define A1 as the area for surface S1and A2as the area for surface S2.

The overlapping area O of two surfaces S1 and S2 is defined as:

$$ O=\frac{A_1\cap {A}_2}{A_2}\times 100\% $$
(13)

The mean deviation distance Mad of two contour points sets L1 and L2 is defined as:

$$ Mad=\frac{1}{K}\sum \limits_{n=1}^Kd\left({l}_n,{L}_2\right) $$
(14)

Where l n denotes each contour point inL1,Kdenotes the total number of contour points in L1. Deviation distance refers to the distance between the contours obtained by the proposed algorithm and the ground truth. The deviation distance between L1 and L2is defined as:

$$ d\left({l}_n,{L}_2\right)={\min}_{l_i\in {L}_2}{\left\Vert {l}_n-{l}_i\right\Vert}_2 $$
(15)

Five patients of the 20 patients are randomly selected, and their CT images are used to calculate the overlapping area O. The shapes presenting with an overlapping area of more than 90% are classified as “accurate”, the shapes presenting with an overlapping area of 90–80% are classified as “fair”, and the shapes presenting with an overlapping area of less than 80% are classified as “unacceptable” [37].

Table 2 shows the segmentation results and the overlapping area of CT images from five patients’. The results show that the average overlapping area of all the data is more than 94%. More than 93.5% of the results are classified as “accurate”, and less than 2% of the results are classified as “unacceptable”. The unacceptable results may be caused by blurred edges of bones, uneven gray value of bones, lack of appropriate key frames, etc.

Table 2 Shows the segmentation results and the overlapping area of CT images from five patients’

Figure 13 shows the mean deviation distance of fifteen testing images. These images are randomly selected from Patient 1 to Patient 5.The maximum mean deviation distance is 5.03 pixels, and the minimum mean deviation distance is approximately 0.58 pixels. Without the impact of the method, an unsmooth contour may also lead to a large mean deviation distance. The results show that the proposed method is reliable for accurate pelvic segmentation.

Fig. 13
figure 13

The mean deviation distance of fifteen testing images

After segmented sequences of CT slice images are obtained, a three-dimensional model of pelvic bone structure can be reconstructed utilizing these segmented bone structures. In addition, three-dimensional visualization may be used for further validating the accuracy of the proposed method. The segmentation results are visually inspected as shown in Fig. 14. As can be seen from this figure, the three-dimensional structure of pelvic bone is clearly presented and the detected fracture exits in the pelvic region.

Fig. 14
figure 14

Three-dimensional pelvic bone structure of Patient 4

The gradient vector flow (GVF) model

In this section, the segmentation performance using the watershed algorithm based on skeleton markers and the GVF model is compared. The GVF model was introduced by Xu and Prince [38] in 1998. In relation to the traditional snake model, the sensitivity of the initial curve position and the convergence of the concave area are improved. In this paper, the segmentation results of the key frames are taken as the initial curve in the GVF model.

Figure 15 shows the example of segmentation results using the two methods. As can be seen from Fig. 15b, the results of key frames cannot accurately describe the edges of pelvic bones in intervening images. However, the contours of bones in key frames can offer location and topology information of pelvic bones in the intervening images. Location information can be used to indicate the possible areas of bones and the topology information can be used to indicate the possible number of bones. From Fig. 15d, we can find that the information given by key frames cannot make GVF model obtain good segmentation results. Through this comparison of the segmentation results, it can be found that the watershed algorithm based on skeleton markers has better segmentation effect than the GVF model.

Fig. 15
figure 15

Example results using the watershed algorithm based on skeleton markers and the GVF model. These testing images are from Patient 1. (a) is the input image as well as the pre-processed image. (b) is the manual segmentation results of the corresponding key frame, and the edges are shown in green. (c) is the automatic segmentation results of the watershed algorithm based on skeleton markers, and the edges are shown in red. (d) is the segmentation results of the GVF model, the edges are shown in red and they are shown in the pre-processed image

Algorithm running time

In this section, the test environment is as follows: the CPU is Inter-Core i3–2120 3.3 GHZ, the system memory is 4 GB, and the programming environment is MatLabR2014. Table 3 shows the running time of the proposed method except for the time of manual marking. As we can see from the graph, the average running time is approximately 2 min for one patient with 245 CT images.

Table 3 Shows the Running Time of the Proposed Method

Interactive marking has an approximate time of 30 s for each key frame image because the experts only need to manually add or remove a small number of marker points. The experimental results show that there are approximately 30 CT images that are selected for manual marking. Therefore, the proposed method takes approximately 15 min for one patient with 245 CT images. Such performance suggests a considerable reduction in processing time compared with manual segmentation.

In conclusion, the proposed method not only ensures the accuracy of pelvic segmentation but also reduces the time of segmentation.

Conclusions

A novel segmentation method based on CT image sequences of the pelvis is presented in this paper. The key parts of the method are the method of key frame extraction and the watershed algorithm based on skeleton markers. By using the method of key frame extraction, the number of manually marked CT images decreases to approximately 30. Based on the method of skeleton extraction, the distribution of pelvic structure can be well described in key frames. Considering that anatomical structures have characteristics of spatial continuity, the skeleton can be used to describe the location and topological information of pelvic bones in other CT images. Appling the watershed algorithm based on skeleton markers can achieve accurate, fast and efficient segmentation of pelvic structure. And segmentation results not only provide an important reference for early diagnosis and decisions regarding surgical procedures, they also offer more accurate data for medical image registration, recognition and 3D reconstruction.

In this paper, the performance of the proposed method depends on the accuracy of key frame extraction. When a crucial CT image is not selected, the accuracy of segmentation using the method will be affected. In addition, the performance is also affected by blurred edges of bones, variation of bones, etc. since the watershed algorithm is applied.

Future work will focus on the following points: 1) Propose a more efficient method to extract key frames. This method should ensure that the key frames are not redundant and that crucial CT images will not be missing. In addition, the running time of the method should also be considered. 2) Apply this method to segment other organs. 3) Using this method to process MRI and ultrasound images. 4) Explore a method for 3D reconstruction relying only on the key frames.

In conclusion, the proposed method is able to achieve accurate, fast, and efficient segmentation of pelvic CT image sequences.

Abbreviations

CT:

Computed tomography

GVF:

The gradient vector flow (GVF) model

References

  1. Naumann RB, Dellinger AM, Zaloshnja E, Lawrence BA, Miller TR. Incidence and total lifetime costs of motor vehicle-related fatal and non-fatal injury by road user type in the United States. Traffic Inj Prev. 2010;11(4):353.

    Article  PubMed  Google Scholar 

  2. Management of Exsanguinating Pelvis Injuries:An algorithm for the management of exsanguinating pelvic trauma. http://www.trauma.org/index.php/main/article/668/.Accessed 20 May 2008.

  3. Fulkerson EW, Egol KA. Timing issues in fracture management: a review of current concepts. Bull Hosp Jt Dis. 2009;67(1):58.

    Google Scholar 

  4. Zhao JB, Liu J. Application value of three-dimensional CT reconstruction in diagnosis and treatment of pelvic fracture-dislocation. China J Orthop Traumatol. 2006;19(7):403–4.

  5. Foshager MC, Walsh JW. CT anatomy of the female pelvis: a second look. Radiographics A Rev Publ Radiol Soc North America Inc. 1994;14(1):64–6.

    Google Scholar 

  6. Najarian K, Vasilache S, Smith R, et al. Accurate pelvic fracture detection for X-ray and CT images: US, US 8538117 B2[P]. 2013. http://www.freepatentsonline.com/8538117.pdf.

  7. Dalinka MK, Arger P, Coleman B. CT in pelvic trauma. Orthop Clin N Am. 1985;16(3):471–80.

    CAS  Google Scholar 

  8. Collier DC, Burnett SS, Amin M, Bilton S, Brooks C, Ryan A, Roniger D, Tran D, Starkschall G. Assessment of consistency in contouring of normal-tissue anatomic structures. J Appl Clin Med Phys. 2003;4(1):17–24.

    Article  PubMed  PubMed Central  Google Scholar 

  9. Norouzi A, Rahim MSM, Altameem A, Saba T, Rad AE, Rehman A, Uddin M. Medical image segmentation methods, algorithms, and applications. IETE Tech Rev. 2014;31(3):199–213.

    Article  Google Scholar 

  10. Li Y. Bone segmentation in human CT images. J Biomed Eng. 2004;21(2):169.

    Google Scholar 

  11. Natsheh AR, Ponnapalli PV, Anani N, Benchebra D. Segmentation of bone structure in sinus CT images using self-organizing maps. In: IEEE international conference on imaging systems and techniques: 2010; 2010. p. 294–9.

    Chapter  Google Scholar 

  12. Aguirreramos H, Avinacervantes JG, Cruzaceves I. Automatic bone segmentation by a Gaussian modeled threshold. In: Medical Physics: Fourteenth Mexican Symposium on Medical Physics: 2016; 2016. p. 883–94.

    Google Scholar 

  13. Janc K, Tarasiuk J, Bonnet AS, Lipinski P. Genetic algorithms as a useful tool for trabecular and cortical bone segmentation. Comput Methods Prog Biomed. 2013;111(1):72–83.

    Article  CAS  Google Scholar 

  14. Mehnert A, Jackway P. An improved seeded region growing algorithm. Pattern Recogn Lett. 1997;18(10):1065–71.

    Article  Google Scholar 

  15. Vasilache S, Ward K, Cockrell C, Ha J, Najarian K. Unified wavelet and gaussian filtering for segmentation of CT images; application in segmentation of bone in pelvic CT images. Bmc Med Inform Decis Making. 2009;9(Suppl 1):1–8.

    Article  Google Scholar 

  16. Nguyen NT, Branzan-Albu A, Branzan-Albu A. A new segmentation method for MRI images of the shoulder joint. In: Computer and robot vision, 2007 CRV '07 fourth Canadian conference on: 2007; 2007. p. 329–38.

    Google Scholar 

  17. Kim E, Li H, Huang X. A hierarchical image clustering cosegmentation framework. In: Computer Vision and Pattern Recognition, vol. 2012; 2012. p. 686–93.

    Google Scholar 

  18. Camastra F, Vinciarelli A. Machine learning for audio, image and video analysis: theory and applications. London: Springer; 2008.

    Book  Google Scholar 

  19. Mantas J. Methodologies in pattern recognition and image analysis—a brief survey. Pattern Recogn. 1987;20(1):1–6.

    Article  Google Scholar 

  20. Calder J, Tahmasebi AM. A variational approach to bone segmentation in CT images. Proc SPIE Int Soc Opt Eng. 2011;7962(3):346–54.

    Google Scholar 

  21. Truc PTH, Lee S, Kim TS. A density distance augmented Chan-Vese active contour for CT bone segmentation. In: Conference: international conference of the IEEE engineering in Medicine & Biology Society IEEE engineering in Medicine & Biology Society Conference, vol. 2008; 2008. p. 482–5.

    Google Scholar 

  22. Martínez F, Romero E, Dréan G, Simon A, Haigron P, De CR, Acosta O. Segmentation of pelvic structures for planning CT using a geometrical shape model tuned by a multi-scale edge detector. Phys Med Biol. 2014;59(6):1471.

    Article  PubMed  PubMed Central  Google Scholar 

  23. Chen L, Chen Y, Wang Z, Zhao W, Chen J. Segmentation of the pelvic bone using a generalized gradient vector convolution field snake model. J Med Imaging Health Inform. 2015;5(7):1482–7.

  24. Gao Y, Shao Y, Lian J, Wang AZ, Chen RC, Shen D. Accurate segmentation of CT male pelvic organs via regression-based deformable models and multi-task random forests. IEEE Trans Med Imaging. 2016;35(6):1532–43.

    Article  PubMed  PubMed Central  Google Scholar 

  25. Kardell M, Magnusson M, Sandborg M, Carlsson GA, Jeuthe J, Malusek A. Automatic segmentation of pelvis for brachytherapy of prostate. Radiat Prot Dosim. 2016;169(1–4):398.

    Article  CAS  Google Scholar 

  26. Wu J, Davuluri P, Ward K, Cockrell C, Hobson R, Najarian K. A new hierarchical method for multi-level segmentation of bone in pelvic CT scans. Annual International Conference of the IEEE Engineering in Medicine and Biology Society. 2011; 2011(4):3399–3402.

  27. Li D, Zang P, Chai X, Cui Y, Li R, Lei X. Automatic multiorgan segmentation in CT images of the male pelvis using region-specific hierarchical appearance cluster models. Med Phys. 2016;43(10):5426.

    Article  PubMed  PubMed Central  Google Scholar 

  28. Zhang W, Liu J, Yao J, Summers RM. Segmenting the thoracic, abdominal and pelvic musculature on CT scans combining atlas-based model and active contour model. Proc SPIE Int Soc Opt Eng. 2013;8670(4):08.

    Google Scholar 

  29. Chen X, Bagci U. 3D automatic anatomy segmentation based on iterative graph-cut-ASM. Med Phys. 2011;38(8):4610.

    Article  PubMed  PubMed Central  Google Scholar 

  30. Bakir H, Charfi M, Zrida J. Automatic active contour segmentation approach via vector field convolution. Signal Image Video Proc. 2016;10(1):1–10.

    Article  Google Scholar 

  31. Steuer R, Kurths J, Daub CO, Weise J, Selbig J. The mutual information: detecting and evaluating dependencies between variables. Bioinformatics. 2002;18(suppl_2):S231–40.

    Article  PubMed  Google Scholar 

  32. Gao GZ, Zhongwu LI, Lifu YU. Application of the normalized cross correlation coefficient in image sequence object detection. Comput Eng Sci. 2005;27(3):38–40.

    Google Scholar 

  33. Cornea N D, Silver D, Min P. Curve-skeleton properties, applications, and algorithms. IEEE Transactions on Visualization & Computer Graphics. 2007;13(3):530.

  34. Shu C, Mo Y. Morphological thinning based on image's edges. In: International conference on communication technology proceedings: 1998, vol. 1; 1998. p. 5.

    Google Scholar 

  35. Hui YU, Changsheng QU, Jinhang LI. Method for hand profile extraction in complicated conditions. Comput Eng Appl. 2015;51(14):170–4.

    Google Scholar 

  36. Wiese T, Yao J, Burns JE, Summers RM. Detection of sclerotic bone metastases in the spine using watershed algorithm and graph cut. Proc SPIE. 2012;8315(6):36.

    Google Scholar 

  37. Wu J, Belle A, Hargraves RH, Cockrell C, Tang Y, Najarian K. Bone segmentation and 3D visualization of CT images for traumatic pelvic injuries. Int J Imaging Syst Technol. 2014;24(1):29–38.

    Article  Google Scholar 

  38. Xu C, Prince JL. Generalized gradient vector flow external forces for active contours 1. Signal Process. 1998;71(2):131–9.

    Article  Google Scholar 

Download references

Acknowledgments

The dataset used for this project was provided by Tianjin Medical University General Hospital and Tianjin Nankai Hospital.

Funding

This work was supported by the National Natural Science Foundation of China (NSFC) under Grant 61475116.

Availability of data and materials

The datasets generated and analysed during the current study are available from the corresponding author on reasonable request.

Author information

Authors and Affiliations

Authors

Contributions

HY and HW conceived and designed the experiments; HW performed the experiments; YS and XY analyzed the data; KX and YC provided dataset and medical guidance; HW wrote the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Haijun Wang.

Ethics declarations

Ethics approval and consent to participate

The dataset used for this project was provided by Tianjin Medical University General Hospital and Tianjin Nankai Hospital. And the study was approved by the Ethical Committee of Tianjin Medical University General Hospital and was conducted under the Declaration of Helsinki.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Yu, H., Wang, H., Shi, Y. et al. The segmentation of bones in pelvic CT images based on extraction of key frames. BMC Med Imaging 18, 18 (2018). https://doi.org/10.1186/s12880-018-0260-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12880-018-0260-x

Keywords