Evaluation of three methods for delineation and attenuation estimation of the sinus region in MR-based attenuation correction for brain PET-MR imaging

Lindén, Jani; Teuho, Jarmo; Teräs, Mika; Klén, Riku

doi:10.1186/s12880-022-00770-0

Research
Open access
Published: 17 March 2022

Evaluation of three methods for delineation and attenuation estimation of the sinus region in MR-based attenuation correction for brain PET-MR imaging

Jani Lindén^1,2^na1,
Jarmo Teuho^1,3^na1,
Mika Teräs^3,4 &
…
Riku Klén¹

BMC Medical Imaging volume 22, Article number: 48 (2022) Cite this article

1308 Accesses
3 Altmetric
Metrics details

Abstract

Background

Attenuation correction is crucial in quantitative positron emission tomography-magnetic resonance (PET-MRI) imaging. We evaluated three methods to improve the segmentation and modelling of the attenuation coefficients in the nasal sinus region. Two methods (cuboid and template method) included a MRI-CT conversion model for assigning the attenuation coefficients in the nasal sinus region, whereas one used fixed attenuation coefficient assignment (bulk method).

Methods

The study population consisted of data of 10 subjects which had undergone PET-CT and PET-MRI. PET images were reconstructed with and without time-of-flight (TOF) using CT-based attenuation correction (CTAC) as reference. Comparison was done visually, using DICE coefficients, correlation, analyzing attenuation coefficients, and quantitative analysis of PET and bias atlas images.

Results

The median DICE coefficients were 0.824, 0.853, 0.849 for the bulk, cuboid and template method, respectively. The median attenuation coefficients were 0.0841 cm⁻¹, 0.0876 cm⁻¹, 0.0861 cm⁻¹ and 0.0852 cm⁻¹, for CTAC, bulk, cuboid and template method, respectively. The cuboid and template methods showed error of less than 2.5% in attenuation coefficients. An increased correlation to CTAC was shown with the cuboid and template methods. In the regional analysis, improvement in at least 49% and 80% of VOI was seen with non-TOF and TOF imaging. All methods showed errors less than 2.5% in non-TOF and less than 2% in TOF reconstructions.

Conclusions

We evaluated two proof-of-concept methods for improving quantitative accuracy in PET/MRI imaging and showed that bias can be further reduced by inclusion of TOF. Largest improvements were seen in the regions of olfactory bulb, Heschl's gyri, lingual gyrus and cerebellar vermis. However, the overall effect of inclusion of the sinus region as separate class in MRAC to PET quantification in the brain was considered modest.

Peer Review reports

Background

Attenuation correction (AC) of the bone is of paramount importance for the quantitative accuracy of positron emission tomography (PET), especially in the brain. Transmission-based attenuation correction (TXAC) with a rotating ⁶⁸Ge rod source can be considered as a “gold standard” method for attenuation correction, allowing to measure the attenuation coefficients directly. Whereas, computed tomography (CT)-based attenuation correction (CTAC) could be considered as a “silver standard” [1], where bilinear scaling method of CT Hounsfield Units (HU) to attenuation correction factors is used to derive an attenuation map (µ-map).

Contrary to TXAC or CTAC, magnetic resonance based attenuation correction (MRAC) does not allow a direct measurement of electron density information. This is because magnetic resonance (MR) images represent the tissue proton densities and tissue relaxation values, which cannot be directly translated to electron density information. For accurate attenuation correction on PET-MRI systems, several methods have been introduced, where the advanced methods have achieved an accuracy of ± 5% in quantitative accuracy of PET when compared to CTAC [2]. These methods can be roughly divided into atlas-based, emission-based and segmentation-based methods. These methodologies and the physical basis of attenuation correction have been discussed in review articles [3,4,5,6].

To shortly summarize the main methods for MRAC, atlas and template-based methods allow to calculate a pseudo-CT, or a synthetic-CT image, which can be created by, for example, co-registering a single CT or a database of CT images to the individual anatomy of a subject presented by the MR image [7,8,9]. Recently, several methods based on machine learning have also been introduced [10]. Emission-based methods use raw PET data to mathematically reconstruct the attenuation sinogram for MRAC [4, 11,12,13]. Furthermore, on time-of-flight (TOF) compatible PET-MRI systems, incorporating TOF data in the image reconstruction might bring further increases in quantitative accuracy [14, 15].

In addition to these approaches, the MR images can be segmented into different tissue classes. When using conventional T1/T2-weighted MRI sequences, the accurate delineation of bone becomes challenging, as bone has a similar T2* relaxation time as air. However, with advanced segmentation methods, such as probabilistic atlas-based segmentation [16], bone can be segmented also with conventional MRI sequences. In addition, ultra-short echo time (UTE) sequences and zero echo time (ZTE) sequences allow to visualize and segment bone directly from MR images [17, 18]. Advanced segmentation-based methods have shown very good accuracy when compared to emission- and atlas-based methods [2]. However, there are a few remaining challenges in segmentation-based MRAC, which still need to be addressed.

One of the remaining challenges is to accurately segment and assign attenuation coefficients to regions such as the nasal sinuses, which contain a mixture of bone, air and soft tissue [19]. Manual contouring of the nasal sinus region has been suggested [20, 21], but preferably, the subject-specific mixtures of different tissues should be accounted on an individual basis in an automated way. Previously, regional masks and templates have been suggested to improve the segmentation accuracy in these regions [22,23,24]. In addition, inclusion of TOF in image reconstruction may further reduce inaccuracies in these regions [14]. Thus, it is important to study alternative approaches to accurately delineate and to implement methods to accurately estimate the attenuation coefficients in this region. Furthermore, the accuracy of the methodology should be investigated with and without TOF.

For the basis of evaluation in this paper, we used a segmentation-based MRAC method based on Statistical Parametric Mapping software (SPM8) introduced in [25, 26]. The method was modified to use the segmentation engine in SPM12 and included three different methods for tissue delineation in the sinus region. Two new methods were developed to provide a proof of concept for accounting the subject-specific variation of the attenuation coefficients in the sinus region. The method accuracy across the brain were evaluated with and without TOF reconstruction, in regard to the activity recovery in PET.

Materials and methods

Clinical subject data

The data consisted of 10 subjects (three males, seven females), suspected of memory disorders, which had undergone a PET-CT and PET-MRI scan, were used in this retrospective study. The subjects had undergone both PET-CT and PET-MRI during the same imaging session (seven subjects) or PET-CT and PET-MRI performed on separate days (9 days, 6 months 19 days and 7 months 3 days apart). The mean and standard deviation of subject age, weight, and injected dose of [¹⁸F]-FDG were: 53 ± 14 years, 73 ± 19 kg, and 277 ± 47 MBq. The mean ± standard deviations of the scan start times were 80 ± 20 min after the injection for the group which had undergone imaging during subsequent sessions. All PET-CT and PET-MRI acquisitions were performed using the standard protocol for neurological imaging.

The study was conducted as a retrospective registry study at Turku PET Centre (study number: T7/2021). The need to acquire an active informed consent from the individuals included in the study was waived and the study protocol was approved by Turku University Hospital Research Board and the legislative team. The requirements for ethical review in Finland are stipulated primarily in the Medical Research Act (488/1999, as amended) and the Act of the Medical Use of Human Organs, Tissues and Cells (101/2001 as amended). Ethical review is statutorily required for interventional medical research and some circumstances for studies on human organs, tissues or cells. According to Finnish legislation, no ethical assessment or approval by an independent review board is mandatory for registry studies. The study was conducted in accordance with the Declaration of Helsinki.

PET-MRI system

The MRI and PET acquisitions were performed sequentially with the Philips Ingenuity TF PET-MRI (Philips Healthcare, Cleveland, OH, USA). Ingenuity TF is a combination of a PET subsystem (Gemini TF with TOF capabilities) and an MRI system (Achieva 3 T X-series). The PET system has 4 mm × 4 mm × 22 mm lutetium oxyorthosilicate crystals arranged in 28 detector modules. The PET system has a 180 mm axial field of view (FOV) and a coincidence window of 6 ns with an energy window of 460–665 keV. The maximum gradient strength of the MRI system is 40 mT/M and the slew rate is 200 T/m/s. The performance of the system is evaluated in more detail in [27].

MRI and PET image acquisition

Anatomical MRI (atMR) from the vendor-based MRAC acquisition was used to derive all MRI-based μ-maps. The atMR is a T1-weighted 3-D fast field echo (FFE) sequence with 2 mm isotropic voxel size. The acquisition parameters were: echo time (TE) of 2.16 ms, FOV 320 × 320 mm², flip angle (FA) of 10°, repetition time (TR) of 4.18 ms, and an acquisition time of 84 s. The geometry correction on the MRI system was applied with the option value set as "default”. The SENSE 8-channel head coil was used for all PET-MRI imaging. For PET, a 15-min acquisition was performed in list-mode over one bed position covering the head region with transaxial acquisition FOV of 256 × 256 mm².

CT and PET image acquisition

For reference measurement for attenuation correction, CTAC data from the PET-CT examination was used. The Discovery 690 PET-CT (General Electric Healthcare, Milwaukee, WI, USA) was used to perform the CTAC acquisitions. The CT system is a LightSpeed volume computed tomography with 64 slices. All CTAC maps were acquired with a low-dose CT acquisition protocol using a tube voltage of 120 kV and 10 mAs with dose modulation. The physical performance of the PET system can be found in [28].

MR image segmentation and tissue classification

The method in this paper is based on the one described in [25, 26, 29]. The method uses tissue probability maps from SPM8 to create MR-based attenuation maps. The probability maps are segmented from T1-weighted MR images by use of the New Segment function in SPM8 (Wellcome Trust Centre for Neuroimaging, University College London, UK).

In this paper, we implemented the following modifications in the original method. First, the method was updated to use the Segment function in SPM12. As SPM12 introduces two new parameters for post-processing of the tissue probability maps and alters one parameter, adjustment of the segmentation settings in SPM12 was needed for optimal bone delineation. The final parameters used for the Segment function in SPM12 were very light (0.0001) for bias regularization (2, 2, 2, 3, 4, 2) for number of gaussians, 0 for strength of the Markov random field (MRF) cleanup, no cleanup, (0, 0.001, 0.5, 0.05, 0.2) for warping regularization and 0 mm for the smoothness parameter. These parameters were then used to segment the T1-weighted MR images to grey matter, white matter, cerebrospinal fluid, scalp, skull, and air.

Thereafter, separate binary masks for each tissue class were created, which were then combined and given discrete attenuation coefficients. The threshold values for segmenting tissues were set to: 0.25 for air, 0.50 for soft tissue, 0.25 for bone, and 0.5 for brain tissues. The final attenuation map consisted of four tissue classes with attenuation coefficients of 0.0 cm⁻¹ for air, 0.096 cm⁻¹ for soft tissue, 0.985 cm⁻¹ for brain tissue, and 0.151 cm⁻¹ for bone, as specified in [25, 30]. All of the processing is performed in an automated MATLAB 2018b pipeline to derive MRI-based μ-maps for PET-MR image reconstruction.

Image processing and modelling workflow for the sinus cavity

As there is no separate class for the sinus region in the Segment function of SPM12, it is mostly classified as bone. However, this region is anatomically a mix of air, soft tissue and fine bone. To take this challenging region into account, we modified the image processing pipeline to include three alternative methods for sinus delineation. The first method is called the bulk method and it was implemented directly as described in [29].

In short, the bulk method is based on registration of a CT template, to create a specific mask in the sinus region to assign a fixed attenuation coefficient of 0.100 cm⁻¹ for that region. The drawback of the method is that it needs an additional registration of a CT-based template to delineate the sinuses, assuming that the subject has normal anatomy. Another drawback is the use of a bulk assignment of an attenuation coefficient.

Two new methods presented in this study are called the cuboid method and the template method. The cuboid method is based on matching a cuboid-shaped mask to the individual anatomy of the subject. In the template method, the cuboid is inverse transformed from a template in Montreal Neurological Institute (MNI) space to subject anatomy by using the vector fields from SPM12 Segment function. Finally, the voxels within the cuboids are converted into air and soft tissue using a MRI-CT conversion model. The processing pipelines are described in detail below.

Processing pipeline for the cuboid method

First, an initial binary mask of the air inside the sinus cavity is created by intersection of air and soft tissue masks. The initial mask includes the throat, large air cavities in the sinus, and voxels that do not match with anatomical locations of air cavities. To delineate the air cavities only, the mask was summed across all slices, and the slice sums were made into a line plot. An initial search for the largest air cavities was performed using the largest local maximum preceding the largest drop in the inside air as a criteria for finding the optimal slice. Air cavities in this slice were then chosen by a center of mass method. Thereafter, a region-growing algorithm was applied to delineate all air cavities. The entire air cavity segmentation process is described in detail in Appendix A. After air delineation, a cuboid is fitted to the resulting mask to delineate the region applied for the conversion model.

To get an anatomically fitting cuboid, a set of cuboids are fitted around the air cavities. A detailed description of this method can be found in Appendix B. A large cuboid is placed initially in the front half of the head extending above and below the air cavities. Then sub-cuboids within the maximum cuboid are fitted and evaluated. The best cuboid is a tradeoff between the amount of bone segment covered, and the size of the cuboid, where the final cuboid is selected based on the following criteria:

$$CG = \frac{{\mathop \sum \nolimits_{i \in C} p_{i} }}{{\left( \frac{c}{4} \right)^{2} }},$$

(1)

where CG is the value of the cuboid goodness, C is the cuboid being tested, c is the perimeter of the cuboid and p_i is the bone segment probability value for the voxel i. In this way, the more voxels with considerable bone probability there are within the cuboid, the higher its goodness CG will be. The sub-cuboid with the largest CG is then selected as basis for further processing.

Processing pipeline for the template method

An alternative method for delineating the sinus cavities was evaluated based on matching a cuboid template to an individual anatomy by inverse transformation. This method is based on a cuboid template in MNI space, similarly to the regional masks used in [23]. The inverse transformation fields from the Segment function in SPM12 are used to transform the cuboid template back to the individual space of the subject. The resulting mask may no longer strictly be a square cuboid since the subjects’ heads can be slightly tilted and morphed in various directions. The inverse transformed cuboid is then used as a basis for further processing.

MRI-CT conversion model for the sinus region

Once the cuboid location was fixed for either of the methods, all bone segment voxels were converted to air, soft tissue, or a mix of both using an MRI-CT conversion model. The model was derived based on the 10 subjects as follows. The data scatter plot (Fig. 1; Additional file 1: Fig. S2) of the MRI and CT values in the bone segment voxels within the cuboid masks showed that there are concentration points around − 1000 and 0 HU values on a wide range of MRI values and a smaller amount of points scattered around these two lines. Thus, a three-step conversion model was implemented which followed a horizontal line at − 1000 HU (corresponding to air), a diagonal line from − 1000 to 0 HU (corresponding to mix of different tissues), and a horizontal line at 0 HU (corresponding to soft tissue).

The start and end points from − 1000 to 0 HU were estimated from the data. By using randomly selected 100 voxels in the MRI images from an sinus mask of one subject, possible combinations for MRI-CT values were fitted, where a square sum of the shortest distance between the MRI value at − 1000 HU and MRI value at 0 HU was used for optimization criteria. This randomization and fitting process was repeated 100 times for each subject, effectively creating a bootstrap estimate. For the final MRI-CT conversion model, the average over all 100 randomized estimates for MRI values at − 1000 HU and 0 HU were selected as final conversion points.

PET image reconstruction

Three sets of MRI-based attenuation maps were created for each subject. The only difference between the maps was the method used to delineate the sinus region. CTAC data was used as the reference method for attenuation correction. CTAC data was converted from HU to linear attenuation coefficients using the bilinear transform described in [31]. In cases where the attenuation map in CTAC did not cover the entire head of the subject, the attenuation map was complemented by soft tissue from MRAC.

To ensure accurate anatomical alignment, all CTAC and MRAC images were realigned and co-registered to non-attenuation corrected PET images. Co-registration was performed using rigid image registration with normalized mutual information as implemented in SPM12. The images were then resliced to the same voxel dimensions as used for the clinical MRAC. All images were smoothed into PET resolution of 5 mm to match the PET-MRI intrinsic resolution.

Thereafter, the MRAC and CTAC images were imported with the raw PET data to the PET-MRI reconstruction system and PET images were reconstructed using TOF and non-TOF reconstruction algorithms. For non-TOF reconstruction, a LOR-RAMLA with two iterations and 33 subsets, matrix size of 128 × 128 × 90 and isotropic voxel size of 2 mm was used. The reconstruction parameters used were alpha = 6.3716, radius = 2.8, blob increment = 2.0375, and relaxation parameter = 0.035. For TOF reconstruction, a TOF-BLOB-OSEM algorithm with 3 iterations and 33 subsets, matrix size of 128 × 128x90 and a voxel size of 2 mm was applied. The reconstruction parameters used were alpha = 4.1338, radius = 2.3, blob increment = 2.0375 and relaxation parameter = 1.0.

All reconstructions included the necessary corrections for image quantification, including random events, scatter, dead time, decay, and normalization. The head coil template and patient table were inserted automatically by the reconstruction software.

MR-based attenuation map and PET evaluation

DICE analysis

DICE coefficients were calculated from the sinus region and from the entire image stack. The sinus region was defined as slices ranging from 50 to 70, which corresponds to 4.2 cm in size. Estimating the DICE coefficients over the sinus region allows to estimate bone delineation accuracy in that region. Whereas, measurement over image volume is indicative of the accuracy in skull bone delineation. DICE coefficients for CTAC (A) and MRAC (B) were calculated in Eq. 2:

$$DICE = \frac{{2 \left| {(A \cap B)} \right|}}{\left| A \right| + \left| B \right|},$$

(2)

where $\left| X \right|$ denotes the number of voxels in the binary mask X. Voxels where HU > 157 were included in the masks.

An HU threshold of 157 HU was used throughout DICE analysis, which corresponds to attenuation coefficient of 0.105 cm⁻¹, which is higher than soft tissue, but slightly lower than the reported attenuation coefficient for spongy bone at 0.11 cm⁻¹ [32]. This threshold value was selected as it has been used previously as a boundary value between bone and non-bone tissue [33]. We report the change in the DICE coefficients as percentages between cuboid and template methods compared to the bulk method. To indicate the general trend, median change is also presented.

Correlation analysis

Correlation analysis was performed to investigate the accuracy of the skull bone delineation with each of the methods. Correlation data was sampled from each subject separately. Again, voxels with HU value > 157 were included in the analysis. Each subject was sampled 100 times for 200 random voxel pairs. Due to the sample size, Pearson correlation was used. These correlations were then plotted in a boxplot for each method to evaluate the distribution of the values.

Attenuation coefficient analysis

The average attenuation coefficient in the sinus region was calculated for all attenuation maps. For the analysis, a volume of interest (VOI) consisting of the nasal sinuses was delineated manually over CTAC in Carimas 2.9 (Turku PET Centre, Turku, Finland). The same VOI was then applied on the three MRAC methods.

A leave-one-out method to test out-of-sample accuracy of the MRI-CT conversion model was performed. In the leave-one-out validation, data for a single subject was removed and the conversion model was calculated for the remaining nine subjects. The resulting model would then be used to perform MRI-CT conversion for the removed subject. This process was then repeated for all subjects.

VOI analysis of PET images

In quantitative evaluation of reconstructed PET images, automatic VOI analysis was employed using a well-established anatomical atlas provided in automated anatomical labeling software [34], using in total 35 anatomical VOI from the grey matter. Individualization of the atlas was based on the spatial mapping from the MNI space to individual space using the vector fields extracted by the Segment function in SPM12. The atlas image was masked in the individual space with a grey matter mask with a lower threshold of 0.8.

In the analysis, we calculated the relative difference of the total mean activity in each VOI between CTAC and MRAC reconstructed PET images. The mean relative difference was calculated for each VOI using Eq. 3:

$$VOIDiff_{rel} = \frac{PETMRAC - PETCTAC}{{PETCTAC}},$$

(3)

where PETMRAC denotes the VOI activity measured from MRAC reconstructed PET with different MRI-based μ-maps, while PETCTAC denotes the VOI activity measured from CTAC reconstructed PET. We report the median of the average relative differences over all subjects per VOI.

Visual evaluation of PET Atlas and bias Atlas images

Atlas PET images representing the mean and standard deviation of PET uptake (kBq/mL) across the subject group were calculated with all MRAC methods and CTAC. In the evaluation of global bias distribution, mean bias atlas images were calculated as described by [25, 35]. All atlas images were masked with a mask covering the entire brain of the subject.

Results

Visual inspection

Figure 2 shows a visual inspection of CTAC and MRAC images. The air pathways and regions with mixture of tissue are better delineated with cuboid and template methods. Of note is that these regions are often mapped as bone in the bulk method, resulting in overestimation of attenuation.

Figures 3 and 4 show the mean and standard deviation PET atlas images using CTAC and different MRAC methods. The PET uptake values are very similar between the methods. Very slight differences can be noted visually in the cerebellum, in the mid-brain region and in the regions of the occipital and parietal lobes in both the mean and standard deviation images.