Clinical subject data
The data consisted of 10 subjects (three males, seven females), suspected of memory disorders, which had undergone a PET-CT and PET-MRI scan, were used in this retrospective study. The subjects had undergone both PET-CT and PET-MRI during the same imaging session (seven subjects) or PET-CT and PET-MRI performed on separate days (9 days, 6 months 19 days and 7 months 3 days apart). The mean and standard deviation of subject age, weight, and injected dose of [18F]-FDG were: 53 ± 14 years, 73 ± 19 kg, and 277 ± 47 MBq. The mean ± standard deviations of the scan start times were 80 ± 20 min after the injection for the group which had undergone imaging during subsequent sessions. All PET-CT and PET-MRI acquisitions were performed using the standard protocol for neurological imaging.
The study was conducted as a retrospective registry study at Turku PET Centre (study number: T7/2021). The need to acquire an active informed consent from the individuals included in the study was waived and the study protocol was approved by Turku University Hospital Research Board and the legislative team. The requirements for ethical review in Finland are stipulated primarily in the Medical Research Act (488/1999, as amended) and the Act of the Medical Use of Human Organs, Tissues and Cells (101/2001 as amended). Ethical review is statutorily required for interventional medical research and some circumstances for studies on human organs, tissues or cells. According to Finnish legislation, no ethical assessment or approval by an independent review board is mandatory for registry studies. The study was conducted in accordance with the Declaration of Helsinki.
PET-MRI system
The MRI and PET acquisitions were performed sequentially with the Philips Ingenuity TF PET-MRI (Philips Healthcare, Cleveland, OH, USA). Ingenuity TF is a combination of a PET subsystem (Gemini TF with TOF capabilities) and an MRI system (Achieva 3 T X-series). The PET system has 4 mm × 4 mm × 22 mm lutetium oxyorthosilicate crystals arranged in 28 detector modules. The PET system has a 180 mm axial field of view (FOV) and a coincidence window of 6 ns with an energy window of 460–665 keV. The maximum gradient strength of the MRI system is 40 mT/M and the slew rate is 200 T/m/s. The performance of the system is evaluated in more detail in [27].
MRI and PET image acquisition
Anatomical MRI (atMR) from the vendor-based MRAC acquisition was used to derive all MRI-based μ-maps. The atMR is a T1-weighted 3-D fast field echo (FFE) sequence with 2 mm isotropic voxel size. The acquisition parameters were: echo time (TE) of 2.16 ms, FOV 320 × 320 mm2, flip angle (FA) of 10°, repetition time (TR) of 4.18 ms, and an acquisition time of 84 s. The geometry correction on the MRI system was applied with the option value set as "default”. The SENSE 8-channel head coil was used for all PET-MRI imaging. For PET, a 15-min acquisition was performed in list-mode over one bed position covering the head region with transaxial acquisition FOV of 256 × 256 mm2.
CT and PET image acquisition
For reference measurement for attenuation correction, CTAC data from the PET-CT examination was used. The Discovery 690 PET-CT (General Electric Healthcare, Milwaukee, WI, USA) was used to perform the CTAC acquisitions. The CT system is a LightSpeed volume computed tomography with 64 slices. All CTAC maps were acquired with a low-dose CT acquisition protocol using a tube voltage of 120 kV and 10 mAs with dose modulation. The physical performance of the PET system can be found in [28].
MR image segmentation and tissue classification
The method in this paper is based on the one described in [25, 26, 29]. The method uses tissue probability maps from SPM8 to create MR-based attenuation maps. The probability maps are segmented from T1-weighted MR images by use of the New Segment function in SPM8 (Wellcome Trust Centre for Neuroimaging, University College London, UK).
In this paper, we implemented the following modifications in the original method. First, the method was updated to use the Segment function in SPM12. As SPM12 introduces two new parameters for post-processing of the tissue probability maps and alters one parameter, adjustment of the segmentation settings in SPM12 was needed for optimal bone delineation. The final parameters used for the Segment function in SPM12 were very light (0.0001) for bias regularization (2, 2, 2, 3, 4, 2) for number of gaussians, 0 for strength of the Markov random field (MRF) cleanup, no cleanup, (0, 0.001, 0.5, 0.05, 0.2) for warping regularization and 0 mm for the smoothness parameter. These parameters were then used to segment the T1-weighted MR images to grey matter, white matter, cerebrospinal fluid, scalp, skull, and air.
Thereafter, separate binary masks for each tissue class were created, which were then combined and given discrete attenuation coefficients. The threshold values for segmenting tissues were set to: 0.25 for air, 0.50 for soft tissue, 0.25 for bone, and 0.5 for brain tissues. The final attenuation map consisted of four tissue classes with attenuation coefficients of 0.0 cm−1 for air, 0.096 cm−1 for soft tissue, 0.985 cm−1 for brain tissue, and 0.151 cm−1 for bone, as specified in [25, 30]. All of the processing is performed in an automated MATLAB 2018b pipeline to derive MRI-based μ-maps for PET-MR image reconstruction.
Image processing and modelling workflow for the sinus cavity
As there is no separate class for the sinus region in the Segment function of SPM12, it is mostly classified as bone. However, this region is anatomically a mix of air, soft tissue and fine bone. To take this challenging region into account, we modified the image processing pipeline to include three alternative methods for sinus delineation. The first method is called the bulk method and it was implemented directly as described in [29].
In short, the bulk method is based on registration of a CT template, to create a specific mask in the sinus region to assign a fixed attenuation coefficient of 0.100 cm−1 for that region. The drawback of the method is that it needs an additional registration of a CT-based template to delineate the sinuses, assuming that the subject has normal anatomy. Another drawback is the use of a bulk assignment of an attenuation coefficient.
Two new methods presented in this study are called the cuboid method and the template method. The cuboid method is based on matching a cuboid-shaped mask to the individual anatomy of the subject. In the template method, the cuboid is inverse transformed from a template in Montreal Neurological Institute (MNI) space to subject anatomy by using the vector fields from SPM12 Segment function. Finally, the voxels within the cuboids are converted into air and soft tissue using a MRI-CT conversion model. The processing pipelines are described in detail below.
Processing pipeline for the cuboid method
First, an initial binary mask of the air inside the sinus cavity is created by intersection of air and soft tissue masks. The initial mask includes the throat, large air cavities in the sinus, and voxels that do not match with anatomical locations of air cavities. To delineate the air cavities only, the mask was summed across all slices, and the slice sums were made into a line plot. An initial search for the largest air cavities was performed using the largest local maximum preceding the largest drop in the inside air as a criteria for finding the optimal slice. Air cavities in this slice were then chosen by a center of mass method. Thereafter, a region-growing algorithm was applied to delineate all air cavities. The entire air cavity segmentation process is described in detail in Appendix A. After air delineation, a cuboid is fitted to the resulting mask to delineate the region applied for the conversion model.
To get an anatomically fitting cuboid, a set of cuboids are fitted around the air cavities. A detailed description of this method can be found in Appendix B. A large cuboid is placed initially in the front half of the head extending above and below the air cavities. Then sub-cuboids within the maximum cuboid are fitted and evaluated. The best cuboid is a tradeoff between the amount of bone segment covered, and the size of the cuboid, where the final cuboid is selected based on the following criteria:
$$CG = \frac{{\mathop \sum \nolimits_{i \in C} p_{i} }}{{\left( \frac{c}{4} \right)^{2} }},$$
(1)
where CG is the value of the cuboid goodness, C is the cuboid being tested, c is the perimeter of the cuboid and pi is the bone segment probability value for the voxel i. In this way, the more voxels with considerable bone probability there are within the cuboid, the higher its goodness CG will be. The sub-cuboid with the largest CG is then selected as basis for further processing.
Processing pipeline for the template method
An alternative method for delineating the sinus cavities was evaluated based on matching a cuboid template to an individual anatomy by inverse transformation. This method is based on a cuboid template in MNI space, similarly to the regional masks used in [23]. The inverse transformation fields from the Segment function in SPM12 are used to transform the cuboid template back to the individual space of the subject. The resulting mask may no longer strictly be a square cuboid since the subjects’ heads can be slightly tilted and morphed in various directions. The inverse transformed cuboid is then used as a basis for further processing.
MRI-CT conversion model for the sinus region
Once the cuboid location was fixed for either of the methods, all bone segment voxels were converted to air, soft tissue, or a mix of both using an MRI-CT conversion model. The model was derived based on the 10 subjects as follows. The data scatter plot (Fig. 1; Additional file 1: Fig. S2) of the MRI and CT values in the bone segment voxels within the cuboid masks showed that there are concentration points around − 1000 and 0 HU values on a wide range of MRI values and a smaller amount of points scattered around these two lines. Thus, a three-step conversion model was implemented which followed a horizontal line at − 1000 HU (corresponding to air), a diagonal line from − 1000 to 0 HU (corresponding to mix of different tissues), and a horizontal line at 0 HU (corresponding to soft tissue).
The start and end points from − 1000 to 0 HU were estimated from the data. By using randomly selected 100 voxels in the MRI images from an sinus mask of one subject, possible combinations for MRI-CT values were fitted, where a square sum of the shortest distance between the MRI value at − 1000 HU and MRI value at 0 HU was used for optimization criteria. This randomization and fitting process was repeated 100 times for each subject, effectively creating a bootstrap estimate. For the final MRI-CT conversion model, the average over all 100 randomized estimates for MRI values at − 1000 HU and 0 HU were selected as final conversion points.
PET image reconstruction
Three sets of MRI-based attenuation maps were created for each subject. The only difference between the maps was the method used to delineate the sinus region. CTAC data was used as the reference method for attenuation correction. CTAC data was converted from HU to linear attenuation coefficients using the bilinear transform described in [31]. In cases where the attenuation map in CTAC did not cover the entire head of the subject, the attenuation map was complemented by soft tissue from MRAC.
To ensure accurate anatomical alignment, all CTAC and MRAC images were realigned and co-registered to non-attenuation corrected PET images. Co-registration was performed using rigid image registration with normalized mutual information as implemented in SPM12. The images were then resliced to the same voxel dimensions as used for the clinical MRAC. All images were smoothed into PET resolution of 5 mm to match the PET-MRI intrinsic resolution.
Thereafter, the MRAC and CTAC images were imported with the raw PET data to the PET-MRI reconstruction system and PET images were reconstructed using TOF and non-TOF reconstruction algorithms. For non-TOF reconstruction, a LOR-RAMLA with two iterations and 33 subsets, matrix size of 128 × 128 × 90 and isotropic voxel size of 2 mm was used. The reconstruction parameters used were alpha = 6.3716, radius = 2.8, blob increment = 2.0375, and relaxation parameter = 0.035. For TOF reconstruction, a TOF-BLOB-OSEM algorithm with 3 iterations and 33 subsets, matrix size of 128 × 128x90 and a voxel size of 2 mm was applied. The reconstruction parameters used were alpha = 4.1338, radius = 2.3, blob increment = 2.0375 and relaxation parameter = 1.0.
All reconstructions included the necessary corrections for image quantification, including random events, scatter, dead time, decay, and normalization. The head coil template and patient table were inserted automatically by the reconstruction software.
MR-based attenuation map and PET evaluation
DICE analysis
DICE coefficients were calculated from the sinus region and from the entire image stack. The sinus region was defined as slices ranging from 50 to 70, which corresponds to 4.2 cm in size. Estimating the DICE coefficients over the sinus region allows to estimate bone delineation accuracy in that region. Whereas, measurement over image volume is indicative of the accuracy in skull bone delineation. DICE coefficients for CTAC (A) and MRAC (B) were calculated in Eq. 2:
$$DICE = \frac{{2 \left| {(A \cap B)} \right|}}{\left| A \right| + \left| B \right|},$$
(2)
where \(\left| X \right|\) denotes the number of voxels in the binary mask X. Voxels where HU > 157 were included in the masks.
An HU threshold of 157 HU was used throughout DICE analysis, which corresponds to attenuation coefficient of 0.105 cm−1, which is higher than soft tissue, but slightly lower than the reported attenuation coefficient for spongy bone at 0.11 cm−1 [32]. This threshold value was selected as it has been used previously as a boundary value between bone and non-bone tissue [33]. We report the change in the DICE coefficients as percentages between cuboid and template methods compared to the bulk method. To indicate the general trend, median change is also presented.
Correlation analysis
Correlation analysis was performed to investigate the accuracy of the skull bone delineation with each of the methods. Correlation data was sampled from each subject separately. Again, voxels with HU value > 157 were included in the analysis. Each subject was sampled 100 times for 200 random voxel pairs. Due to the sample size, Pearson correlation was used. These correlations were then plotted in a boxplot for each method to evaluate the distribution of the values.
Attenuation coefficient analysis
The average attenuation coefficient in the sinus region was calculated for all attenuation maps. For the analysis, a volume of interest (VOI) consisting of the nasal sinuses was delineated manually over CTAC in Carimas 2.9 (Turku PET Centre, Turku, Finland). The same VOI was then applied on the three MRAC methods.
A leave-one-out method to test out-of-sample accuracy of the MRI-CT conversion model was performed. In the leave-one-out validation, data for a single subject was removed and the conversion model was calculated for the remaining nine subjects. The resulting model would then be used to perform MRI-CT conversion for the removed subject. This process was then repeated for all subjects.
VOI analysis of PET images
In quantitative evaluation of reconstructed PET images, automatic VOI analysis was employed using a well-established anatomical atlas provided in automated anatomical labeling software [34], using in total 35 anatomical VOI from the grey matter. Individualization of the atlas was based on the spatial mapping from the MNI space to individual space using the vector fields extracted by the Segment function in SPM12. The atlas image was masked in the individual space with a grey matter mask with a lower threshold of 0.8.
In the analysis, we calculated the relative difference of the total mean activity in each VOI between CTAC and MRAC reconstructed PET images. The mean relative difference was calculated for each VOI using Eq. 3:
$$VOIDiff_{rel} = \frac{PETMRAC - PETCTAC}{{PETCTAC}},$$
(3)
where PETMRAC denotes the VOI activity measured from MRAC reconstructed PET with different MRI-based μ-maps, while PETCTAC denotes the VOI activity measured from CTAC reconstructed PET. We report the median of the average relative differences over all subjects per VOI.
Visual evaluation of PET Atlas and bias Atlas images
Atlas PET images representing the mean and standard deviation of PET uptake (kBq/mL) across the subject group were calculated with all MRAC methods and CTAC. In the evaluation of global bias distribution, mean bias atlas images were calculated as described by [25, 35]. All atlas images were masked with a mask covering the entire brain of the subject.