Reliability of the freehand region-of-interest method in quantitative cerebral diffusion tensor imaging

Hakulinen, Ullamari; Brander, Antti; Ilvesmäki, Tero; Helminen, Mika; Öhman, Juha; Luoto, Teemu M.; Eskola, Hannu

doi:10.1186/s12880-021-00663-8

Research
Open access
Published: 04 October 2021

Ullamari Hakulinen^1,2,3,
Antti Brander^2,
Tero Ilvesmäki^3,
Mika Helminen^4,5,
Juha Öhman^6,
Teemu M. Luoto^3,6 &
Hannu Eskola^2,3

BMC Medical Imaging volume 21, Article number: 144 (2021)

Abstract

Background

Diffusion tensor imaging (DTI) is a magnetic resonance imaging (MRI) technique used for evaluating changes in the white matter in brain parenchyma. The reliability of quantitative DTI analysis is influenced by several factors, such as the imaging protocol, pre-processing and post-processing methods, and selected diffusion parameters. The region-of-interest (ROI) method is the most widely used of the post-processing methods because it is available in commercial software. The focus of our research was to study the reliability of the freehand ROI method using various intra- and inter-observer analyses.

Methods

This study included 40 neurologically healthy participants who underwent diffusion MRI of the brain with a 3 T scanner. The measurements were performed at nine different anatomical locations using a freehand ROI method. The data extracted from the ROIs included the regional mean values, intra- and inter-observer variability and reliability. The DTI parameters used were fractional anisotropy (FA), the apparent diffusion coefficient (ADC), and axial (AD) and radial (RD) diffusivity.

Results

The average intra-observer intra-class correlation coefficient (ICC) was 0.9 (excellent). The single ICC results were excellent (> 0.8) or adequate (> 0.69) in eight out of the nine regions in terms of FA and ADC. The most reliable results were found in the frontobasal regions. Significant differences between age groups were also found in the frontobasal regions. Specifically, the FA and AD values were significantly higher and the RD values lower in the youngest age group (18–30 years) compared to the other age groups.

Conclusions

The quantitative freehand ROI method can be considered highly reliable for the average ICC and mostly adequate for the single ICC. The freehand method is suitable for research work with a well-experienced observer. Measurements should be performed at least twice in the same region to ensure that the results are sufficiently reliable. In our study, reliability was slightly undermined by artifacts in some regions such as the cerebral peduncle and centrum semiovale. From a clinical point of view, the results are most reliable in adults under the age of 30, when age-related changes in brain white matter have not yet occurred.

Background

Diffusion tensor imaging (DTI) is a magnetic resonance imaging (MRI) technique that has become a popular tool for central nervous system imaging [1, 2]. DTI is based on the diffusion characteristics of water molecules, which, in turn, reflect the histological structure of the tissue [3]. Diffusion data can be used to calculate several quantitative parameters, such as fractional anisotropy (FA), the apparent diffusion coefficient (ADC), and axial (AD) and radial (RD) diffusivity. FA indicates the degree of diffusion anisotropy. The diffusion is generally strongest in the orientation parallel to the nerve tracts. The ADC expresses the mean diffusivity averaged over all directions. AD can be considered to be modulated by the axonal integrity [4, 5], and its changes can thus reflect the degree of axonal degeneration [6]. RD, on the other hand, is modulated by axonal myelination [4, 5].
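To make the relationship between these four indices concrete, the sketch below computes them from the three eigenvalues of a diffusion tensor using the standard textbook definitions. It is illustrative only: the function name `dti_metrics` and the example eigenvalues are hypothetical, and treating the ADC as the mean of the three eigenvalues is an assumption, since the values in this study were produced by commercial software.

```python
import math


def dti_metrics(l1, l2, l3):
    """Return FA, ADC, AD and RD from the three tensor eigenvalues.

    Assumption: ADC is taken as the mean of the eigenvalues (mean
    diffusivity); the article does not spell out the formulas.
    """
    # Sort so that l1 is the principal (largest) eigenvalue.
    l1, l2, l3 = sorted((l1, l2, l3), reverse=True)
    adc = (l1 + l2 + l3) / 3.0          # mean diffusivity
    ad = l1                             # diffusivity along the main axis
    rd = (l2 + l3) / 2.0                # mean of the two minor eigenvalues
    num = (l1 - adc) ** 2 + (l2 - adc) ** 2 + (l3 - adc) ** 2
    den = l1 ** 2 + l2 ** 2 + l3 ** 2
    # FA is 0 for isotropic diffusion and approaches 1 for a single direction.
    fa = math.sqrt(1.5 * num / den) if den > 0 else 0.0
    return {"FA": fa, "ADC": adc, "AD": ad, "RD": rd}


# Hypothetical eigenvalues in units of 10^-3 mm^2/s.
example = dti_metrics(1.7, 0.3, 0.2)
print({name: round(value, 3) for name, value in example.items()})
```

Under these definitions ADC = (AD + 2 × RD) / 3, which is consistent with the thalamus means reported in the Results: AD 1.00 and RD 0.64 give an ADC of 0.76 × 10⁻³ mm²/s.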

Several studies on different neurological diseases have utilized these DTI indices as biomarkers of white matter integrity [7,8,9,10,11]. Significant age-related changes in the integrity of white matter have also been found in healthy volunteers [12,13,14,15,16,17].

Chronic white matter diseases, as well as normal aging, cause a decrease in FA values, while RD values tend to increase [18,19,20,21,22,23,24,25]. A strong relationship has also been found between the changes in AD and axonal injury [4]. Moreover, ADC values may temporarily decrease in the acute phase of cerebrovascular accidents, but, in the chronic phase, they usually increase [26, 27].

The imaging process includes several steps between acquisition and the final parametric result, and each step is susceptible to different sources of error [28, 29]. Specifically, low resolution, a low signal-to-noise ratio (SNR), and a variety of different types of artifacts can reduce the image quality [30,31,32,33]. In particular, the single-shot echo-planar technique used in diffusion imaging can cause severe image distortions because of the long echo trains that are used in the sequence. These susceptibility artifacts result in geometric distortions at the interfaces between soft tissue and air at the base of the skull [34]. In addition, B₀ inhomogeneities cause a decrease in the efficiency of fat-saturation pulses [34]. Protons in water and fat have different Larmor frequencies, which leads to fat misregistration in single-shot echo-planar imaging. All of the above-mentioned pitfalls and artifacts also have a detrimental effect on the reliability of parametric results.

Post-processing and analysis methods can be selected according to whether individual or group results are required. The histogram [35], region-of-interest (ROI), and quantitative tractography methods [36] are suitable for both individual- and group-level analysis. In addition, the tract-based spatial statistics (TBSS) method [37] is an option for group analysis. Nowadays, different methods are often used concomitantly, giving additional value to the accuracy of the results [38, 39].

The ROI method is still a highly valid method when measuring individual subjects. Although laborious, time-consuming, and observer-dependent, it is the most readily available method in commercial, clinically approved software. The method can be used to evaluate focal areas of the brain parenchyma of a single subject, and it allows artifacts to be left outside the area of measurement. The low or moderate repeatability of the method, as well as its high intra- and inter-observer variation, have been considered its drawbacks [40].

The main objective of this study was to investigate the reliability of the freehand ROI method by means of intra- and inter-observer variation and repeatability measurements. The aim was also to examine the effects of different parameters (FA, ADC, AD and RD) and artifacts on the reliability of the results. In addition, the effects of age on white matter changes were studied in group comparisons.

Methods

Subjects

Participants included 40 healthy adult volunteers consisting of 20 women and 20 men with an age range of 18–60 years and a mean age of 40.6 (SD 12.2) years [41, 42]. The age groups were: (i) 18–30, (ii) 31–40, (iii) 41–50, and (iv) 51–60 years. Each age group included five men and five women. Thirty-nine of the subjects were right-handed, and one was left-handed. MRI scans were performed within a year (2010–2011). The exclusion criteria consisted of the following: (i) neurological problems (including abnormalities upon neuroimaging), (ii) psychiatric problems, (iii) history of traumatic brain injury, (iv) former neurosurgical procedure, (v) problems with hearing or vision, (vi) first language other than Finnish, (vii) MRI contraindications, and (viii) refusal to participate. No indications of significant structural abnormalities were found in any of the subjects in conventional clinical sequences. Ethics approval was obtained from the Ethical Committee of the Pirkanmaa Hospital District, and written consent was obtained from each volunteer.

MRI acquisition

The subjects were scanned with a 3 T Siemens Trio (Siemens Healthcare, Erlangen, Germany) MRI scanner. The MRI protocol included sagittal T1-weighted 3D IR-prepared gradient echo, axial T2-weighted turbo spin echo, conventional axial and high-resolution sagittal fluid attenuation inversion recovery (FLAIR), axial T2*-weighted, and axial susceptibility weighted imaging (SWI) series. The DTI data were collected with a single-shot, spin echo-based, diffusion-weighted echo planar imaging sequence. The parameters for the DTI sequence were repetition time (TR) 5144 ms, echo time (TE) 92 ms, field of view (FOV) 230 mm, matrix 128 × 128, 3 averages, slice/gap 3.0/0.9 mm, voxel dimension 1.8 × 1.8 × 3.0 mm³, b-factors 0 and 1000 s/mm², and 20 diffusion gradient orientations. A 12-channel head coil and a four-channel neck coil were used simultaneously. The coils were subjected to regular quality tests throughout the study to verify that they were intact and of high quality.

Data analysis

The multidirectional diffusion data were visually analyzed for distortions and artifacts. The eddy current distortion was qualitatively estimated by drawing the brain contours on the b₀ image and copying the contours to the diffusion weighted images. Susceptibility and phase artifacts were verified by reviewing the FA, ADC, AD, RD, and b₀ maps slice-by-slice.

The SNR was determined according to the National Electrical Manufacturers Association (NEMA) standards 1-2008 with the expression SNR = S/N, where S = the signal and N = the noise of the image, which was estimated with a Rayleigh distribution (SD = standard deviation): N = SD/0.66. SNR values were measured from the b₀ images in each region (b = 0 s/mm²).
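The expression above can be sketched in a few lines. This is a minimal illustration rather than the authors' procedure: the function name `snr_nema` and the pixel values are hypothetical, and it assumes that S is the mean pixel value inside the ROI on the b₀ image and that SD is the sample standard deviation measured in a signal-free background (air) region, which the text does not specify.

```python
import statistics

# Correction factor for Rayleigh-distributed background noise in
# magnitude images, as given in the text (N = SD / 0.66).
RAYLEIGH_FACTOR = 0.66


def snr_nema(roi_pixels, background_pixels):
    """Return SNR = S / N with N = SD / 0.66.

    Assumptions: S is the mean of the ROI pixels and SD is the sample
    standard deviation of the background pixels.
    """
    signal = statistics.mean(roi_pixels)
    sd_background = statistics.stdev(background_pixels)
    noise = sd_background / RAYLEIGH_FACTOR
    return signal / noise


# Hypothetical pixel values from a b0 image.
roi = [410, 395, 402, 420, 398]
air = [6, 11, 9, 14, 8, 12, 10, 7]
print(round(snr_nema(roi, air), 1))
```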

Two experienced observers, a medical physicist (UH) and a neuroradiologist (AB), performed the freehand measurements on a workstation using the commercially available software Neuro3D (Siemens Healthcare, Malvern, USA). The freehand ROIs were manually placed on the axial images of the color-coded FA maps and automatically transferred to the ADC, AD, and RD maps as well as the non-diffusion weighted b₀ images. The ROIs were centered in the region using color-coded directions. The ROIs were drawn to avoid border areas, such as areas overlapping with cerebrospinal fluid spaces, partial volume effects, and neighboring tracts. The ROI for the thalamus was drawn on the grayscale FA map, because the border areas were more clearly distinguishable there than in the color map.

Slices containing artifacts were avoided. If this was not possible, the artifact areas were excluded from the ROIs (Figs. 1 and 2). The sizes of the ROIs were chosen using the anatomical knowledge of brain regions and a tract-based atlas of human white matter anatomy [43]. The ROI size ranged from 10 mm² (min, cerebral peduncle) to 430 mm² (max, centrum semiovale). The time between the first and repeated freehand ROI measurements was at least four weeks.

Intra-observer measurements were performed for all volunteers (n = 40) and inter-observer measurements for 15 volunteers (n = 15). Nine regions were measured, eight of which were in the white matter (Fig. 3). Two observers analyzed each region. The first observer (UH) analyzed the images of all 40 subjects twice, and the second observer (AB) measured the images of 15 subjects. For the inter-observer analysis, the first measurements made by observer 1 on the same 15 subjects were used. The regions in the pyramidal tracts were the cerebral peduncle, posterior limb of the internal capsule, corona radiata, and centrum semiovale. The regions in the frontobasal area were the uncinate fasciculus and forceps minor, and those in the corpus callosum were the genu and splenium. One region, the thalamus, was in the gray matter. The FA, ADC, AD, and RD values were calculated for each region. The left and right hemispheres were measured separately for seven regions. The ROIs for the genu and splenium of the corpus callosum were drawn in the center of the axial image, with one ROI per region.

Statistical analyses

The statistical analyses were performed using the SPSS software package (IBM SPSS Statistics version 22 and 26, Chicago, IL). Means and standard deviations were calculated for each region and parameter, and asymmetries between hemispheres were evaluated using a paired samples t-test. The statistical significance was set to p < 0.007, with a Bonferroni correction for seven regions, according to the regions measured in each hemisphere of the brain. The normality of distributions was tested using the Shapiro–Wilk test (n < 50). The differences among all the age group means were analyzed using an analysis of variance (ANOVA) for the normally distributed data and Welch’s test in inhomogeneous cases, where the variance of the variable differed between the age groups. The Kruskal–Wallis test was used for non-normally distributed data. A correlation analysis between FA, ADC and age from the same data has been published in our previous study [41]. In that study, we mostly used a small circle ROI, with a freehand ROI in three regions for better repeatability.
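The threshold and the choice of group test described above can be summarized as a small decision sketch. It is not the SPSS procedure itself: the function names are hypothetical, and the family-wise significance level of 0.05 is an assumption, since the text reports only the corrected threshold.

```python
def bonferroni_threshold(alpha, n_comparisons):
    """Per-comparison significance threshold after Bonferroni correction."""
    return alpha / n_comparisons


def choose_group_test(is_normal, equal_variances):
    """Pick the age-group comparison test following the rules in the text."""
    if not is_normal:
        return "Kruskal-Wallis"   # non-normally distributed data
    if not equal_variances:
        return "Welch"            # normal data, unequal group variances
    return "ANOVA"                # normal data, homogeneous variances


# Assumed family-wise alpha of 0.05 over the seven bilateral regions.
threshold = bonferroni_threshold(0.05, 7)
print(f"{threshold:.3f}")
print(choose_group_test(is_normal=True, equal_variances=False))
```

With an assumed alpha of 0.05, dividing by seven regions gives approximately 0.0071, which rounds to the p < 0.007 threshold stated above.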

The samples that showed statistically significant differences among the age groups were analyzed by a group comparison between the different age groups. The independent-samples t-test was used with the normally distributed samples, and the Mann–Whitney U test with the non-normal distributions.

To show the relative variability of each measurement, the percent coefficients of variation (CV%) were calculated according to the following equation (with SD = standard deviation and M = mean): CV% = (SD/M) × 100% [44]. The variability was considered acceptable when the CV% was less than 10% [45]. The results between 11 and 20% were considered to be moderate but still adequate. CV% values over 21% were considered too high and inadequate.
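The CV% calculation and its grading can be sketched as follows. The function names and FA values are hypothetical, the use of the sample standard deviation is an assumption, and because the stated categories leave the intervals 10–11% and 20–21% undefined, the sketch assumes cut-offs at 10% and 20%.

```python
import statistics


def cv_percent(values):
    """Percent coefficient of variation: (SD / M) x 100."""
    return statistics.stdev(values) / statistics.mean(values) * 100.0


def classify_cv(cv):
    """Grade a CV% value.

    Assumption: boundaries are placed at 10% and 20% because the text
    does not say how values between the stated categories are graded.
    """
    if cv <= 10.0:
        return "acceptable"
    if cv <= 20.0:
        return "moderate"
    return "inadequate"


# Hypothetical FA values measured in one region across subjects.
fa_values = [0.60, 0.66, 0.72]
cv = cv_percent(fa_values)
print(round(cv, 1), classify_cv(cv))
```

Applied to the values reported in the Results, an RD variation of 26% in the corpus callosum would be graded inadequate, while an FA variation of 5% in the posterior limb of the capsula interna would be graded acceptable.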

Bland–Altman plots were used as graphical representations of intra- and inter-observer repeatability [44]. The 95% limits of agreement, defined as the mean difference ± 2 standard deviations of the differences, were calculated. The better the consistency between the first and repeated measurements, the smaller the difference between the two limits. Intra- and inter-observer repeatability was also assessed using intra-class correlation coefficients (ICCs) with absolute agreement. The two-way mixed model was chosen because the aim was to investigate the repeatability of these specific observers. In this study, the average ICC refers to the repeatability (test–retest) when the same region is measured twice and the final score is the average of the two measurements. The single ICC approximates a situation where the measurement is made only once, as is usually the case in clinical situations. The cerebral hemispheres were analyzed separately, but the results are presented as the mean of the left and right hemispheres. The ICC values were considered to indicate excellent agreement if they were greater than 0.8. ICC results between 0.70 and 0.79 were considered adequate [45], and values below 0.69 were considered inadequate for clinical work. The statistical significance was set to p < 0.006, with a Bonferroni correction for nine regions.
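Both repeatability measures can be sketched in plain Python. This is an illustration under stated assumptions, not a reproduction of the SPSS output: the function names and FA values are hypothetical, the limits of agreement are taken as the mean difference ± 2 SD, and the ICC uses the standard two-way ANOVA formulas for absolute agreement (single and average measures).

```python
import math


def bland_altman(first, second, n_sd=2.0):
    """Return (mean difference, lower limit, upper limit) of agreement."""
    diffs = [a - b for a, b in zip(first, second)]
    n = len(diffs)
    mean_diff = sum(diffs) / n
    sd = math.sqrt(sum((d - mean_diff) ** 2 for d in diffs) / (n - 1))
    return mean_diff, mean_diff - n_sd * sd, mean_diff + n_sd * sd


def icc_absolute_agreement(ratings):
    """Return (single ICC, average ICC) for a two-way, absolute-agreement model.

    `ratings` holds one row per subject and one column per measurement
    (two columns for a test-retest design). Data with no variation at
    all would cause a division by zero and are not handled here.
    """
    n, k = len(ratings), len(ratings[0])
    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ms_rows = ss_rows / (n - 1)                       # between subjects
    ms_cols = ss_cols / (k - 1)                       # between measurements
    ms_err = (ss_total - ss_rows - ss_cols) / ((n - 1) * (k - 1))
    single = (ms_rows - ms_err) / (
        ms_rows + (k - 1) * ms_err + k * (ms_cols - ms_err) / n
    )
    average = (ms_rows - ms_err) / (ms_rows + (ms_cols - ms_err) / n)
    return single, average


# Hypothetical FA values: first and repeated measurement per subject.
first = [0.62, 0.70, 0.55, 0.66, 0.73]
repeat = [0.60, 0.72, 0.57, 0.65, 0.70]
print(bland_altman(first, repeat))
print(icc_absolute_agreement([list(pair) for pair in zip(first, repeat)]))
```

In these formulas the average ICC for two measurements equals 2 × single / (1 + single), which is why averaging two measurements yields a higher coefficient than a single measurement.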

Results

The data quality was excellent in most cases. In some of the cases, artifacts were detected in the cerebral peduncle, corona radiata, and centrum semiovale (Table 1 and Fig. 2). No significant eddy current artifacts occurred.

Table 1 The incidence of artifacts in the regions (N = 40)

The mean SNR value (± SD) for all regions was 27.7 ± 4.2; by area, it was 30.5 ± 4.2 in the pyramidal tract, 24.1 ± 4.7 in the frontobasal area, 25.4 ± 0.3 in the corpus callosum, and 28.0 ± 4.2 in the thalamus.

Mean values

In the Shapiro–Wilk test, 90% of the means were normally distributed (p > 0.05). The intra-observer mean values for the FA, ADC, AD, and RD of the sample (n = 40) are shown in Table 2.

Table 2 The intra-observer (observer 1) regional mean FA (0–1, unitless), ADC (10⁻³ mm²/s), AD (10⁻³ mm²/s) and RD (10⁻³ mm²/s) values ± standard deviation (mean ± SD), variation (the percent coefficients of variation = CV%) and repeatability (the intra-class correlation coefficients (ICC) and mean difference ± 2SD) (N = 40)

In white matter ROIs, the mean FA value was 0.67. The lowest value was found in the corona radiata (0.50), and the highest in the genu of the corpus callosum (0.86). The mean ADC value was 0.74 × 10⁻³ mm²/s, with the lowest value being found in the corona radiata (0.70 × 10⁻³ mm²/s) and the highest in the uncinate fasciculus (0.78 × 10⁻³ mm²/s). The mean AD value was 1.44 × 10⁻³ mm²/s, with the lowest value being found in the corona radiata (1.10 × 10⁻³ mm²/s) and the highest in the genu of the corpus callosum (1.82 × 10⁻³ mm²/s). The mean RD value was 0.39 × 10⁻³ mm²/s, with the lowest value being found in the genu of the corpus callosum (0.26 × 10⁻³ mm²/s) and the highest in the forceps minor (0.53 × 10⁻³ mm²/s). In the gray matter (the thalamus), the corresponding mean values were 0.32 for FA, 0.76 × 10⁻³ mm²/s for ADC, 1.00 × 10⁻³ mm²/s for AD, and 0.64 × 10⁻³ mm²/s for RD.

Statistically significant differences between the right and left hemispheres (paired samples t-test, p < 0.007) are indicated in Table 2, and the absolute mean values can be found in the table footnotes. In the pyramidal tract, more precisely in the posterior limb of the internal capsule and corona radiata, the FA values were significantly higher and RD values lower in the left hemisphere. The ADC values were lower in the left hemisphere in all four regions of the pyramidal tract. In the cerebral peduncle, the AD value was also lower in the left hemisphere. In both frontobasal regions, the FA values were significantly higher in the right hemisphere.

Significant differences between age groups were found in the frontobasal regions (Fig. 4). The FA and AD values were significantly higher and the RD values significantly lower in the youngest age group (18–30 years) compared to the other age groups (31–40, 41–50 and 51–60 years) (Fig. 4A, B). Specifically, the FA and RD differences were found in both hemispheres and AD differences in the left. For the ADC, there were no significant differences between the groups. The inter-observer mean values were estimated for 15 subjects, and the values are shown in Table 3.

Table 3 Inter-observer regional mean FA (0–1, unitless), ADC (10⁻³ mm²/s), AD (10⁻³ mm²/s) and RD (10⁻³ mm²/s) values ± standard deviation (mean ± SD), variation (the percent coefficients of variation = CV%) and repeatability (mean difference ± 2SD) (observer 1 & 2) (N = 15)

Variation

The intra-observer variations (CV%) are shown in Table 2 (n = 40) (Fig. 5A). In the pyramidal tract, the variation for the FA measurements was 8%. The lowest variation was in the posterior limb of the capsula interna (5%), and the highest in the centrum semiovale (12%). The variation was 11% in the frontobasal area and 5% in the corpus callosum. In the gray matter (thalamus), the variation for the FA was 8%. For the ADC and AD, it was between 3 and 8% in all white matter and gray matter regions. For the RD measurements, the variation in the pyramidal tract was 12%. The lowest variation was in the posterior limb of the capsula interna (8%) and the highest in the cerebral peduncle (18%). The RD variation was 9% in the frontobasal area and 26% in the corpus callosum. In the gray matter (thalamus), the variation was 5%. The inter-observer variation results (CV%) are shown in Table 3 (Fig. 5B).

Reliability

The intra-observer results of the limits of agreement are shown in Table 2. In the white matter, the best intra-observer agreement was found in the posterior limb of the capsula interna with all diffusion parameters. For the ADC, good agreement was also found in the corona radiata, centrum semiovale, uncinate fasciculus, and forceps minor. The largest range between the limits was found in the centrum semiovale for the FA and in the cerebral peduncle for the ADC, AD and RD measurements. The smallest and largest ranges between the 95% limits of agreement for each DTI parameter are presented in the Bland–Altman plots (Figs. 6, 7). For the gray matter, the agreement was very good with all DTI parameters (Fig. 8). On average, the 2 SD of the limit of agreement for the intra-observer results was 0.06. The inter-observer limits of agreement are shown in Table 3, and the smallest ranges between limits are presented in the Bland–Altman plots for each DTI parameter (Fig. 9). In white matter regions, the best agreement was found in the uncinate fasciculus for FA and RD and in the corona radiata for ADC and AD. On average, the 2 SD of the limit of agreement for the inter-observer results was 0.08.

The intra-observer repeatability results (ICC) are shown in Table 2. For the FA, the mean was 0.87 for the average ICC and 0.78 for the single ICC. The highest average ICC was found in the uncinate fasciculus (0.95), and the lowest in the cerebral peduncle (0.75). The average ICC results for the FA were above 0.8, and the single ICCs were above 0.7, in eight of the nine regions. Only one region, the cerebral peduncle, had coefficients below these levels (average 0.75 and single 0.60). For the ADC, the mean was 0.91 for the average ICC and 0.85 for the single ICC. The highest ICC values were found in the centrum semiovale for both the average (0.98) and single (0.95) ICC. The lowest ICC was observed in the cerebral peduncle for both the average (0.80) and single (0.67) ICC. For AD, the mean was 0.87 for the average ICC and 0.78 for the single ICC. The highest ICC values of AD were found in the splenium of the corpus callosum for both the average (0.94) and single (0.89) ICC. The lowest ICC values of AD were found in the centrum semiovale for both the average (0.76) and single (0.62) ICC. For RD, the mean was 0.90 for the average ICC and 0.82 for the single ICC. The highest ICC values for both the average (0.96) and single (0.93) ICC were found in the uncinate fasciculus in the frontobasal area. For RD, the lowest values were found in the cerebral peduncle for both the average (0.76) and single (0.61) ICC.

Of the inter-observer ICC results, 70% were statistically significant (p < 0.006), and only the significant results are presented. The means of the average ICCs were 0.84 for FA, 0.88 for ADC, 0.81 for AD, and 0.88 for RD, and the means of the single ICCs were 0.72, 0.79, 0.69 and 0.78, respectively. The highest ICCs were found in the corona radiata, where the average ICC values were 0.94 for FA, 0.95 for ADC and 0.97 for RD, and the single ICC values were 0.89, 0.90 and 0.94, respectively. For AD, the highest ICCs were found in the splenium of the corpus callosum (average 0.92 and single 0.84).

Discussion

FA values are considered to reflect the integrity of the white matter. Although FA is not in itself a specific parameter in a diagnostic sense, it provides indirect information about myelination, fiber packing density, and fiber orientation [46]. It is well known that FA values vary widely at different anatomic levels of the brain [12, 13, 40, 45, 47]. Specifically, Lee et al. [12] reported that regional FA values varied from 0.21 in deep gray matter (putamen) to 0.81 in tightly packed parallel white matter tract bundles, such as the genu of the corpus callosum. The corresponding results in this study were 0.32 for deep gray matter (thalamus) and 0.86 for the genu of the corpus callosum. Regions with coherently oriented fibers, such as the cerebral peduncle, internal capsule, and corpus callosum, exhibited higher anisotropy than regions with less coherence, such as the centrum semiovale and other subcortical regions [48]. Due to the vast regional variability of FA, possible anatomical mismatches should be taken into account in inter-observer and intergroup comparisons [47]. The ADC values, on the other hand, exhibit less regional variation [13]. In our study, the ADC mean values varied between 0.7 and 0.8 × 10⁻³ mm²/s, and in other similar studies the range was 0.7 to 0.9 × 10⁻³ mm²/s [45, 49,50,51]. The frontobasal area demonstrated lower FA and AD values and higher ADC and RD values than the other white matter regions. The FA values were in line with a tractography study by Deng et al. [52], where a mean FA value of 0.41 (profile 0.3 to 0.52) was found in the uncinate fasciculus and 0.54 (profile 0.40 to 0.68) in the forceps minor. In our study, the corresponding FA values were 0.57 and 0.51. The results of Lieberman et al. [53] were also similar to ours in the uncinate fasciculus. The FA and ADC values were almost identical to those found in our previous study (30 subjects) in most of the regions [40]. The biggest difference (14%) between our present and previous study was found in the genu of the corpus callosum. In this region, measurements were previously made on sagittal images [40], whereas axial images were used in the present study. In general, the measured quantitative diffusion metrics were well in line with previous studies.

Asymmetry between the hemispheres was found in some of the regions. Pyramidal tracts, such as the posterior limb of the capsula interna and corona radiata, expressed higher FA values and lower ADC and RD values in the left hemisphere. The present results are well in agreement with previous studies [13, 40, 54]. In addition, in the centrum semiovale, asymmetry of the cerebral hemispheres was observed in the ADC value, which was also lower on the left. Some of the observed asymmetry in our study may be attributed to handedness of the volunteers; 39 of the 40 volunteers in our study were right-handed. Corresponding hemispheric differences were obtained for right-handers in another study [54]. Phase artifacts (fat misregistration) could also be a possible explanation in the regions of the corona radiata and centrum semiovale. In the corona radiata, phase artifacts were present in 55% of cases in the left hemisphere but were not present at all in the right hemisphere. Similarly, the centrum semiovale included artifacts in 25% of cases in the left hemisphere and only in 5% in the right hemisphere. The fat misregistration generally raises FA values locally and decreases ADC and RD values. Artifacts can affect the ROIs in the vicinity, even if the visible part of the artifact is cropped out. Hemispheric differences were also found in the frontobasal area. In those regions, the FA values were found to be higher in the right hemisphere, which is in agreement with previous findings [40, 55]. Jahanshad et al. [55] found that the variance in the asymmetry of the frontal lobe is strongly due to genetic factors. In our study, higher FA values were usually found in the right hemisphere of the frontobasal area. Bonekamp et al. [56] reported that small hemispheric differences could be due to slight slice angulation. Therefore, keeping the same slice position and orientation in longitudinal studies is essential [47].

In terms of age-related changes, we found significant differences between the youngest age group (18–30 years) and other age groups (31–40, 41–50, and 51–60 years). Specifically, the FA values were higher and the RD values lower in the frontobasal area in both hemispheres in the youngest age group when compared to the other age groups. For FA, this result has already been published in our previous study [41]. Other studies have also found changes in the frontal regions of the brain caused by aging [16, 17]. In general, several studies have found a negative correlation between age and FA and a positive correlation between age and RD in white matter [21, 22, 57, 59]. These variations may be related to changes in myelination and axon density [17, 58, 60].

In the present study, acceptable intra-observer variability (≤ 10%) was found in six out of nine regions for FA, while three regions had moderate but adequate variation. For ADC and AD, all regions had acceptable variability. For RD, seven out of nine regions had an acceptable or moderate variation and two had high variation (genu and splenium of the corpus callosum). The percent variation of the RD values in the corpus callosum is naturally high, because the mean value is clearly lower than in the other regions. Low RD values are due to the fact that the fibers are tightly packed and parallel to each other. In this case, the variation was not a good indicator for assessing reliability. Overall, the variation results were in line with our previous study [40]. It is noteworthy that the freehand method gives an average of 4% lower variations in the pyramidal regions compared to the circle method [13, 41]. In contrast, in our study, the freehand method gave a slightly higher variation in the corpus callosum than the circle method in previous studies [13, 41]. This may be due to the fact that in our study, ROIs were plotted on the axial image, whereas in previous studies they were plotted on the sagittal image [13, 41]. Thus, in this particular region, it would be better to use the circle method for a sagittal image than the freehand method for an axial image. The inter-observer (n = 15) variability was acceptable or moderate in seven out of nine regions. The inter-observer variabilities are in line with our previous study [40].

The intra-observer repeatability was at a very good level according to the 95% limits of agreement. The results varied according to region: in tightly packed white matter tracts, such as the posterior limb of the capsula interna, the difference between the limits was small, whereas it was greater in regions containing crossing fibers, such as the centrum semiovale. The only gray matter region, the thalamus, was also found to be reliable in this analysis. Overall, the results were consistent with our previous research [40]. The inter-observer agreement was lower than the intra-observer agreement in all regions, and others have reported similar results [13, 40, 59, 60]. Several studies have reported inter-observer agreement to be one-third lower than intra-observer agreement [59, 60]. Our study further confirms this trend. The uncinate fasciculus was found to be the most reliable region in the inter-observer analyses for FA and RD, while the corona radiata was the most reliable region for ADC and AD.

The intra-observer reliability was high according to the average measures of the ICC analysis. In our study, average ICC refers to the repeatability obtained as the average of two measurements from a single region. Overall, the average ICC results were excellent for all four parameters. The repeatability result was also excellent (above 0.8) in eight out of nine regions for FA and all regions for the ADC. The repeatability of the freehand method was significantly improved compared to our previous study [40]. The average ICC increase was 0.4 (37%) in terms of the FA and ADC parameters.

The higher ICC values were probably due to increased observer experience in selecting a slice and in avoiding artifacts and the partial volume effect of border areas. The single intra-observer ICC analysis was, on average, excellent in terms of the ADC and RD parameters and moderate in terms of the FA and AD parameters. Single ICC in our study refers to the repeatability of a single measurement, which can be considered normal practice in clinical measurements. The results showed excellent or moderate repeatability in seven out of nine regions for all DTI parameters. The region with the highest single ICC values was the forceps minor, with excellent reliability for each parameter. Good reliability was also found in the following regions: the uncinate fasciculus, thalamus, and the genu and splenium of the corpus callosum. High reliability in the corpus callosum is consistent with previous studies with the ROI method [45, 61, 62] but also with the TBSS method [38]. Inadequate results (ICC < 0.69) were found in the cerebral peduncle (FA, ADC and RD) and centrum semiovale (AD). The reason for the inferior reliability of the cerebral peduncle was a susceptibility artifact, more specifically one caused by the nearby air cavity. This artifact causes local changes in the parameter values. Although efforts were made to avoid distorted areas in the ROI, the effects of the artifact were also reflected in the surrounding areas. The low reliability of the AD values in the centrum semiovale can be explained by the multitude of crossing fibers in the subcortical white matter.

The statistically significant inter-observer results were highly similar to the intra-observer results. The differences between intra- and inter-observer ICC results averaged less than 5% for the average ICC and less than 10% for the single ICC. The most reliable inter-observer region was found to be the corona radiata, which had the highest value for three different parameters (FA, ADC, and RD). For AD, the highest value was obtained in the splenium of the corpus callosum. The reliability of the measurements is greatly improved if the measurement is repeated at least once or if the result is taken as a mean of the measurements from two different observers.
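The distinction between the single and the average ICC can be illustrated with a minimal sketch. The ICC model used in the analysis is not restated in this section, so the example assumes a two-way consistency model (single measure ICC(C,1) and average measure ICC(C,k)); the function name `icc_consistency` and the FA values are hypothetical and serve only as an illustration.

```python
import numpy as np


def icc_consistency(data):
    """Return (single, average) two-way consistency ICC for an n x k array.

    Rows are subjects, columns are repeated measurements (or observers).
    Assumed model: ICC(C,1) and ICC(C,k); not necessarily the model of the study.
    """
    x = np.asarray(data, dtype=float)
    n, k = x.shape
    grand = x.mean()

    # Sums of squares of a two-way ANOVA without replication
    ss_rows = k * np.sum((x.mean(axis=1) - grand) ** 2)   # between subjects
    ss_cols = n * np.sum((x.mean(axis=0) - grand) ** 2)   # between measurements
    ss_err = np.sum((x - grand) ** 2) - ss_rows - ss_cols  # residual

    ms_rows = ss_rows / (n - 1)
    ms_err = ss_err / ((n - 1) * (k - 1))

    single = (ms_rows - ms_err) / (ms_rows + (k - 1) * ms_err)
    average = (ms_rows - ms_err) / ms_rows
    return single, average


# Hypothetical FA values: 5 subjects, 2 repeated ROI measurements each
fa = np.array([
    [0.50, 0.52],
    [0.60, 0.58],
    [0.70, 0.73],
    [0.55, 0.54],
    [0.65, 0.66],
])
single, average = icc_consistency(fa)
print(f"single ICC = {single:.3f}, average ICC = {average:.3f}")
```

Under this model the average ICC of k measurements equals k·ICC_single / (1 + (k − 1)·ICC_single), so it exceeds the single ICC whenever the single ICC lies between 0 and 1. This is one way to see why repeating a measurement, or averaging two observers, improves the reliability of the reported value.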

The SNR measurements showed that the image quality was sufficient for reliable quantitative measurements. In general, the SNR of b = 0 s/mm² should be at least 20 in order to derive reliable FA values [36]. In our study, the SNR was well above 20 in all regions, and the measured SNR values were comparable to other studies [63, 64].
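As a rough illustration of the threshold check, the sketch below estimates the SNR as the mean signal in a tissue ROI divided by the standard deviation of a noise region. This is only one common convention and an assumption here; the SNR procedure of the study is not described in this section, and the function name `roi_snr` and the simulated voxel values are hypothetical.

```python
import numpy as np


def roi_snr(signal_voxels, noise_voxels):
    """Mean of the signal ROI divided by the sample SD of the noise region."""
    signal = np.asarray(signal_voxels, dtype=float)
    noise = np.asarray(noise_voxels, dtype=float)
    return signal.mean() / noise.std(ddof=1)


# Simulated b = 0 intensities (arbitrary units), for illustration only
rng = np.random.default_rng(0)
signal_roi = 400.0 + rng.normal(0.0, 10.0, size=200)
noise_roi = rng.normal(0.0, 10.0, size=200)

snr = roi_snr(signal_roi, noise_roi)
meets_threshold = snr >= 20.0  # minimum b = 0 SNR suggested for reliable FA values
print(f"SNR = {snr:.1f}, meets threshold: {meets_threshold}")
```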

A limitation of this study was that the commercial program did not include eddy current and subject motion corrections. In addition, the imaging parameters used may not have been optimal, especially compared to more recent diffusion imaging techniques, e.g., high angular resolution diffusion imaging (HARDI) using isotropic voxels. Acquisition with higher-resolution isotropic voxels and possibly HARDI may give more accurate results [36]. Furthermore, it has been shown that using near 1 mm isotropic voxels gives excellent repeatability [65]. In addition, only 70% of the inter-observer ICC results were statistically significant. This was a consequence of the small number of samples, as the schedule of measurements was limited.

In general, the regions with high reliability and low variation share some common features. These regions have low anatomical variation and tightly packed fibers with a common orientation [66]. They also often have a better SNR and fewer partial volume effects, and are less affected by crossing fibers. In addition, a larger ROI size increases the SNR value and improves the repeatability [66]. When a larger ROI is used in a limited region, a larger percentage of the same voxels is likely to be included in both measurements than with a smaller ROI. The results of the repeated measurements are thus close to each other.
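The effect of ROI size on voxel overlap can be illustrated with a toy example. The sketch below assumes square ROIs on a 2D grid, with the repeated ROI displaced by one voxel, and quantifies the shared voxels with the Dice coefficient. Freehand ROIs are not squares, so the helper names `square_roi` and `dice` and all sizes are hypothetical simplifications.

```python
import numpy as np


def square_roi(shape, top, left, side):
    """Boolean mask with a side x side square ROI at the given position."""
    mask = np.zeros(shape, dtype=bool)
    mask[top:top + side, left:left + side] = True
    return mask


def dice(a, b):
    """Dice overlap: 2 * |A and B| / (|A| + |B|)."""
    return 2.0 * np.logical_and(a, b).sum() / (a.sum() + b.sum())


shape = (32, 32)
# The repeated measurement places the ROI one voxel to the right of the first one
small = dice(square_roi(shape, 10, 10, 3), square_roi(shape, 10, 11, 3))
large = dice(square_roi(shape, 10, 10, 9), square_roi(shape, 10, 11, 9))
print(f"3 x 3 ROI overlap: {small:.2f}, 9 x 9 ROI overlap: {large:.2f}")
```

For the same one-voxel placement error, the larger ROI shares a greater fraction of its voxels between the two measurements, which is consistent with the better repeatability described above.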

In future studies, larger samples of carefully collected normative DTI data with high spatial and angular resolution should be acquired. In those studies, more subjects should be recruited for each age group in order to perform a reliable analysis of the effect of age. In addition, it would be interesting to study how much the reliability of the measurements improves when different methods, such as the ROI, tractography, and TBSS methods, are used simultaneously.

Conclusions

According to our results, the intra-observer repeatability of the quantitative freehand ROI method can be considered at least adequate. The quantitative freehand ROI method can be considered highly reliable for the average ICC and mostly adequate for the single ICC. The reliability of the single measurements was excellent or moderate in 80% of the regions, including all DTI parameters. In the comparison of parameters, for the single ICCs, most of the repeatability results were excellent in terms of the ADC and RD while only moderate in terms of the FA and AD parameters.

According to our results, the freehand method can be considered highly suitable for research and clinical applications, assuming a well-experienced observer. Measurements should be repeated at least once in each region to ensure sufficient reliability of the results. The frontobasal regions, such as the uncinate fasciculus and forceps minor, as well as the internal capsule and corona radiata regions of the pyramidal tracts, were found to be reliable in the repeatability analysis. In addition, the only gray matter region, the thalamus, was found to be reliable. Therefore, these regions could be considered the ones that yield the most accurate quantitative ROI measurements in clinical settings. In general, it would be highly beneficial to favor regions with high reliability and repeatability in ROI measurements, if possible. Additionally, special care should be taken in ROI delineation in subjects with image artifacts.

When using the results of healthy adults as a control for patient groups, it should be noted that the results are most reliable in adults less than 30 years of age, whose brain white matter does not yet show age-related changes.

Availability of data and materials

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.

References

Mori S, Crain BJ, Chacko VP, van Zijl PC. Three-dimensional tracking of axonal projections in the brain by magnetic resonance imaging. Ann Neurol. 1999;45(2):265–9. https://doi.org/10.1002/1531-8249(199902)45:2%3c265::aid-ana21%3e3.0.co;2-3.
Assaf Y, Pasternak O. Diffusion tensor imaging (DTI)-based white matter mapping in brain research: a review. J Mol Neurosci. 2008;34(1):51–61. https://doi.org/10.1007/s12031-007-0029-0.
Farrell JA, Landman BA, Jones CK, Smith SA, Prince JL, van Zijl PC, Mori S. Effects of signal-to-noise ratio on the accuracy and reproducibility of diffusion tensor imaging-derived fractional anisotropy, mean diffusivity, and principal eigenvector measurements at 1.5 T. J Magn Reson Imaging. 2007;26(3):756–67. https://doi.org/10.1002/jmri.21053.
Budde MD, Xie M, Cross AH, Song SK. Axial diffusivity is the primary correlate of axonal injury in the experimental autoimmune encephalomyelitis spinal cord: a quantitative pixelwise analysis. J Neurosci. 2009;29(9):2805–13. https://doi.org/10.1523/JNEUROSCI.4605-08.2009.
Song SK, Yoshino J, Le TQ, Lin SJ, Sun SW, Cross AH, Armstrong RC. Demyelination increases radial diffusivity in corpus callosum of mouse brain. Neuroimage. 2005;26(1):132–40. https://doi.org/10.1016/j.neuroimage.2005.01.028.
Alexander AL, Lee JE, Lazar M, Field AS. Diffusion tensor imaging of the brain. Neurotherapeutics. 2007;4(3):316–29. https://doi.org/10.1016/j.nurt.2007.05.011.
Song SK, Sun SW, Ramsbottom MJ, Chang C, Russell J, Cross AH. Dysmyelination revealed through MRI as increased radial (but unchanged axial) diffusion of water. Neuroimage. 2002;17(3):1429–36. https://doi.org/10.1006/nimg.2002.1267.
Perlbarg V, Puybasset L, Tollard E, Lehéricy S, Benali H, Galanaud D. Relation between brain lesion location and clinical outcome in patients with severe traumatic brain injury: a diffusion tensor imaging study using voxel-based approaches. Hum Brain Mapp. 2009;30(12):3924–33. https://doi.org/10.1002/hbm.20817.
Boespflug EL, Storrs JM, Allendorfer JB, Lamy M, Eliassen JC, Page S. Mean diffusivity as a potential diffusion tensor biomarker of motor rehabilitation after electrical stimulation incorporating task specific exercise in stroke: a pilot study. Brain Imaging Behav. 2014;8(3):359–69. https://doi.org/10.1007/s11682-011-9144-1.
Klawiter EC, Schmidt RE, Trinkaus K, Liang HF, Budde MD, Naismith RT, Song SK, Cross AH, Benzinger TL. Radial diffusivity predicts demyelination in ex vivo multiple sclerosis spinal cords. Neuroimage. 2011;55(4):1454–60. https://doi.org/10.1016/j.neuroimage.2011.01.007 (Epub 2011 Jan 13).
Mukherjee P. Diffusion tensor imaging and fiber tractography in acute stroke. Neuroimaging Clin N Am. 2005;15(3):655–65. https://doi.org/10.1016/j.nic.2005.08.010.
Lee CE, Danielian LE, Thomasson D, Baker EH. Normal regional fractional anisotropy and apparent diffusion coefficient of the brain measured on a 3 T MR scanner. Neuroradiology. 2009;51(1):3–9. https://doi.org/10.1007/s00234-008-0441-3 (Epub 2008 Aug 13).
Brander A, Kataja A, Saastamoinen A, Ryymin P, Huhtala H, Ohman J, Soimakallio S, Dastidar P. Diffusion tensor imaging of the brain in a healthy adult population: normative values and measurement reproducibility at 3 T and 1.5 T. Acta Radiol. 2010;51(7):800–7. https://doi.org/10.3109/02841851.2010.495351.
Ilvesmäki T, Luoto TM, Hakulinen U, Brander A, Ryymin P, Eskola H, Iverson GL, Ohman J. Acute mild traumatic brain injury is not associated with white matter change on diffusion tensor imaging. Brain. 2014;137(Pt 7):1876–82. https://doi.org/10.1093/brain/awu095 (Epub 2014 May 11).
Sala S, Agosta F, Pagani E, Copetti M, Comi G, Filippi M. Microstructural changes and atrophy in brain white matter tracts with aging. Neurobiol Aging. 2012;33(3):488-498.e2. https://doi.org/10.1016/j.neurobiolaging.2010.04.027 (Epub 2010 Jul 1).
Yoon B, Shim YS, Lee KS, Shon YM, Yang DW. Region-specific changes of cerebral white matter during normal aging: a diffusion-tensor analysis. Arch Gerontol Geriatr. 2008;47(1):129–38. https://doi.org/10.1016/j.archger.2007.07.004 (Epub 2007 Aug 30).
Bach M, Laun FB, Leemans A, Tax CM, Biessels GJ, Stieltjes B, Maier-Hein KH. Methodological considerations on tract-based spatial statistics (TBSS). Neuroimage. 2014;100:358–69. https://doi.org/10.1016/j.neuroimage.2014.06.021 (Epub 2014 Jun 16).
Ilvesmäki T, Koskinen E, Brander A, Luoto T, Öhman J, Eskola H. Spinal cord injury induces widespread chronic changes in cerebral white matter. Hum Brain Mapp. 2017;38(7):3637–47. https://doi.org/10.1002/hbm.23619 (Epub 2017 Apr 21).
Choi JY, Hart T, Whyte J, Rabinowitz AR, Oh SH, Lee J, Kim JJ. Myelin water imaging of moderate to severe diffuse traumatic brain injury. Neuroimage Clin. 2019;22: 101785. https://doi.org/10.1016/j.nicl.2019.101785 (Epub 2019 Mar 16).
Dailey NS, Smith R, Bajaj S, Alkozei A, Gottschlich MK, Raikes AC, Satterfield BC, Killgore WDS. Elevated aggression and reduced white matter integrity in mild traumatic brain injury: a DTI study. Front Behav Neurosci. 2018;12:118. https://doi.org/10.3389/fnbeh.2018.00118.
Cox SR, Ritchie SJ, Tucker-Drob EM, Liewald DC, Hagenaars SP, Davies G, Wardlaw JM, Gale CR, Bastin ME, Deary IJ. Ageing and brain white matter structure in 3,513 UK Biobank participants. Nat Commun. 2016;7:13629. https://doi.org/10.1038/ncomms13629.
Kodiweera C, Alexander AL, Harezlak J, McAllister TW, Wu YC. Age effects and sex differences in human brain white matter of young to middle-aged adults: A DTI, NODDI, and q-space study. Neuroimage. 2016;128:180–92. https://doi.org/10.1016/j.neuroimage.2015.12.033 (Epub 2015 Dec 24).
Banaszek A, Bladowska J, Pokryszko-Dragan A, Podemski R, Sąsiadek MJ. Evaluation of the degradation of the selected projectile, commissural and association white matter tracts within normal appearing white matter in patients with multiple sclerosis using diffusion tensor MR imaging—a preliminary study. Pol J Radiol. 2015;80:457–63. https://doi.org/10.12659/PJR.894661.
Lipton ML, Gulko E, Zimmerman ME, Friedman BW, Kim M, Gellella E, Gold T, Shifteh K, Ardekani BA, Branch CA. Diffusion-tensor imaging implicates prefrontal axonal injury in executive function impairment following very mild traumatic brain injury. Radiology. 2009;252(3):816–24. https://doi.org/10.1148/radiol.2523081584 (Epub 2009 Jun 30).
Messé A, Caplain S, Pélégrini-Issac M, Blancho S, Montreuil M, Lévy R, Lehéricy S, Benali H. Structural integrity and postconcussion syndrome in mild traumatic brain injury patients. Brain Imaging Behav. 2012;6(2):283–92. https://doi.org/10.1007/s11682-012-9159-2.
Shen JM, Xia XW, Kang WG, Yuan JJ, Sheng L. The use of MRI apparent diffusion coefficient (ADC) in monitoring the development of brain infarction. BMC Med Imaging. 2011;11:2. https://doi.org/10.1186/1471-2342-11-2.
Alegiani AC, MacLean S, Braass H, Gellißen S, Cho TH, Derex L, Hermier M, Berthezene Y, Nighoghossian N, Gerloff C, Fiehler J, Thomalla G. Dynamics of water diffusion changes in different tissue compartments from acute to chronic stroke—a serial diffusion tensor imaging study. Front Neurol. 2019;10:158. https://doi.org/10.3389/fneur.2019.00158.
Jones DK. Precision and accuracy in diffusion tensor magnetic resonance imaging. Top Magn Reson Imaging. 2010;21(2):87–99. https://doi.org/10.1097/RMR.0b013e31821e56ac.
Jones DK, Cercignani M. Twenty-five pitfalls in the analysis of diffusion MRI data. NMR Biomed. 2010;23(7):803–20. https://doi.org/10.1002/nbm.1543.
Westin CF, Maier SE, Mamata H, Nabavi A, Jolesz FA, Kikinis R. Processing and visualization for diffusion tensor MRI. Med Image Anal. 2002;6(2):93–108. https://doi.org/10.1016/s1361-8415(02)00053-1.
Anderson AW. Theoretical analysis of the effects of noise on diffusion tensor imaging. Magn Reson Med. 2001;46(6):1174–88. https://doi.org/10.1002/mrm.1315.
Lazar M, Alexander AL. An error analysis of white matter tractography methods: synthetic diffusion tensor field simulations. Neuroimage. 2003;20(2):1140–53. https://doi.org/10.1016/S1053-8119(03)00277-5.
Leemans A, Jones DK. The B-matrix must be rotated when correcting for subject motion in DTI data. Magn Reson Med. 2009;61(6):1336–49. https://doi.org/10.1002/mrm.21890.
Dietrich O, Reiser MF, Schoenberg SO. Artifacts in 3-T MRI: physical background and reduction strategies. Eur J Radiol. 2008;65(1):29–35. https://doi.org/10.1016/j.ejrad.2007.11.005 (Epub 2007 Dec 26).
Heiervang E, Behrens TE, Mackay CE, Robson MD, Johansen-Berg H. Between session reproducibility and between subject variability of diffusion MR and tractography measures. Neuroimage. 2006;33(3):867–77. https://doi.org/10.1016/j.neuroimage.2006.07.037 (Epub 2006 Sep 26).
Mukherjee P, Chung SW, Berman JI, Hess CP, Henry RG. Diffusion tensor MR imaging and fiber tractography: technical considerations. AJNR Am J Neuroradiol. 2008;29(5):843–52. https://doi.org/10.3174/ajnr.A1052 (Epub 2008 Mar 13).
Smith SM, Jenkinson M, Johansen-Berg H, Rueckert D, Nichols TE, Mackay CE, Watkins KE, Ciccarelli O, Cader MZ, Matthews PM, Behrens TE. Tract-based spatial statistics: voxelwise analysis of multi-subject diffusion data. Neuroimage. 2006;31(4):1487–505. https://doi.org/10.1016/j.neuroimage.2006.02.024 (Epub 2006 Apr 19).
Merisaari H, Tuulari JJ, Karlsson L, Scheinin NM, Parkkola R, Saunavaara J, Lähdesmäki T, Lehtola SJ, Keskinen M, Lewis JD, Evans AC, Karlsson H. Test-retest reliability of diffusion tensor imaging metrics in neonates. Neuroimage. 2019;197:598–607. https://doi.org/10.1016/j.neuroimage.2019.04.067 (Epub 2019 Apr 25).
Lilja Y, Gustafsson O, Ljungberg M, Nilsson D, Starck G. Impact of region-of-interest method on quantitative analysis of DTI data in the optic tracts. BMC Med Imaging. 2016;16(1):42. https://doi.org/10.1186/s12880-016-0145-9.
Hakulinen U, Brander A, Ryymin P, Öhman J, Soimakallio S, Helminen M, Dastidar P, Eskola H. Repeatability and variation of region-of-interest methods using quantitative diffusion tensor MR imaging of the brain. BMC Med Imaging. 2012;12:30. https://doi.org/10.1186/1471-2342-12-30.
Nenonen M, Hakulinen U, Brander A, Ohman J, Dastidar P, Luoto TM. Possible confounding factors on cerebral diffusion tensor imaging measurements. Acta Radiol Open. 2015;4(2):2047981614546795. https://doi.org/10.1177/2047981614546795.
Koskinen E, Brander A, Hakulinen U, Luoto T, Helminen M, Ylinen A, Ohman J. Assessing the state of chronic spinal cord injury using diffusion tensor imaging. J Neurotrauma. 2013;30(18):1587–95. https://doi.org/10.1089/neu.2013.2943 (Epub 2013 Aug 9).
Wakana S, Jiang H, Nagae-Poetscher LM, van Zijl PC, Mori S. Fiber tract-based atlas of human white matter anatomy. Radiology. 2004;230(1):77–87. https://doi.org/10.1148/radiol.2301021640 (Epub 2003 Nov 26).
Bland JM, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet. 1986;1(8476):307–10.
Marenco S, Rawlings R, Rohde GK, Barnett AS, Honea RA, Pierpaoli C, Weinberger DR. Regional distribution of measurement error in diffusion tensor imaging. Psychiatry Res. 2006;147(1):69–78. https://doi.org/10.1016/j.pscychresns.2006.01.008 (Epub 2006 Jun 21).
Shimony JS, McKinstry RC, Akbudak E, Aronovitz JA, Snyder AZ, Lori NF, Cull TS, Conturo TE. Quantitative diffusion-tensor anisotropy brain MR imaging: normative human data and anatomic analysis. Radiology. 1999;212(3):770–84. https://doi.org/10.1148/radiology.212.3.r99au51770.
Virta A, Barnett A, Pierpaoli C. Visualizing and characterizing white matter fiber structure and architecture in the human pyramidal tract using diffusion tensor MRI. Magn Reson Imaging. 1999;17(8):1121–33. https://doi.org/10.1016/s0730-725x(99)00048-x.
Pierpaoli C, Jezzard P, Basser PJ, Barnett A, Di Chiro G. Diffusion tensor MR imaging of the human brain. Radiology. 1996;201(3):637–48. https://doi.org/10.1148/radiology.201.3.8939209.
Bisdas S, Bohning DE, Besenski N, Nicholas JS, Rumboldt Z. Reproducibility, interrater agreement, and age-related changes of fractional anisotropy measures at 3T in healthy subjects: effect of the applied b-value. AJNR Am J Neuroradiol. 2008;29(6):1128–33. https://doi.org/10.3174/ajnr.A1044 (Epub 2008 Mar 27).
Huisman TA, Loenneker T, Barta G, Bellemann ME, Hennig J, Fischer JE, Il’yasov KA. Quantitative diffusion tensor MR imaging of the brain: field strength related variance of apparent diffusion coefficient (ADC) and fractional anisotropy (FA) scalars. Eur Radiol. 2006;16(8):1651–8. https://doi.org/10.1007/s00330-006-0175-8 (Epub 2006 Mar 11).
Hunsche S, Moseley ME, Stoeter P, Hedehus M. Diffusion-tensor MR imaging at 1.5 and 3.0 T: initial observations. Radiology. 2001;221(2):550–6. https://doi.org/10.1148/radiol.2212001823.
Deng F, Zhao L, Liu C, Lu M, Zhang S, Huang H, Chen L, Wu X, Niu C, He Y, Wang J, Huang R. Plasticity in deep and superficial white matter: a DTI study in world class gymnasts. Brain Struct Funct. 2018;223(4):1849–62. https://doi.org/10.1007/s00429-017-1594-9 (Epub 2017 Dec 18).
Lieberman G, Shpaner M, Watts R, Andrews T, Filippi CG, Davis M, Naylor MR. White matter involvement in chronic musculoskeletal pain. J Pain. 2014;15(11):1110–9. https://doi.org/10.1016/j.jpain.2014.08.002 (Epub 2014 Aug 15).
Seizeur R, Magro E, Prima S, Wiest-Daesslé N, Maumet C, Morandi X. Corticospinal tract asymmetry and handedness in right- and left-handers by diffusion tensor tractography. Surg Radiol Anat. 2014;36(2):111–24. https://doi.org/10.1007/s00276-013-1156-7 (Epub 2013 Jun 27).
Jahanshad N, Lee AD, Barysheva M, McMahon KL, de Zubicaray GI, Martin NG, Wright MJ, Toga AW, Thompson PM. Genetic influences on brain asymmetry: a DTI study of 374 twins and siblings. Neuroimage. 2010;52(2):455–69. https://doi.org/10.1016/j.neuroimage.2010.04.236 (Epub 2010 Apr 27).
Bonekamp D, Nagae LM, Degaonkar M, Matson M, Abdalla WM, Barker PB, Mori S, Horská A. Diffusion tensor imaging in children and adolescents: reproducibility, hemispheric, and age-related differences. Neuroimage. 2007;34(2):733–42. https://doi.org/10.1016/j.neuroimage.2006.09.020 (Epub 2006 Nov 7).
Inano S, Takao H, Hayashi N, Abe O, Ohtomo K. Effects of age and gender on white matter integrity. AJNR Am J Neuroradiol. 2011;32(11):2103–9. https://doi.org/10.3174/ajnr.A2785 (Epub 2011 Oct 13).
Lebel C, Gee M, Camicioli R, Wieler M, Martin W, Beaulieu C. Diffusion tensor imaging of white matter tract evolution over the lifespan. Neuroimage. 2012;60(1):340–52. https://doi.org/10.1016/j.neuroimage.2011.11.094 (Epub 2011 Dec 8).
Ciccarelli O, Parker GJ, Toosy AT, Wheeler-Kingshott CA, Barker GJ, Boulby PA, Miller DH, Thompson AJ. From diffusion tractography to quantitative white matter tract measures: a reproducibility study. Neuroimage. 2003;18(2):348–59. https://doi.org/10.1016/s1053-8119(02)00042-3.
Müller MJ, Mazanek M, Weibrich C, Dellani PR, Stoeter P, Fellgiebel A. Distribution characteristics, reproducibility, and precision of region of interest-based hippocampal diffusion tensor imaging measures. AJNR Am J Neuroradiol. 2006;27(2):440–6.
Pfefferbaum A, Adalsteinsson E, Sullivan EV. Replicability of diffusion tensor imaging measurements of fractional anisotropy and trace in brain. J Magn Reson Imaging. 2003;18(4):427–33. https://doi.org/10.1002/jmri.10377.
Papinutto ND, Maule F, Jovicich J. Reproducibility and biases in high field brain diffusion MRI: an evaluation of acquisition and analysis variables. Magn Reson Imaging. 2013;31(6):827–39. https://doi.org/10.1016/j.mri.2013.03.004 (Epub 2013 Apr 24).
Polders DL, Leemans A, Hendrikse J, Donahue MJ, Luijten PR, Hoogduin JM. Signal to noise ratio and uncertainty in diffusion tensor imaging at 1.5, 3.0, and 7.0 Tesla. J Magn Reson Imaging. 2011;33(6):1456–63. https://doi.org/10.1002/jmri.22554.
Laganà M, Rovaris M, Ceccarelli A, Venturelli C, Marini S, Baselli G. DTI parameter optimisation for acquisition at 1.5 T: SNR analysis and clinical application. Comput Intell Neurosci. 2010;2010:254032. https://doi.org/10.1155/2010/254032 (Epub 2010 Jan 5).
Shahim P, Holleran L, Kim JH, Brody DL. Test-retest reliability of high spatial resolution diffusion tensor and diffusion kurtosis imaging. Sci Rep. 2017;7(1):11141. https://doi.org/10.1038/s41598-017-11747-3.
Stieltjes B, Kaufmann WE, van Zijl PC, Fredericksen K, Pearlson GD, Solaiyappan M, Mori S. Diffusion tensor imaging and axonal tracking in the human brainstem. Neuroimage. 2001;14(3):723–35. https://doi.org/10.1006/nimg.2001.0861.

Acknowledgements

Thanks to my colleague Pertti Ryymin, who checked the technical MRI section of the background.

Funding

This study was partly financially supported by the Tampere University Hospital Support Foundation, Tampere University Hospital. The funding organization had no role in the design or implementation of the study or in the preparation of the manuscript.

Author information

Authors and Affiliations

Department of Medical Physics, Medical Imaging Center of Pirkanmaa Hospital District, Tampere, Finland
Ullamari Hakulinen
Department of Radiology, Medical Imaging Center of Pirkanmaa Hospital District, Tampere, Finland
Ullamari Hakulinen, Antti Brander & Hannu Eskola
Faculty of Medicine and Health Technology, Tampere University, Tampere, Finland
Ullamari Hakulinen, Tero Ilvesmäki, Teemu M. Luoto & Hannu Eskola
Faculty of Social Sciences, Health Sciences, Tampere University, Tampere, Finland
Mika Helminen
Tays Research Services, Tampere University Hospital, Tampere, Finland
Mika Helminen
Department of Neurosurgery, Tampere University Hospital and Tampere University, Tampere, Finland
Juha Öhman & Teemu M. Luoto

Authors

Ullamari Hakulinen
Antti Brander
Tero Ilvesmäki
Mika Helminen
Juha Öhman
Teemu M. Luoto
Hannu Eskola

Contributions

J.Ö. and T.L. contributed to the design of the study. T.L. recruited the control subjects. U.H. designed and performed the measurements, analyzed the results, made the plots and figures, and wrote the manuscript as first author. A.B. performed the inter-observer measurements and actively participated in the manuscript writing process. H.E. acted as supervisor for the manuscript and the technical section. M.H. contributed to the statistical analyses. T.I. critically reviewed the analysis and commented on the manuscript. Everyone participated in the evaluation of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Ullamari Hakulinen.

Ethics declarations

Ethics approval and consent to participate

Ethics approval was obtained from the Ethical Committee of the Pirkanmaa Hospital District. Informed consent was obtained from all participants. All methods were carried out in accordance with relevant guidelines and regulations.

Consent for publication

Not applicable.

Competing interests

The authors have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

About this article

Cite this article

Hakulinen, U., Brander, A., Ilvesmäki, T. et al. Reliability of the freehand region-of-interest method in quantitative cerebral diffusion tensor imaging. BMC Med Imaging 21, 144 (2021). https://doi.org/10.1186/s12880-021-00663-8

Received: 04 January 2021
Accepted: 01 September 2021
Published: 04 October 2021
DOI: https://doi.org/10.1186/s12880-021-00663-8
