Assessment of the impact of the scanner-related factors on brain morphometry analysis with Brainvisa
© Shokouhi et al.; licensee BioMed Central Ltd. 2011
Received: 28 April 2011
Accepted: 21 December 2011
Published: 21 December 2011
Brain morphometry is extensively used in cross-sectional studies. However, the difference in the estimated values of the morphometric measures between patients and healthy subjects may be small and hence overshadowed by the scanner-related variability, especially with multicentre and longitudinal studies. It is important therefore to investigate the variability and reliability of morphometric measurements between different scanners and different sessions of the same scanner.
We assessed the variability and reliability for the grey matter, white matter, cerebrospinal fluid and cerebral hemisphere volumes as well as the global sulcal index, sulcal surface and mean geodesic depth using Brainvisa. We used datasets obtained across multiple MR scanners at 1.5 T and 3 T from the same groups of 13 and 11 healthy volunteers, respectively. For each morphometric measure, we conducted an ANOVA and tested whether the estimated values were significantly different across different scanners or different sessions of the same scanner. The between-centre and between-visit reliabilities were estimated from their contribution to the total variance, using a random-effects ANOVA model. To identify the main processes responsible for low reliability, the results of brain segmentation were compared to those obtained using FAST within FSL.
In a considerable number of cases, the main effects of both centre and visit factors were found to be significant. Moreover, both between-centre and between-visit reliabilities ranged from poor to excellent for most morphometric measures. A comparison between segmentation using Brainvisa and FAST revealed that FAST improved the reliabilities for most cases, suggesting that morphometry could benefit from improving the bias correction. However, the results were still significantly different across different scanners or different visits.
Our results confirm that for morphometry analysis with the current version of Brainvisa using data from multicentre or longitudinal studies, the scanner-related variability must be taken into account and, where possible, should be corrected for. We also suggest making Brainvisa more flexible so that the robustness of the package, in terms of reproducibility of the results, can be analysed step by step; for example, by allowing bias-corrected images to be imported from other packages and the bias correction step to be skipped.
Brain morphometry has proven to be a powerful tool in identifying biomarkers of many neurological and psychiatric disorders. Several studies have investigated the link between changes in brain morphology and certain diseases or disorders such as Alzheimer's disease, schizophrenia, autism, epilepsy, and bipolar disorder [1–6].
One of the popular software packages for brain morphometry is Brainvisa [7]. In addition to the most common morphometry metrics, this program allows sulcus-based morphometry. This is possible using the automatic sulci recognition feature of the program, which identifies the sulci of each individual brain. Sulcal parameters such as volume, depth, location, and pattern can then be computed for each sulcus. Exploring the sulcal parameters provides valuable information, as it has been shown that changes in such parameters can be associated with pathology [5, 8].
Brainvisa has been used for cortical morphometry and to study abnormality-related changes in parameters such as sulcal mean depth and surface in patients with cerebral autosomal dominant arteriopathy with subcortical infarcts and leukoencephalopathy (CADASIL) [9]. It has also been used to show the decrease in average cortical thickness and sulcal span with normal aging [10]. Moreover, decreased global sulcal index (GSI), the ratio between the folded surface and the unfolded surface of the cortex, has been reported for schizophrenic patients with auditory hallucinations as well as patients with bipolar disorder and unipolar depression [11–13].
Nonetheless, the reliability of the brain morphometry is a major issue in cross-sectional studies (evaluation of the differences between normal and abnormal brains), where the abnormality-related variation may be small and dominated by the low measurement reliabilities.
Scanner-related factors can complicate the cross-sectional studies where both between-group variability and within-group variability (due to inter-individual variability in brain anatomy) already exist. Scanner instability and variations over time may result in bias in the derived morphometric measures as already shown for functional MRI and should be taken into account especially in longitudinal studies such as normal brain aging. Furthermore, there has been a growing interest in multicentre studies as they provide the researchers with larger datasets by pooling data from different sites and hence improve the statistical power [15, 16]. Nevertheless, multicentre studies may introduce a between-centre variance component which can overshadow small between-group (patients vs. normal subjects) variances. Consequently, it is essential to verify the effect of scanner-related factors (either within-centre or between-centre) on the estimated values for the morphological parameters.
While a considerable number of studies on scanner-related variability have focused on functional MRI [17–19], a number of similar studies concerning structural MRI have also been reported. Using their previously developed algorithm, Schnack et al. studied the variability of brain tissue segmentation for data acquired from multiple centres, different manufacturers and under different acquisition protocols [20, 21]. Han et al. looked into the effects of scanner-related factors such as field strength, scanner manufacturer, upgrade, and pulse sequence as well as data processing factors on the cortical thickness measurement using FreeSurfer [22, 23]. Moorhead et al. investigated both within-scanner and between-scanner variability in the segmentation of grey and white matter using SPM5 [24, 25]. Suckling et al. studied both within-centre and between-centre variability in the distribution of grey matter using FSL with the aim of power calculation for two-group, cross-sectional study designs [26, 27]. The above studies have considered some of the currently used morphometric measures; however, there have been no similar reports for the automatically computed sulcal attributes, despite their use in morphometry analysis. More research regarding such measures still needs to be carried out.
In addition, the measurement process introduces another source of variance which may vary among different packages. Thus, the reliability also depends on the measurement method and the package used for the analysis. Nonetheless, a study on the assessment of morphometry with Brainvisa has not been reported yet. This paper presents a comprehensive study on the assessment of reliability and robustness of morphometry using Brainvisa against the scanner-related variability, whilst also investigating the possible causes which reduce the reliability. Our aim in this study has been twofold: 1) assessment of viability of multicentre and longitudinal studies using Brainvisa and 2) investigating the robustness and reproducibility of morphometry with Brainvisa using repeated scans (both between- and within-centre) of the same subjects.
To cover the most commonly used morphological parameters, we estimated brain tissue volumes and GSI as well as the sulcal attributes. For sulcal attributes, the assessments were performed independently with each of the four recognition algorithms provided in Brainvisa, as they produce slightly different results. The above-mentioned choice of parameters is also useful in the assessment of the reliability associated with each particular pre-processing step within Brainvisa. To further investigate the possible causes which could have an impact on the reliability, brain segmentation was repeated using FSL and the results were compared to those obtained with Brainvisa.
Moreover, to verify the performance of Brainvisa in different field strengths (and hence various degrees of signal to noise ratio) we used two separate groups of data acquired with 1.5 T and 3 T scanners and investigated the variability and the reliability within each group.
The retrospective data we used in this study included two sets of 3D T1-weighted MR scans, pooled from 1.5 T and 3 T scanners of the CaliBrain (funded by a Chief Scientist Office (Scotland) Project Grant: CZB/4/427) and Neuro/Psygrid projects, respectively [28, 29]. Both datasets had been obtained from healthy individuals with no history of head injury, psychiatric or neurological disorder. Two subjects (one from the CaliBrain dataset and one from the Neuro/Psygrid dataset) with incomplete data (i.e. without repeated scans at all centres) were excluded from the study. The study was approved and the permission to use the retrospective data was granted by the West of Scotland Research Ethics Committee.
The CaliBrain project included MR scans from thirteen healthy participants (ten male, mean age 36.3, age range 22-51 years). The subjects had been scanned twice at three different sites: The Department of Radiology, University of Aberdeen; The Division of Psychiatry and The SFC Brain Imaging Research Centre within The Centre for Clinical Brain Sciences (CCBS) at The University of Edinburgh; and The Institute of Neurological Sciences, NHS Greater Glasgow South University Hospitals Division. The T1-weighted scans were acquired using a 3D inversion recovery-prepared fast gradient echo volume sequence. All three 1.5 T scanners were manufactured by General Electric (GE Healthcare, Milwaukee, Wisconsin).
Scanner specifications and imaging parameters for the 1.5 T group
Parameter | GE Signa NVi/CVi 1.5 T | GE Signa LX 1.5 T | GE Signa 1.5 T
Flip angle (°) | | |
Image size (voxels) | 256 × 256 × 124 | 256 × 256 × 124 | 256 × 256 × 125
Voxel size (mm3) | 0.86 × 0.86 × 1.7 | 0.86 × 0.86 × 1.7 | 0.86 × 0.86 × 1.8
Scanner specifications and imaging parameters for the 3 T group
Parameter | GE 3 T HD | GE 3 T HDx | Siemens 3 T Tim Trio | Siemens 3 T Tim Trio | Philips 3 T Intera-Achieva
Flip angle (°) | | | | |
Image size (voxels) | 260 × 260 × 160 | 260 × 260 × 160 | 256 × 256 × 160 | 256 × 256 × 160 | 256 × 256 × 160
Voxel size (mm3) | 1.1 × 1.1 × 1 | 1.1 × 1 × 1 | 1 × 1 × 1 | 1 × 1 × 1 | 1 × 1 × 1
Parallel imaging entries (scanner assignment not preserved): ASSET factor 2; SENSE factor 2
The outputs of bias correction, brain extraction and split, and segmentation steps were visually examined and were corrected by tuning the parameters and repeating the process where necessary. In one or two cases, manual corrections were used to produce reasonable results.
The folds gathered in the relational graph structure were grouped together to form the sulci. The sulci recognition algorithm uses prior knowledge about the location of each sulcus for labelling the folds. Sulci labelling with Brainvisa is based on the sulcal root theory and returns 59 sulcal labels for each hemisphere.
For sulci recognition we used the Statistical Parametric Anatomy Map (SPAM) algorithm, which uses a probabilistic model as the a priori information for sulci recognition. This probabilistic model returns the probability of presence of each sulcus at a given 3D position [36, 37]. There are four variations of the SPAM algorithm: Talairach, Global, Local, and Markovian. The Talairach method uses the probabilities of sulci locations in the Talairach space. However, as registering the brain to the Talairach atlas does not accurately align the sulci of different subjects, the other three algorithms each use a different approach to improve between-individual sulcal alignment. The Global approach is based on iterative registration and labelling of the cortical folds to the SPAM maps, where the same registration parameters are applied to the whole cortical surface. The Local method, which is performed following a Global registration, locally optimizes the registration on a sulcus-by-sulcus basis; hence each sulcus has its own registration parameters. The Markovian algorithm also follows the Global registration and uses the information about the relations and distances between neighbouring sulcal segments for labelling the sulci. It should be noted that sulci recognition involves only a virtual registration to a template, and the final results (including sulcal parameters) are expressed in the native space.
For this study we employed all four algorithms independently. This allowed us to estimate the sulcus-specific morphometric measures for each method and compare the reliability obtained by each method.
Using Brainvisa, the morphometric measures were calculated for every scan of each subject. The measurements were performed for each cerebral hemisphere independently (Figure 1d). The morphometric measures were either global (brain tissue volumes, and global sulcal index) or sulcal (parameters that were calculated for each sulcus independently; i.e. sulcus surface and sulcus mean geodesic depth).
Brain tissue volumes: Brain segmentation (Figure 1d) was used to estimate the volumes of WM, GM, and CSF for each hemisphere.
Global Sulcal Index (GSI): This is an estimation of the cortex gyrification and is defined as the ratio between the total area of all the cortical folds and the area of the outer cortical surface (unfolded cortex).
The following sulcal parameters were calculated for each of the four SPAM algorithms separately:
Sulcus surface: Since the sulci are formed from the skeleton segments (Figure 1g) which are only one voxel thick, their volumes depend on the voxel size and orientation. Instead, the surface area of the sulcus which is not affected by voxel size provides a better estimate of sulcus size. Therefore we used sulcus surface area as the proxy for its volume.
Sulcus mean geodesic depth: The geodesic depth of a sulcus is defined as the geodesic distance (along the cortical mesh) between the external line of the fold (on the brain hull) and the bottom line of the sulcus.
For each morphometric measure, Analysis of Variance (ANOVA) was conducted using Minitab 16 at the significance level of 0.05 to investigate whether the values corresponding to different centres or different visits were significantly different. Thus an effect was considered significant if the p-value observed under the null hypothesis of no effect was smaller than 0.05. P-values were not adjusted for multiple testing and therefore have to be considered as descriptive. In order to take the between-subject difference into account, a "subject" factor was included in the ANOVA model. For the analysis of GM, WM, and CSF volumes as well as GSI, the brain volume was also included as a covariate. For sulcal parameters, both brain volume and GSI were included as covariates.
Where Vtotal is the total variance corresponding to all sources of variability and is obtained from the sum of all variance components, and Vcentre (or Vvisit) is the sum of the variances associated with the centre (or visit) factor and its interactions with other factors. The reliability ranges from zero to one, representing cases where the variance associated with the centre (or visit) factor is the only source of variance or is negligible, respectively.
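Equation 1 itself does not survive in this text. One form consistent with the definitions above, stated here as an assumption rather than as the authors' exact expression, is reliability = 1 - Vcentre / Vtotal (and likewise with Vvisit). The following minimal Python sketch applies that assumed form to a set of variance components; the component names and values are hypothetical and chosen only for illustration.

```python
def reliability(components, factor):
    """Assumed form of equation 1: 1 - V_factor / V_total.

    components: dict mapping a model term (e.g. "centre", "subject:centre")
                to its estimated variance component.
    factor:     "centre" or "visit"; every term that involves this factor
                (main effect and interactions) counts towards V_factor.
    """
    v_total = sum(components.values())
    if v_total <= 0:
        raise ValueError("total variance must be positive")
    v_factor = sum(
        value for term, value in components.items()
        if factor in term.split(":")
    )
    return 1.0 - v_factor / v_total


# Hypothetical variance components (not taken from the study).
components = {
    "subject": 40.0,
    "centre": 6.0,
    "visit": 1.0,
    "subject:centre": 2.0,
    "centre:visit": 0.5,
    "residual": 0.5,
}

between_centre = reliability(components, "centre")
between_visit = reliability(components, "visit")
print(round(between_centre, 2), round(between_visit, 2))
```

With these illustrative numbers the centre-related terms account for 8.5 of 50 units of variance, so the between-centre reliability is lower than the between-visit reliability.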
Segmentation with FSL vs. Brainvisa
In order to further explore the factors limiting the reliability of Brainvisa morphometry we used FSL (version 4.1.4) and compared the results of the two packages. Since bias correction and histogram analysis are the fundamental steps with great influence on the final results, a comparison between the two packages in brain segmentation (which is a direct result of these two steps) can be helpful in the assessment of robustness of each of these processes.
While it is usually suggested to use SIENA/SIENAX tools within FSL for longitudinal or cross-sectional studies, these tools may introduce bias since they involve a registration step. Consequently, we chose the FAST algorithm (version 4.1) of FSL, which employs a Hidden Markovian Random Field model that, when classifying a voxel, takes the tissue types of its neighbouring voxels into account, whilst also correcting for the bias field. To ensure that the comparison between the two packages only concerns the segmentation process and not the brain extraction, the same skull-stripped brain created with Brainvisa was used and segmented using FAST.
Two metrics were used to compare the bias correction of FAST and Brainvisa: the coefficient of variation within each tissue type, and the WM/GM contrast. More robust bias correction should result in smaller values for the coefficient of variation (due to narrower histogram peaks) and higher WM/GM contrast. Masks of all three tissue types were applied to each bias corrected T1 image (bias corrected with Brainvisa or FAST) and the coefficient of variation was calculated within each mask. To avoid bias towards either of the two programs, each tissue mask was defined as the intersection of the masks obtained with FAST and Brainvisa. To estimate the WM/GM contrast, the WM and GM peaks were computed from the distribution of grey levels within the WM and GM masks. The WM/GM contrast was then calculated as the ratio between the WM and GM peaks. It should be noted that this method of peak detection is different and independent from peak detection with the histogram analysis in Brainvisa.
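The two metrics described above can be sketched as follows. This is a minimal pure-Python illustration, not the authors' implementation: the flattened image, the masks, the bin width and the use of the population standard deviation are all assumptions made for the example, and the peak is taken as the centre of the most populated histogram bin.

```python
from collections import Counter
from statistics import mean, pstdev


def masked_values(image, mask_a, mask_b):
    """Grey levels of voxels that both masks assign to the tissue (mask intersection)."""
    return [v for v, a, b in zip(image, mask_a, mask_b) if a and b]


def coefficient_of_variation(values):
    """Standard deviation divided by the mean (population SD is an assumption)."""
    return pstdev(values) / mean(values)


def histogram_peak(values, bin_width):
    """Centre of the most populated histogram bin."""
    counts = Counter(int(v // bin_width) for v in values)
    best_bin = max(counts, key=counts.get)
    return (best_bin + 0.5) * bin_width


def wm_gm_contrast(wm_values, gm_values, bin_width=5.0):
    """Ratio between the WM and GM histogram peaks."""
    return histogram_peak(wm_values, bin_width) / histogram_peak(gm_values, bin_width)


# Hypothetical flattened bias-corrected image and tissue masks from the two packages.
image = [100, 102, 101, 100, 60, 61, 60, 59, 20]
wm_fast = [1, 1, 1, 1, 0, 0, 0, 0, 0]
wm_brainvisa = [1, 1, 1, 1, 1, 0, 0, 0, 0]
gm_fast = [0, 0, 0, 0, 1, 1, 1, 1, 0]
gm_brainvisa = [0, 0, 0, 0, 0, 1, 1, 1, 1]

wm = masked_values(image, wm_fast, wm_brainvisa)
gm = masked_values(image, gm_fast, gm_brainvisa)
cv_wm = coefficient_of_variation(wm)
cv_gm = coefficient_of_variation(gm)
contrast = wm_gm_contrast(wm, gm)
print(cv_wm, cv_gm, contrast)
```

Using the intersection of the two masks means a voxel only contributes if both packages agree on its tissue type, which is the safeguard against favouring either program described in the text.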
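The mean and standard deviation of GM, WM, and CSF volumes across subjects for the 1.5 T group
Centre, visit | GM volume (cm3), Brainvisa | GM volume (cm3), FAST | WM volume (cm3), Brainvisa | WM volume (cm3), FAST | CSF volume (cm3), Brainvisa | CSF volume (cm3), FAST
A, visit 1 | 259 ± 35 | 275 ± 36 | 296 ± 45 | 253 ± 44 | 87 ± 18 | 132 ± 20
A, visit 2 | 259 ± 34 | 270 ± 32 | 293 ± 47 | 249 ± 39 | 89 ± 20 | 127 ± 16
B, visit 1 | 322 ± 65 | 286 ± 34 | 274 ± 29 | 234 ± 33 | 62 ± 21 | 153 ± 18
B, visit 2 | 321 ± 68 | 285 ± 35 | 274 ± 24 | 236 ± 34 | 64 ± 19 | 151 ± 21
C, visit 1 | 249 ± 31 | 274 ± 38 | 288 ± 48 | 243 ± 41 | 96 ± 28 | 131 ± 23
C, visit 2 | 247 ± 34 | 270 ± 32 | 293 ± 51 | 239 ± 38 | 96 ± 26 | 128 ± 15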
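The mean and standard deviation of GM, WM, and CSF volumes across subjects for the 3 T group
Centre, visit | GM volume (cm3), Brainvisa | GM volume (cm3), FAST | WM volume (cm3), Brainvisa | WM volume (cm3), FAST | CSF volume (cm3), Brainvisa | CSF volume (cm3), FAST
D, visit 1 | 308 ± 31 | 298 ± 20 | 249 ± 29 | 228 ± 20 | 93 ± 17 | 112 ± 12
D, visit 2 | 294 ± 25 | 293 ± 19 | 264 ± 28 | 227 ± 18 | 91 ± 14 | 116 ± 15
E, visit 1 | 324 ± 28 | 282 ± 17 | 250 ± 42 | 211 ± 19 | 111 ± 21 | 111 ± 15
E, visit 2 | 310 ± 27 | 271 ± 20 | 268 ± 31 | 202 ± 17 | 105 ± 11 | 130 ± 16
F, visit 1 | 328 ± 36 | 309 ± 22 | 241 ± 23 | 245 ± 21 | 105 ± 11 | 122 ± 15
F, visit 2 | 311 ± 32 | 312 ± 21 | 278 ± 43 | 246 ± 22 | 94 ± 13 | 129 ± 14
G, visit 1 | 332 ± 39 | 298 ± 18 | 247 ± 28 | 232 ± 16 | 87 ± 18 | 144 ± 13
G, visit 2 | 314 ± 37 | 295 ± 19 | 261 ± 28 | 232 ± 18 | 88 ± 18 | 143 ± 18
H, visit 1 | 321 ± 30 | 301 ± 17 | 267 ± 34 | 245 ± 20 | 87 ± 26 | 136 ± 13
H, visit 2 | 346 ± 38 | 299 ± 18 | 237 ± 34 | 244 ± 19 | 87 ± 17 | 133 ± 14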
As Figure 2 and Table 3 suggest, for the 1.5 T group the distribution and the mean values are relatively consistent between the two visits with both Brainvisa and FAST. However, there is less consistency between centres, and this inconsistency is more pronounced with Brainvisa. More specifically, the values estimated for centre B seem to vary considerably from those of the other two centres.
The p-values for centre and visit effects (columns: 1.5 T group, 3 T group)
For both groups, on average, Brainvisa seems to classify more voxels as WM than FAST does. Figure 2 also shows that within the 1.5 T group, the GSI values corresponding to the two visits to centre B are less consistent than those for centres A or C. This observation is also confirmed by Table 5, which shows that the interaction of centre and visit is significant (p = 0.024), indicating that the between-visit variability for centre B is significantly different from those for centres A and C. The mean GSI was equal to 1.44 and 1.66 for the 1.5 T and 3 T groups, respectively. As GSI represents the ratio of the buried to the unburied cortex, the fraction of the whole cortex that is buried in sulci is equal to GSI/(1+GSI). Thus, on average, it has been estimated that 59% of the cortex is buried in sulci for the 1.5 T group and 62% for the 3 T group.
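The conversion from GSI to the buried fraction of the cortex can be checked with a short calculation. The sketch below is only a worked example of the GSI/(1+GSI) relation given above, using the two mean GSI values reported in the text; the function name is hypothetical.

```python
def buried_fraction(gsi):
    """Fraction of the whole cortex buried in sulci, given GSI = buried / unburied."""
    return gsi / (1.0 + gsi)


# Mean GSI values reported for the two groups.
for label, gsi in (("1.5 T", 1.44), ("3 T", 1.66)):
    print(f"{label}: GSI = {gsi}, buried fraction = {buried_fraction(gsi):.0%}")
```

For GSI = 1.44 the fraction is 1.44/2.44, about 0.59, and for GSI = 1.66 it is 1.66/2.66, about 0.62, matching the percentages quoted in the text.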
Visit and centre effects
The p-values of the main effects for centre and visit as well as their interaction are given in Table 5 for all morphometric measures. For sulcal parameters, the p-values were computed for all recognition algorithms.
The volume of cerebral hemisphere was calculated from the sum of GM, WM, and CSF for each hemisphere of each subject's brain. The comparison between the results for the hemisphere volume and those for the GM, WM, and CSF volumes allows for the assessment of robustness of the segmentation process.
As shown in Table 5, the interaction between the visit and centre factors was non-significant for all parameters except GSI for the 1.5 T group and GM and WM volumes for the 3 T group. The significance of the interaction term is indicative of inconsistency in the between-visit variability across different scanners. There are two possible situations which can lead to such inconsistency and therefore to significance of the interaction between the centre and visit factors: 1) compared to the first visit to each centre, the values estimated with the second visit are higher for some scanners but lower for others, or 2) for some scanners, the two visits produce similar results whereas for other scanners the estimations for the two visits are significantly different. As the visits do not follow any logical order, the significance of the interaction term in the first case may not necessarily be indicative of real inconsistency between scanners. This seems to be the case for GM and WM volumes of the 3 T group. In this case, as can be inferred from Figure 2, the direction of change from the first to the second visit varies across scanners. Conversely, for GSI of the 1.5 T group the second explanation applies, as the significance of the interaction term arises from higher between-visit variability for centre B compared to those for centres A and C.
Table 5 also indicates that for the 1.5 T group, the two visits to the same centre did not produce significantly different values for GM, WM, CSF, and hemisphere volumes; however, these values were significantly different across centres.
In the 3 T group, although the two visits produced similar results for hemisphere volume, the GM, WM, and CSF volumes varied significantly between the two visits. In addition, all volumes varied significantly across scanners. Nonetheless, both visits to all five centres produced similar results for GSI.
For sulcal parameters, the centre and visit factors as well as their interaction were non-significant for the 1.5 T group; for the 3 T group, however, the centre factor was significant for both surface and depth with all four algorithms.
Between-visit and between-centre reliabilities
The between-visit and between-centre reliabilities were computed for all parameters according to equation 1. As a rule of thumb, the reliability values smaller than 0.50, between 0.50 and 0.70, between 0.70 and 0.90, and greater than 0.90 were considered poor, moderate, good, and excellent, respectively.
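The rule of thumb above maps directly onto a small lookup. In the sketch below the function name is hypothetical, and the handling of values that fall exactly on a boundary is an assumption: the text specifies "smaller than 0.50" and "greater than 0.90" but does not say to which band 0.70 belongs, so it is assigned to "good" here.

```python
def reliability_grade(r):
    """Classify a reliability value using the rule of thumb from the text."""
    if not 0.0 <= r <= 1.0:
        raise ValueError("reliability must lie between 0 and 1")
    if r < 0.50:
        return "poor"
    if r < 0.70:
        return "moderate"
    if r <= 0.90:  # 0.90 itself is not "greater than 0.90"
        return "good"
    return "excellent"


# Hypothetical reliability values, for illustration only.
for value in (0.35, 0.62, 0.88, 0.95):
    print(value, reliability_grade(value))
```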
The between-visit and between-centre reliabilities for global parameters calculated using Brainvisa (columns: 1.5 T group, 3 T group; rows include cerebral hemisphere volume)
For the 3 T group, the between-centre and between-visit reliabilities of the hemisphere volume were close to excellent. However, following segmentation, the reliabilities of the resulting segmented volumes were mostly poor. It can therefore be concluded that segmentation led to a substantial reduction in the reliability. The between-visit and between-centre reliabilities of GSI for the 3 T group were moderate and poor, respectively.
The average between-visit and between-centre reliabilities of sulcal parameters (rows include mean geodesic depth)
Segmentation with FAST vs. Brainvisa
The p-values for segmentation with FAST (columns: 1.5 T group, 3 T group)
For the 3 T group, the centre and visit factors as well as their interaction were significant for all volumes, which suggests that some scanners produce less consistent results between the two visits. Figure 2 shows that this inconsistency mainly arises from centre E.
The between-visit and between-centre reliabilities for segmentation with FAST (columns: 1.5 T group, 3 T group)
We have assessed the robustness of Brainvisa in brain morphometry and estimation of the most widely used morphometric measures in terms of scanner-related variability and reliability. For this purpose we used two groups of retrospective datasets from two multicentre studies which included repeated scans acquired at 1.5 T and 3 T from healthy volunteers.
In some cases the morphometry results were significantly different across different scanners or across the two visits to the same scanner. It should be noted, however, that the results of this study correspond to small groups of subjects; with a larger dataset, even the non-significant cases may become significant. In addition, both between-centre and within-centre reliabilities ranged from poor to excellent for most parameters, which also emphasizes the impact of scanner-related factors. However, the within-centre reliability was found to be better than the between-centre reliability for almost all morphometric measures.
A comparison between the reliability values for the cerebral hemisphere volume and the segmented tissues (GM, WM, and CSF) revealed that while hemisphere volumes were very consistent both between and within scanners, the segmented volumes showed considerably different results in most cases. This implies that the inconsistency between the brain tissue volumes arose from the segmentation process. When the segmentation was carried out using the FAST algorithm within FSL, the reliabilities were mostly improved. Further investigation indicated that despite comparable values for the coefficients of variation within each tissue obtained with the two methods, FAST resulted in significantly higher WM/GM contrasts compared to Brainvisa. A comparison between the WM/GM contrasts obtained for different scanners revealed that scanner B of the 1.5 T group had significantly higher values compared to the other two scanners of the same group (A and C). This might be one reason for the disparity between the estimated volumes and GSI for centre B and those for centres A and C. However, for the 3 T group, an association between WM/GM contrasts and the estimated morphometric parameters was not observed. For example, despite significantly higher WM/GM contrasts for centres F and G compared to centre H, the estimations for the three centres were mostly consistent. It should be noted, however, that the average WM/GM contrasts were higher in the 3 T group relative to the 1.5 T group. This raises the possibility that the WM/GM contrast affects the morphometry results obtained with Brainvisa only on one side of some threshold.
It should be noted that the image acquisition parameters were slightly different across scanners within each group. Such disparities may add to the variability in the morphometric parameters across scanners.
As the bias correction with FAST is more robust compared to Brainvisa, it is suggested to perform the bias correction using FAST and repeat all the following steps to assess the reliabilities. This would allow the evaluation of the robustness of the following analysis steps using Brainvisa. The histogram-based approach of Brainvisa can then be compared to the Hidden Markovian Random Field model-based approach of FAST. Such a step-by-step comparison can in turn help in identifying the ways in which the program can be made more robust. Currently, it is not possible to skip the bias correction step and import a bias-corrected image. Instead, all steps need to be performed in sequence within the program.
Using both Brainvisa and FAST, the between-visit and between-centre reliabilities for GM and WM volumes were mostly lower than those in the calibration study of Schnack et al., which confirms the effectiveness of using a calibration factor for brain segmentation in multicentre studies.
For sulcus-specific attributes, the evaluation was performed with each of the four sulci identification algorithms so that the different algorithms could be compared. These algorithms vary in their approach to registering the brain to a 3D probabilistic atlas of the sulci (SPAM). On average, the Local and Markovian algorithms predicted higher values for mean geodesic depths compared to the Global and Talairach algorithms, suggesting that Local and Markovian tend to group deeper folds with the same label. In terms of reliability, for all sulcal parameters, Talairach showed the lowest reliabilities whereas Markovian and Global achieved the highest reliabilities. This confirms that registration of different brains to the Talairach atlas entails poor sulci alignment between individuals and that the other three algorithms are more successful in sulci alignment.
Nevertheless, the average reliabilities with all algorithms were mostly moderate. This clearly limits the suitability of sulcal surface and mean geodesic depth for multicentre or longitudinal studies. Further improvement may be achieved by improving the primary steps (bias correction and possibly histogram analysis) in order to obtain more reproducible estimations.
Due to the various sources of variability between the 1.5 T and 3 T groups (e.g. subjects, age, gender, field strength, number and types of scanners, and image acquisition parameters), the contribution of each source of variability to the total variance between the two groups cannot be estimated. Nonetheless, a qualitative comparison suggests that while reliabilities of segmented brain volumes are higher for the 1.5 T group compared to the 3 T group (with both Brainvisa and FAST), for sulcal parameters the reliabilities are higher for the 3 T group. This suggests that fold detection and brain segmentation are not equally affected by these factors. A prospective study with various field strengths, scanner types, and image acquisition parameters on the same group of subjects would be useful for independently investigating the effect of each factor on the reliability of the results. Further investigation of the contribution of each factor may also be useful for correcting for scanner-related variability, in addition to providing information about the required criteria (image resolution, for example) for achieving more robust morphometry results and higher reliabilities with Brainvisa.
In this paper we explored the consistency of brain morphometry results using Brainvisa among different scanners and different sessions of the same scanner. Our results indicate that there is occasionally considerable disparity between the values estimated for different scanners and different sessions. However, repeated scans on the same scanner produced more consistent results compared to those obtained with different scanners. These findings emphasize that for any kind of morphometry analysis with Brainvisa using data from multicentre or longitudinal studies, the scanner-related variability must be taken into account and, where possible, the resultant inconsistency should be corrected for. Furthermore, our findings provide a first step towards investigating possible improvements to Brainvisa.
This study was supported by SINAPSE http://www.sinapse.ac.uk. AB is supported by NARSAD Young Investigator Award. The CaliBrain study was funded by a Chief Scientist Office (Scotland) Project Grant (CZB/4/427), Chief Investigator Prof. S. Lawrie. Also part of the data came from the Psygrid consortium http://www.psygrid.org/ and the NeuroPsyGrid collaborative project http://www.neuropsygrid.org/. The authors thank Alex McConnachie and Martina Messow from Robertson Centre for Biostatistics, University of Glasgow, for their advice regarding the statistical analysis.
The pre-publication history for this paper can be accessed here: http://www.biomedcentral.com/1471-2342/11/23/prepub
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.