- Research article
- Open Access
Measuring liver fat fraction with complex-based chemical shift MRI: the effect of simplified sampling protocols on accuracy
BMC Medical Imaging volume 19, Article number: 14 (2019)
The assessment of liver percentage fat fraction (%FF) using proton density fat fraction sequences is becoming increasingly accessible. Previous studies have tended to use multiple small ROIs that focus on Couinaud segments. In an effort to simplify day-to-day analysis, this study assesses the impact of using larger, elliptical ROIs focused on a single hepatic lobe. Additionally, we assess the impact of sampling fewer transhepatic slices when measuring %FF.
Retrospective analysis of prospectively obtained images from 34 volunteers using an IDEAL IQ sequence. Two observers independently measured %FF using three different protocols: freehand whole-liver ROI (fh-ROI), elliptical-ROI on the right lobe (rt-ROI) and elliptical-ROI on the left lobe (lt-ROI).
Inter-observer reliability for all measurements techniques was ‘excellent’ (Spearman’s rank correlation coefficients 0.81–0.98). There was a significant difference (Paired Wilcoxon Test: p < 0.001) between the median %FF obtained using fh-ROI when compared to the rt-ROI method, the maximum mean difference between the two techniques was 2.79% (95% CI). For all sampling methods a Kruskall-Wallis analysis demonstrated no significant difference in mean %FF when the number of slices sampled was reduced from 11 to 1. The mean coefficient of variance increased when more slices were sampled (3 slices = 0.1, 11 slices = 0.17, p < 0.001).
Simplified ROIs focused on one hepatic lobe provide %FF measurements that are unlikely to be sufficiently accurate for use in clinical practice. Freehand whole-liver ROIs should be used in preference.
A single freehand ROI measurement taken at the level of the hepatic hilum yields a %FF that is representative of the mean whole liver % FF. Multiple slices are needed to measure heterogeneity.
Non-alcoholic fatty liver disease (NAFLD) is the most common chronic liver disease in developed nations with a prevalence of 20–30% in adults  and as much as 70–91% in high-risk patients such as those with obesity  and diabetes [3,4,5]. Among patients who have evidence of hepatic steatosis a proportion will go on to develop non-alcoholic steatohepatitis (NASH) and a subpopulation will progress  to cirrhosis, end-stage liver disease or related complications such as hepatocellular carcinoma (HCC) [7, 8].
The current gold standard for diagnosing and grading hepatic steatosis is core biopsy. This method is invasive and prone to sampling errors caused by heterogeneity of liver fat deposition . Alternative techniques that have been developed for the assessment of liver percentage fat fraction (%FF) include MRI and MR spectroscopy (MRS). These are non-invasive and thus more suitable for longitudinal follow-up as well as allowing larger regions of interest (ROI) to be sampled thus reducing sampling error .
A number of MRI-based methods currently used to quantify liver %FF have been validated by comparison to histological sampling and MRS [10,11,12]. MRS is regarded as the most accurate MRI method for the measurement of liver %FF.  MRI techniques (e.g. dual-echo Dixon or the more recent complex-based approaches that enable measurement of proton density fat fraction (PDFF)) have been shown to provide reliable quantification of liver %FF and are more widely available than MRS [13,14,15,16,17].
One widely used PDFF technique is the water–fat separation method “iterative decomposition of water and fat with echo asymmetry and least squares estimation” (IDEAL) . Co-registration and recombination of fat and water images provides pixel by pixel fat fraction maps . Measurement of fat fraction requires manual selection of an ROI on one or more slices, providing one or more values of average liver %FF. The sample of liver examined will depend on the area of the ROIs chosen and the number of slices of liver interrogated. Most of the previously described methods have used ROIs that are very small relative to the size of the liver  or focus on using ROIs centred on each Couinaud segment .
Ideally a method that samples all of the imaged liver parenchyma should be used, indeed some semi-automated computer-based post-processing methods that do this have been described . However, these tools use image recognition software that is not widely available and often still requires manual corrections. For day-to-day analysis in a non-research setting post-processing commonly involves the manual drawing of an ROI using software provided by the MRI manufacturer. In this situation if the entire liver parenchyma were to be sampled this would require individual freehand ROI to be drawn around the liver on every slice where parenchyma can be seen, but this is time-consuming and labour-intensive.
Evaluation times could be reduced by employing the following techniques:
Using single large elliptical ROIs that approximate to the size of a hepatic lobe, rather than laboriously drawing a freehand ROI around the liver edge.
Reducing the number of slices sampled, rather than aggregating ROIs which have been drawn on every slice that contains liver parenchyma.
However, in theory these simplified protocols increase the chance of sampling error.
The purpose of this study is to assess the accuracy of %FF measurements obtained when using these simplified sampling protocols (i.e. large elliptical ROIs and reducing the number of slices sampled). Freehand drawn ROIs that sample the whole liver will be used as a reference standard.
Methods and materials
Ethical approval for this study was obtained from the local Research Ethics Committee. Informed written consent was obtained from each participant. 34 volunteers were recruited from Radiology department staff at the Norfolk and Norwich University Hospital. An equal number of male and female volunteers were selected from a variety of different body mass indices (BMIs), these ranged from normal (BMI 18–25) to obese (BMI > 30). The population used in this study was originally recruited to take part in two test-retest reliability studies investigating whole body muscle and body-compartment fat quantification [22, 23].
Magnetic resonance imaging
Each participant underwent MR imaging of the liver on a MR750w wide bore 3 T MR machine (General Electric Medical Systems Ltd., Hatfield, UK) using an integrated quadrature body coil. IDEAL IQ (3D technique, NEX: 0.5, Bandwidth: 111.11 kHz, TEs per scan: 6, Number Shots: 2, Minimum TE: 0.9 ms, Maximum TE: 4.6 ms, Auto Flip angle: 3°, Echo train length: 3, TR: 6 ms, FOV: 50.0 × 50.0 cm, Slice Thickness: 10.0 mm) MR images were acquired with the volunteers supine and with arms by their sides. Fourteen axial slices were taken through the region of the liver and used for our analysis.
Regions of interest (ROI)
Two observers (radiology residents) independently measured the %FF using two different protocols on a dedicated post-processing workstation (AW VolumeShare 5, GE Healthcare). The first protocol was to generate a freehand whole liver ROI (fh-ROI) drawn around the margins of the entire liver excluding fat at the porta hepatis and falciform ligament. The second protocol was to draw an elliptical ROI over the centre of the right (rt-ROI) and left lobes (lt-ROI) of the liver respectively aiming to create as large an ellipse as possible whilst keeping it within the boundaries of each lobe (Fig. 1). The inferior vena cava and portal vein were excluded from the ROIs but smaller intrahepatic vessels and bile ducts were not excluded.
The mean %FF within each type of ROI was recorded for each observer and measurements were made on all fourteen slices where there was identifiable liver parenchyma. To allow for the variable amount of liver covered by each individual scan, and to facilitate slice-to-slice comparison between individuals, a note was made of the axial slice number that included the hilum of the porta hepatis. For the purposes of further analysis this hilar slice was set as “slice 0” with the more cranial slices recorded as positive, and the more caudal slices as negative, positions relative to the hilum (Fig. 2). As a result of anatomical differences and some variations in liver coverage, this ‘hilar correction’ generated slice positions ranging from − 10 through to + 9.
Statistics were performed using the R software environment . Q-Q plots and Shapiro-Wilk tests for normality were performed. Descriptive statistics included a coefficient of variation (CV), which was calculated as a relative standard deviation of the fat fraction and presented as a percentage. Inter-observer reliability was measured using Spearman’s rank correlation coefficients and 95% limits of agreement derived from Bland-Altman plots. Kruskal-Wallis tests, with subsequent post-hoc analyses, were used for multiple hypothesis testing of non-parametric data.
Slice-by-slice mean %FF and the standard deviation were calculated for the fh-ROI, rt-ROI and lt-ROI measurements obtained from all volunteers. When these calculations were plotted against slice number, deviations from the mean and increases in standard deviation were demonstrated at the most cranial and most caudal slices (Fig. 3).
Frequency histograms of the mean %FF demonstrated a skewed distribution for all sampling methods (Fig. 4). Q-Q plots and Shapiro-Wilk tests for normality confirmed that all samples failed to conform to a parametric distribution (Shapiro-Wilk P = 0.001 or less for all datasets at significance level of 0.05) and therefore non-parametric descriptive and hypothesis testing statistics were used.
The median %FF for fh-ROIs was 4.0 (IQR 3.1–6.0) for observer 1 and 4.51 (IQR 3.2–6.2) for observer 2. The medians for rt-ROI and lt-ROI were 3.31 (IQR 2.4–5.6) and 4.54 (IQR 3.9–6.5) respectively for observer 1, and 3.31 (IQR 2.4–5.6) and 4.75 (IQR 3.7–6.1) for observer 2. The Spearman’s rank correlation coefficients demonstrated “almost perfect” agreement  between the two observers with reasonably narrow 95% limits of agreement derived from the Bland-Altman plots (Table 1).
There were significant differences demonstrated between the three sampling techniques (Kruskal-Wallis test P = 0.032 for observer 1, and P = 0.036 for observer 2). Post hoc pairwise analysis with Wilcoxon Rank Sum tests performed with Holm correction for multiplicity in paired data demonstrated that the significant differences were between rt-ROI and fh-ROI, as well as between rt-ROI and lt-ROI (Table 2). The difference between the mean rt-ROI and fh-ROI values was found to be at most 2.79%.
The mean %FF and %FF variance were then compared for datasets comprising 1 to 19 slices in two slice increments. For each category interval one slice was added above and below to the slice centred on the hilum at position “0”. Comparison of each of these datasets was performed using the Kruskal-Wallis test, which revealed that there was no significant difference in the mean %FF (P = 0.31). Post-hoc analysis demonstrated no significant pairwise differences for any comparisons between 1 and 11 slices. A number of significant pairwise differences were present between the 17 and 19 slice datasets, which included the most cranial and caudal slices, and all other slices. The Kruskal-Wallis test demonstrated a significant difference in the multiple comparisons of coefficient of variance (P < 0.001) (Fig. 5). Post hoc analysis revealed that significant differences were present in all pairwise comparisons except between the “3” and “5” slice datasets (P = 0.19).
While drawing regions of interest it became evident that the most cranial and caudal slices through the liver were prone to uncharacteristically high %FF values. This can be explained by the inclusion of the surrounding intra-abdominal and peri-hepatic fat within the most cranial and most caudal voxels by partial volume effect. The same finding was also noted by Kim et al. when evaluating different ROI measurements using MRS .
Our study demonstrated that left lobe %FF measured, using an ellipsoid ROI, was significantly different to the right lobe, with the %FF values for the left lobe being higher. A higher %FF in the left lobe has not been reported by previous studies, which have suggested that there is either no significant difference [20, 26] or that the %FF within the right lobe is higher than the left [9, 27, 28]. Reasons for this difference could relate to technical factors. Difficulty drawing a representative ellipse on some slices because of the small size of the left lobe might have introduced partial volume effects, or the inclusion of small vessels in the large right ROI could have contributed to a reduced %FF, although not all prior studies have excluded small vessels from their ROIs. Population differences between studies could also account for these findings, the current study is the first to assess multiple healthy volunteers from a range of different BMIs.
A significant difference was found when comparing the median %FF obtained using the freehand whole liver ROI and the elliptical right lobe ROI in both observers (p < 0.001). This finding might suggest that the simpler, quicker method of drawing an ellipse cannot replace the more laborious freehand drawn ROI. This result is supported by similar findings reported by Vu et al. and Hong et al. who suggest that sampling a single lobe using combinations of smaller ROIs is not sufficient. Both of these studies have suggested the use of ROI strategies that sample at minimum both lobes, and if possible each liver segment [20, 29].
When the difference between the mean right elliptical ROI and the reference freehand whole liver ROI values was calculated, it was found to be at most 2.79%. If a 3% margin of error is a clinically acceptable trade-off for a faster measurement in day-to-day use, then an elliptical ROI of the right lobe could still be used as a faster way to measure mean %FF. However, although not universally agreed, a commonly quoted %FF cut-off value for defining “hepatic steatosis” using MRI-PDFF is 5.6% . Important decisions such as transplant donor acceptability may be made using this threshold . In these cases a %FF value derived from an elliptical ROI could vary by up to 3% from the reference, a patient could only be acceptably classified as “steatotic” when their elliptical right lobe ROI was > 9%. This is likely to be too high to have practical uses.
Our study measured %FF on every slice that contained visible liver parenchyma. From this data set it was possible to establish whether the number of slices used to calculate the average %FF caused this to change significantly. A Kruskal-Wallis analysis demonstrated that when using a freehand drawn whole liver ROI there is no significant difference in the overall mean %FF if only one slice was sampled compared to sampling multiple slices. This suggests that one freehand whole liver ROI drawn on a single slice will provide a %FF that is representative of the entire liver.
On the other hand the variance of %FF values increased as more slices were included in the calculations. This trend was seen with both freehand whole liver ROI and elliptical ROI methods. This result is likely to be attributable to the heterogeneous distribution of fat within the liver e.g. due to anatomical variance of focal fatty infiltration. Therefore if a measure of the heterogeneity of liver fat distribution is of clinical interest, then an accurate measurement can only be obtained by using ROIs on a large number of slices. However as mentioned previously, extending the number of slices to include the most cranial and caudal sections may reduce the accuracy of whole liver %FF measurement.
The results showed “excellent” inter-observer agreement for both the freehand whole liver ROI and elliptical ROI protocols, with narrow limits of agreement, which suggests that both types of ROI yield reproducible results.
This study has some limitations relating to standardization of method compared to previous studies. These measurements were made using a vendor specific (IDEAL IQ) sequence on a wide bore 3 T MR machine and therefore may be specific to this arrangement, although a previously published study has demonstrated good reproducibility between different platforms when using IDEAL IQ . In our study no attempt was made to exclude small hepatic vessels and bile ducts from the ROI. This methodology is similar to other published studies . Although some studies using smaller ROIs that do not sample the whole liver parenchyma have avoided intrahepatic vessels, [17, 21, 32] it is not known whether or not this significantly affects accuracy.
Our study is the first to assess %FF sampling the use of large, solitary, elliptical ROIs focused on one hepatic lobe as an alternative to using freehand drawn ROIs that enclose the whole liver. Our findings, consistent with prior studies demonstrate that the difference between %FF obtained when using these simplified methods is unlikely to be clinically acceptable. We would therefore recommend that freehand drawn ROIs encompassing the whole liver are used in preference.
Additionally, this study assesses the impact that varying the number of slices analysed has on the average %FF obtained when compared to whole liver %FF. Our findings suggest that there is no significant difference in the overall mean %FF if only one slice was sampled compared to sampling multiple slices. However, if insight into the variability of liver fat deposition is of interest, then multiple slices that contain visible liver parenchyma should be assessed. Although the data suggests that measurements from the most cranial and caudal slices show marked differences in %FF that are likely to be artefactual and inclusion of these slices may reduce accuracy.
Both the simplified elliptical ROI and freehand ROI methods demonstrated good inter-observer reliability and consistency at a wide range of BMI measurements.
Conclusions in brief
The use of large, solitary, elliptical ROIs focused on one hepatic lobe provides %FF measurements that are unlikely to be sufficiently accurate for use in clinical practice. Freehand whole-liver ROIs should be used in preference.
When using freehand ROI measurements analysis of a single slice at the level of the hepatic hilum yields a %FF that is representative of the mean % FF of the whole liver.
Both the simplified elliptical ROI and freehand ROI methods demonstrated good inter-observer reliability and consistency at a wide range of BMI measurements.
Percentage fat fraction
Coefficient of variation
Free-hand (whole liver) region of interest
- IDEAL IQ:
Iteraterative decomposition of water and fat with echo asymmetry and least-squares estimation quantitation sequence
Inter quartile range
Elliptical left lobe region of interest
Magnetic resonance imaging
Magnetic resonance spectroscopy
Non-alcoholic fatty liver disease
Proton density fat fraction
Region of interest
Elliptical right lobe region of interest
Clark JM. The epidemiology of nonalcoholic fatty liver disease in adults. J Clin Gastroenterol. 2006;40(Suppl 1):S5–10.
Gholam PM, Flancbaum L, Machan JT, Charney DA, Kotler DP. Nonalcoholic fatty liver disease in severely obese subjects. Am J Gastroenterol. 2007;102:399–408.
Tolman KG, Fonseca V, Dalpiaz A, Tan MH. Spectrum of liver disease in type 2 diabetes and management of patients with diabetes and liver disease. Diabetes Care. 2007;30:734–43.
Angulo P. Nonalcoholic fatty liver disease. N Engl J Med. 2002;346:1221–31.
Browning JD, Szczepaniak LS, Dobbins R, Nuremberg P, Horton JD, Cohen JC, et al. Prevalence of hepatic steatosis in an urban population in the United States: impact of ethnicity. Hepatol Baltim Md. 2004;40:1387–95.
Adams LA, Lymp JF, St Sauver J, Sanderson SO, Lindor KD, Feldstein A, et al. The natural history of nonalcoholic fatty liver disease: a population-based cohort study. Gastroenterology. 2005;129:113–21.
Bugianesi E, Leone N, Vanni E, Marchesini G, Brunello F, Carucci P, et al. Expanding the natural history of nonalcoholic steatohepatitis: from cryptogenic cirrhosis to hepatocellular carcinoma. Gastroenterology. 2002;123:134–40.
Starley BQ, Calcagno CJ, Harrison SA. Nonalcoholic fatty liver disease and hepatocellular carcinoma: a weighty connection. Hepatol Baltim Md. 2010;51:1820–32.
Kim KY, Song JS, Kannengiesser S, Han YM. Hepatic fat quantification using the proton density fat fraction (PDFF): utility of free-drawn-PDFF with a large coverage area. Radiol Med (Torino). 2015;120:1083–93.
Idilman IS, Keskin O, Celik A, Savas B, Elhan AH, Idilman R, et al. A comparison of liver fat content as determined by magnetic resonance imaging-proton density fat fraction and MRS versus liver histology in non-alcoholic fatty liver disease. Acta Radiol. 2016;57:271–8.
d’Assignies G, Kauffmann C, Boulanger Y, Bilodeau M, Vilgrain V, Soulez G, et al. Simultaneous assessment of liver volume and whole liver fat content: a step towards one-stop shop preoperative MRI protocol. Eur Radiol. 2011;21:301–9.
Bannas P, Kramer H, Hernando D, Agni R, Cunningham AM, Mandal R, et al. Quantitative magnetic resonance imaging of hepatic steatosis: validation in ex vivo human livers. Hepatol Baltim Md. 2015;62:1444–55.
Hetterich H, Bayerl C, Peters A, Heier M, Linkohr B, Meisinger C, et al. Feasibility of a three-step magnetic resonance imaging approach for the assessment of hepatic steatosis in an asymptomatic study population. Eur Radiol. 2016;26:1895–904.
Kinner S, Reeder SB, Yokoo T. Quantitative imaging biomarkers of NAFLD. Dig Dis Sci. 2016;61:1337–47.
Krishan S, Jain D, Bathina Y, Kale A, Saraf N, Saigal S, et al. Non-invasive quantification of hepatic steatosis in living, related liver donors using dual-echo Dixon imaging and single-voxel proton spectroscopy. Clin Radiol. 2016;71:58–63.
Kang GH, Cruite I, Shiehmorteza M, Wolfson T, Gamst AC, Hamilton G, et al. Reproducibility of MRI-determined proton density fat fraction across two different MR scanner platforms. J Magn Reson Imaging JMRI. 2011;34:928–34.
Yokoo T, Shiehmorteza M, Hamilton G, Wolfson T, Schroeder ME, Middleton MS, et al. Estimation of hepatic proton-density fat fraction by using MR imaging at 3.0 T. Radiology. 2011;258:749–59.
Reeder SB, McKenzie CA, Pineda AR, Yu H, Shimakawa A, Brau AC, et al. Water-fat separation with IDEAL gradient-echo imaging. J Magn Reson Imaging JMRI. 2007;25:644–52.
Reeder SB, Robson PM, Yu H, Shimakawa A, Hines CDG, McKenzie CA, et al. Quantification of hepatic steatosis with MRI: the effects of accurate fat spectral modeling. J Magn Reson Imaging JMRI. 2009;29:1332–9.
Vu K-N, Gilbert G, Chalut M, Chagnon M, Chartrand G, Tang A. MRI-determined liver proton density fat fraction, with MRS validation: comparison of regions of interest sampling methods in patients with type 2 diabetes. J Magn Reson Imaging JMRI. 2016;43:1090–9.
Campo CA, Hernando D, Schubert T, Bookwalter CA, Pay AJV, Reeder SB. Standardized approach for ROI-based measurements of proton density fat fraction and R2* in the liver. Am J Roentgenol. 2017;209:592–603.
Thomas MS, Newman D, Leinhard OD, Kasmai B, Greenwood R, Malcolm PN, et al. Test-retest reliability of automated whole body and compartmental muscle volume measurements on a wide bore 3T MR system. Eur Radiol. 2014;24:2279–91.
Newman D, Kelly-Morland C, Leinhard OD, Kasmai B, Greenwood R, Malcolm PN, et al. Test-retest reliability of rapid whole body and compartmental fat volume quantification on a widebore 3T MR system in normal-weight, overweight, and obese subjects. J Magn Reson Imaging JMRI. 2016;44:1464–73.
R Core Team. R: a language and environment for statistical computing. Vienna: foundation for statistical Computing; 2013. http://www.R-project.org/
Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics. 1977;33:159–74.
Idilman IS, Aniktar H, Idilman R, Kabacam G, Savas B, Elhan A, et al. Hepatic steatosis: quantification by proton density fat fraction with MR imaging versus liver biopsy. Radiology. 2013;267:767–75.
Bonekamp S, Tang A, Mashhood A, Wolfson T, Changchien C, Middleton MS, et al. Spatial distribution of MRI-determined hepatic proton density fat fraction in adults with nonalcoholic fatty liver disease. J Magn Reson Imaging JMRI. 2014;39:1525–32.
Capitan V, Petit J-M, Aho S, Lefevre P-H, Favelier S, Loffroy R, et al. Macroscopic heterogeneity of liver fat: an MR-based study in type-2 diabetic patients. Eur Radiol. 2012;22:2161–8.
Hong CW, Wolfson T, Sy EZ, Schlein AN, Hooker JC, Fazeli Dehkordy S, et al. Optimization of region-of-interest sampling strategies for hepatic MRI proton density fat fraction quantification: optimization of ROI strategies for MRI-PDFF. J Magn Reson Imaging. 2018;47:988–94.
European Association for the Study of the Liver (EASL). European Association for the Study of diabetes (EASD), European Association for the Study of obesity (EASO). EASL-EASD-EASO clinical practice guidelines for the management of non-alcoholic fatty liver disease. J Hepatol. 2016;64:1388–402.
Wu B, Han W, Li Z, Zhao Y, Ge M, Guo X, et al. Reproducibility of intra- and inter-scanner measurements of liver fat using complex confounder-corrected chemical shift encoded MRI at 3.0 tesla. Sci Rep. 2016;6:19339.
McPherson S, Jonsson JR, Cowin GJ, O’Rourke P, Clouston AD, Volp A, et al. Magnetic resonance imaging and spectroscopy accurately estimate the severity of steatosis provided the stage of fibrosis is considered. J Hepatol. 2009;51:389–97.
Norfolk and Norwich imaging department research fund (to cover publication costs).
Availability of data and materials
The datasets generated and/or analysed during the current study are not publicly available due to patient confidentiality but are available from the corresponding author on reasonable request.
Ethics approval and consent to participate
Ethical approval for this study was obtained from the local Research Ethics Committee (NRES Committee East of England – Essex) on 12/10/2012.
REC reference: 12/EE/0448.
Informed written consent was obtained from each participant.
Consent for publication
Informed written consent was obtained from each participant.
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
About this article
Cite this article
Procter, A.J., Sun, J.Y., Malcolm, P.N. et al. Measuring liver fat fraction with complex-based chemical shift MRI: the effect of simplified sampling protocols on accuracy. BMC Med Imaging 19, 14 (2019). https://doi.org/10.1186/s12880-019-0311-y