Abstract
Background:
Magnetic resonance imaging-estimated proton-density-fat-fraction (MRI-PDFF) has been shown to be a noninvasive, accurate and reproducible imaging-based biomarker for assessing steatosis and treatment response in nonalcoholic steatohepatitis (NASH) clinical trials. However, there are no data on the magnitude of MRI-PDFF reduction corresponding to histologic response in the setting of a NASH clinical trial. The aim of this study was to quantitatively compare the magnitude of MRI-PDFF reduction between histologic responders versus histologic nonresponders in NASH patients.
Methods:
This study is a secondary analysis of the MOZART trial, which included 50 patients with biopsy-proven NASH randomized to ezetimibe 10 mg/day orally or placebo for 24 weeks. The primary aim was to perform a head-to-head comparative analysis of histologic responders [defined as a ⩾2-point reduction in the nonalcoholic fatty liver disease (NAFLD) Activity Score (NAS) without worsening fibrosis] versus nonresponders, and the corresponding quantitative change in liver fat content measured via MRI-PDFF.
Results:
Of the 35 patients who underwent paired liver biopsy and MRI-PDFF assessment at the beginning and end of treatment, 10 demonstrated a histologic response. Compared with histologic nonresponders, histologic responders had a statistically significant reduction in MRI-PDFF of −4.1% ± 4.9 versus −0.6 ± 4.1 (p < 0.04) with a mean relative percent change of −29.3% ± 33.0 versus +2.0% ± 24.0 (p < 0.004), respectively.
Conclusions:
Utilizing paired MRI-PDFF and liver histology data, we demonstrate that a relative reduction of 29% in liver fat on MRI-PDFF is associated with a histologic response in NASH. After external validation by independent research groups, these results can be incorporated into designing future NASH clinical trials, especially those utilizing change in hepatic fat quantified by MRI-PDFF, as a treatment endpoint.
Keywords
Introduction
Nonalcoholic steatohepatitis (NASH) has become one of the most common causes of liver disease in Western countries [Chalasani et al. 2012; Loomba and Sanyal, 2013] and an important indication for liver transplantation [Charlton et al. 2011]. Nonalcoholic fatty liver disease (NAFLD) afflicts 80–100 million Americans [Browning et al. 2004; Vernon et al. 2011; Williams et al. 2011]. Approximately 30% of these individuals may have the progressive form of NAFLD, termed as NASH. It is associated with insulin resistance, diabetes mellitus type 2, and dyslipidemia [Neuschwander-Tetri et al. 2010; Chalasani et al. 2012; Loomba et al. 2012]. Liver histology findings associated with NASH include macrovesicular steatosis in acinar zone 3, lobular inflammation, and hepatocyte ballooning, all of these with or without perisinusoidal fibrosis [Matteoni et al. 1999]. NASH can further progress to advanced fibrosis, cirrhosis and hepatocellular carcinoma (HCC) [Matteoni et al. 1999; Adams et al. 2005; Vernon et al. 2011; Singh et al. 2014]. Liver biopsy is the current gold standard for diagnosing NASH, but it is an expensive, invasive procedure with high sampling error and is associated with complications such as pain, bleeding and death [Bravo et al. 2001; Ratziu et al. 2005; Chalasani et al. 2012; Sumida et al. 2014]. It is not feasible to perform liver biopsy assessment for all NASH patients and given its subjective nature, it often requires a large number of patients in clinical trials to show treatment response; thus noninvasive objective biomarkers are needed.
Imaging modalities are increasingly used as noninvasive methods for diagnosis and assessment of NASH. Ultrasound and computed tomography (CT) are convenient, widely available and safe but have their limitations [Saadeh et al. 2002; Mishra and Younossi, 2007]. In a meta-analysis, ultrasound studies had limited sensitivity with mild hepatic steatosis <30% and were found to be an unreliable and operator-dependent imaging technique [Bohte et al. 2011]. CT also has low sensitivity for mild steatosis, along with costs and radiation exposure. More importantly neither of these studies can quantify liver fat content [Bohte et al. 2011]. Magnetic resonance spectroscopy (MRS) is a safe, noninvasive and accurate method for hepatic fat quantification with high sensitivity and specificity but remains primarily a research tool due to its low availability and limited clinical application [Szczepaniak et al. 2005; Schwenzer et al. 2009; Wong et al. 2012]. Furthermore, MRS has been used for assessment of treatment response [Keating et al. 2015] but the precise decline in hepatic fat associated with histologic change remains unknown. Advanced magnetic resonance imaging (MRI) can measure the proton density fat fraction (PDFF), an objective and quantitative indicator of hepatic fat content, across the entire liver in an accurate, reproducible manner [Yokoo et al. 2009; Kang et al. 2011; Tang et al. 2013; Negrete et al. 2014] while minimizing errors due to factors such as T1 bias, T2* decay, and multifrequency signal interference effects of protons in fat that confound fat quantification with conventional MRI [Liu et al. 2007; Reeder and Sirlin, 2010]. MRI-estimated PDFF has been validated against MRS [Yokoo et al. 2009, 2011; Kang et al. 2011] as well as liver biopsy histological data [Tang et al. 2013, 2015] and was found to be more sensitive in detecting changes in hepatic fat content than histology-determined steatosis grade [Noureddin et al. 2013]. MRI-PDFF is emerging as the leading biomarker for assessing treatment response in NASH trials and has been utilized in several early phase II studies to monitor longitudinal changes in hepatic fat [Le et al. 2012; Patel et al. 2014].
Despite the growing need for noninvasive methods of screening and evaluation of NASH progression, along with the increasing use of MRI-PDFF in clinical trials for hepatic fat assessment, there are no data on the magnitude of MRI-PDFF reduction corresponding to histologic response in the setting of a NASH clinical trial. Given the emergence of MRI and MRS-PDFF as the leading biomarkers for hepatic steatosis, these data are of paramount importance for the design of future and ongoing clinical trials. In this study, we aimed to quantitatively compare MRI-PDFF change between histologic responders versus histologic nonresponders in patients with biopsy-proven NASH participating in a randomized clinical trial.
Methods
Study design and patient population
This is a secondary longitudinal analysis of the MOZART trial, a randomized, placebo-controlled, double-blind study of the efficacy of ezetimibe versus placebo in reducing hepatic fat as measured by MRI-PDFF technique in biopsy-proven NASH patients [Loomba et al. 2014]. In the MOZART trial, 50 patients were randomized to receive ezetimibe 10 mg orally daily or placebo for 24 weeks while also undergoing biochemical testing, liver biopsy, and MRI at baseline and at completion of therapy. In this trial, patients were enrolled from San Diego Integrated NAFLD Research Consortium (SINC) cohort, a collaboration between the University of California San Diego (UCSD) Medical Center, Sharp Health System, Naval Medical Center San Diego, and Kaiser Permanente Southern California to study NAFLD and led by the principal investigator (Rohit Loomba). Clinical research assessments were performed at the UCSD NAFLD Translational Research Unit and MRI-PDFF was performed by the UCSD Liver Imaging Group. All enrolled patients provided written informed consent for their participation in the MOZART trial. The study protocol was approved by the UCSD Institutional Review Board.
Patients in the MOZART trial who underwent both liver biopsy and MRI-PDFF at weeks 0 and 24 were included in this secondary analysis (Figure 1). Patients were classified as ‘histologic responders’ if they had ⩾2 point reduction in NAFLD Activity Score (NAS) without any increase in fibrosis stage and as ‘histologic nonresponders’ if they did not meet this criterion. As previously shown in a meta-analysis of NASH clinical trials, a 2-point change in histologic parameters is rarely seen in placebo patient groups and is a reliable indicator for treatment-related changes [Loomba et al. 2008].

Study design and derivation of cohort for this secondary analysis.
Inclusion and exclusion criteria
Inclusion criteria included: patients aged ⩾18 years with liver biopsy-confirmed NASH as defined by the Nonalcoholic Steatohepatitis Clinical Research Network (NASH CRN) histologic scoring system, MRI with evidence of hepatic steatosis with MRI-PDFF ⩾5%, liver biopsy and MRI-PDFF performed at week 0 and week 24 of MOZART trial, and serum alanine aminotransferase (ALT) or aspartate aminotransferase (AST) levels above the upper limit of normal (19 U/l for women and 30 U/l for men). The criteria included AST and ALT values that were lower than standard upper limits of normal at most laboratories since they were determined from prior studies to have higher sensitivities for NAFLD [Prati et al. 2002]. Exclusion criteria included the following: (1) evidence of other forms of liver disease based on liver biopsy findings or laboratory findings of positive hepatitis B surface antigen, hepatitis C viral RNA, autoimmune hepatitis, hemochromatosis, Wilson’s disease, and alpha-1-antitrypsin deficiency, (2) alcohol intake above 30 g per day in the prior 10 years or above 10 g per day in the previous 1 year, (3) Child-Pugh score >7 points, and (4) active substance abuse. Patients were also excluded if they had severe systemic illnesses including renal insufficiency, human immunodeficiency virus infection, positive pregnancy test, evidence of hepatocellular carcinoma, contraindications to liver biopsy or MRI exam, or any use of steatogenic medications or drug agents such as vitamin E or pioglitazone, known to improve NASH.
Primary outcome measurement and rationale for study
The primary aim was to perform a head-to-head comparative analysis of histologic responders, defined as a ⩾2-point reduction in the NAS (please see liver histologic assessment section) without worsening fibrosis, versus nonresponders, and the corresponding quantitative change in liver fat content measured via MRI-PDFF. These data are needed for future NASH phase IIa and IIb clinical trial design utilizing MRI-PDFF as a biomarker for treatment response.
Baseline clinical evaluation
All patients were screened with history, physical examination, an alcohol history assessment at the Clinical and Translational Research Institute at UCSD. The Alcohol Use Disorders Identification Test and the Skinner Lifetime Drinking Questionnaire were used to assess alcohol use history. All patients prior to treatment initiation underwent an evaluation, including routine history and physical exam, vital sign measurements, body height and weight measurements, and body mass index (BMI) calculation performed by a trained investigator. All subjects had blood tests for measurement of AST, ALT, gamma-glutamyl transpeptidase (GGT), alkaline phosphatase, albumin, prothrombin time, lipid panel, hemoglobin A1c (HbA1c), fasting glucose, insulin, total bilirubin, and free fatty acids.
Liver histology assessment
Liver biopsy was performed on all patients prior to inclusion in the trial and at the conclusion of the trial at week 24 on patients that consented for repeat biopsy. The liver biopsy specimens obtained had an average length of 17.0 mm (SD ± 6.6) with average number of 14.9 (SD ± 7.9) portal tracts, which are characteristic of acceptable liver biopsy specimens performed for NASH clinical trials [Neuschwander-Tetri et al. 2010; Le et al. 2012]. All liver histology assessments for the study were interpreted by a single hepatopathologist (Mark Valasek) blinded to clinical and imaging data, prior liver biopsy data and order of liver biopsy. The biopsies were scored using NASH-CRN histologic scoring system [Kleiner et al. 2005]. The NAS ranges from 0–8 and is the sum of scores of histologic variables of steatosis (0–3), lobular inflammation (0–3), and hepatocellular ballooning (0–2). Liver fibrosis was scored as stage 0–4 using the NASH-CRN scoring system, with stage 4 indicating cirrhosis.
Magnetic resonance imaging protocol
MRI exams were performed at baseline and at week 24 to measure PDFF as a noninvasive indicator of liver fat content using a previously described, validated and reproducible technique that estimates PDFF [Reeder et al. 2009; Hines et al. 2011; Permutt et al. 2012; Tang et al. 2013, 2015]. In brief, this advanced technique utilizes a gradient echo sequence with low flip angle to minimize T1 bias and acquires multiple echoes at echo times at which fat and water signals are nominally in phase or out of phase with each other. The acquired source images are processed through an algorithm that, pixel by pixel, corrects for the T2* signal decay and multi-fat-peak spectral interference effects that confound results of conventional MRI examinations. The resulting parametric PDFF map illustrates fat distribution and quantity across the entire liver. Hepatic fat quantification via MRI-PDFF has been previously shown to be sensitive in detecting liver fat changes and has been utilized in NASH clinical trials for quantitative fat assessment [Le et al. 2012; Permutt et al. 2012; Doycheva et al. 2015; Loomba et al. 2015; Zarrinpar et al. 2015].
All MRI-PDFF parametric maps were analyzed under the supervision of an experienced hepatoradiologist (Claude Sirlin) [Hines et al. 2011; Kang et al. 2011] who was blinded to clinical and histological data, and order of scan. Trained image analysts placed regions of interest (ROIs) about 300–400 mm2 in area in each of the nine liver segments while carefully evading blood vessels, bile ducts and artifacts in the region. To quantify longitudinal hepatic fat content changes between the baseline exam and follow-up exam, ROIs in each segment were colocalized using anatomic landmarks on maps obtained at each time point, and PDFF change in each segment was calculated.
Statistical analysis
Patients with NAS score reduction ⩾2 points were classified as histologic responders and the remaining patients were classified as histologic nonresponders. For categorical variables, the Chi-square test or Fisher’s exact test was used and for continuous variables, the Wilcoxon-Mann-Whitney test or independent samples Student’s t-test was used to compare histologic responder and nonresponder groups. For the comparative analyses within groups, we utilized the Wilcoxon signed rank sum test or paired samples Student’s t-test as appropriate. A p-value < 0.05 was considered statistically significant. All statistical analyses were performed using SAS version 9.4 (Cary, NC, USA).
Results
Demographic, biochemical, and histological baseline characteristics of patients
Of the 50 patients randomized into the MOZART Trial, 35 patients were included in this secondary analysis with 10 patients in the histologic responders group and 25 patients in the histologic nonresponders group (Figure 1). Histologic responders and nonresponders had similar baseline demographics and histological characteristics, with a median age of 60.5 years and 70% female patients for histologic responders, and a median age of 49 years and 64% female patients for histologic nonresponders (Table 1). There was a significant difference in BMI between the two groups with a median BMI of 30.6 in the histologic responder group versus 35.0 in the histologic nonresponder group (p = 0.01), although no significant difference was observed for weight and height (Table 1). Biochemical profiles of both groups were comparable with the exception of the homeostatic model assessment of insulin resistance (HOMA-IR) (p = 0.02; Table 1).
Baseline demographic, biochemical, and histological characteristics of patients.
All above values are reported as median (interquartile range) or n (%). p-values were determined using Wilcoxon-Mann-Whitney test for continuous variables and Chi-square test or Fisher’s Exact test for categorical variables as appropriate.
ALT, alanine aminotransferase; Alk Phos, alkaline phosphatase; AST, aspartate aminotransferase; BMI, body mass index; GGT, gamma-glutamyl transferase; HbA1c, glycated hemoglobin; HOMA-IR, homeostatic model assessment – insulin resistance; LDL, low-density lipoprotein; NAFLD, nonalcoholic fatty liver disease; NAS, NAFLD activity score.
Primary outcome: change in MRI-PDFF estimated hepatic fat associated with histologic response
There was a significant reduction in MRI-PDFF in the histologic responder group (−4.1% ± 4.9) compared with nonresponder group (−0.6% ± 4.1), (p = 0.04; Table 2). The significant decline in MRI-PDFF was consistently seen across most liver segments in the histologic responder group (Table 2). Compared with histologic nonresponders, histologic responders had a statistically significant proportional reduction in MRI-PDFF relative to baseline with a mean percent change of −29.3% ± 33.0 versus 2.0% ± 24.0, (p < 0.004; Figure 2).
MRI-PDFF-estimated liver fat by segments in histologic responders and histologic nonresponders.
Data values are reported as median (interquartile range) or mean ± standard deviation. p-values were determined using Wilcoxon signed rank sum test or paired t-test. p-value difference was determined using Wilcoxon-Mann-Whitney test or independent samples t-test.
MRI, magnetic resonance imaging; PDFF, proton density fat fraction.

Overall mean % change in MRI-PDFF estimated liver fat content between week 0 and week 24 in two groups with histologic responders on left (shown in red color bar), and histologic nonresponders (shown in blue color bar). There was a significant difference in fat change % from baseline between two groups (p < 0.004).
The histologic responders had a median decrease in hepatic steatosis grade of 1, which was significant compared with histologic nonresponders with no overall net change over 24 weeks (p = 0.01; supplementary online Table A). Similar decline was seen in hepatocellular ballooning median grade in histologic responders compared with nonresponders (p = 0.03; supplementary online Table A). There was no significant change in lobular inflammation scores and fibrosis scores over 24 weeks within groups as well as between groups (supplementary online Table A).
Secondary outcome: change in biochemical characteristics associated with histologic change
We compared the weight and biochemical profiles during the study time period between the two groups. The median weight decreased more in the histologic responders group compared with the nonresponders group but did not reach statistical significance (supplementary online Table B). Similarly, patients with a NAS change ⩾2 points had no significant changes in commonly tested laboratory values of AST, ALT and GGT (supplementary online Table B). Fatty liver index was calculated for the two groups and no significant difference was seen between two groups.
Discussion
In this secondary analysis of a randomized, double-blinded, placebo-controlled clinical trial with biopsy-proven NASH patients, we showed that patients with ⩾2-point reduction in NAS, without any worsening of fibrosis on liver biopsy, had a proportional reduction in MRI-PDFF of 29.3% from baseline which was statistically significant when compared with histologic nonresponders. Thus, this longitudinal study provides quantitative MRI-PDFF change data that are associated with changes in liver histologic parameters, which can be incorporated into the design of future trials as well as to assess for intervention effect in early phase NASH trial in which MRI- related fat change is being used as a treatment endpoint.
This study utilized an advanced, noninvasive, accurate MRI-based biomarker for liver fat quantification that has been shown to be superior to other noninvasive imaging biomarkers that are currently available such as ultrasound or CT scan-derived fat assessment. Liver biopsy remains the gold standard for diagnosis, classification, and monitoring of NASH. However, it has its limitations due to sampling variability, inter-observer rating variability and procedure-related risks and complications [Bravo et al. 2001; Ratziu et al. 2005]. Also, since liver biopsy only examines a small portion of liver, and fat distribution can be uneven in NASH, this can lead to errors in diagnosis as well as not detecting longitudinal changes in liver fat when utilized in clinical trials. Ultrasound imaging is low in cost and risks, with wide availability but it is limited due to operator-dependency and inability to quantify hepatic fat. CT imaging has associated risks of ionizing radiation exposure and lacks accuracy and precision for diagnosis of hepatic steatosis compared with MRI. Although MR spectroscopy is considered a reference standard for liver fat quantification, it is limited in applicability due to its requirements for expert analysis and low availability compared with MRI. MRS is also limited due to its sampling error since only a single, small voxel (2 × 2 × 2 cm3) is utilized for fat quantification whereas the MRI-PDFF technique provides fat mapping for the entire liver.
In prior studies conducted at our center and by others, the MRI-PDFF technique has been validated against both histology-determined steatosis grade data and MRS fat quantification and shown to be an accurate and precise method for hepatic fat quantification [Meisamy et al. 2011; Permutt et al. 2012; Idilman et al. 2013; Tang et al. 2013, 2015]. We have also shown that the MRI-PDFF-estimated liver fat technique is more sensitive in detecting longitudinal fat changes compared with liver histology, which supports its use in NASH clinical trials [Le et al. 2012; Noureddin et al. 2013].
Strengths and limitations
The study’s major strengths include: (1) the use of a cohort derived from a randomized, placebo-controlled, double-blinded study, the MOZART trial, which allowed for prospective, systematic longitudinal assessment of biopsy-proven NASH patients at baseline and 24 weeks, (2) the MR imaging performed under standardized protocol and processed under the supervision of a single, blinded radiologist, (3) similarly, the liver biopsies were reviewed using NASH-CRN histological scoring system by a single, blinded hepatopathologist, (4) extensive MR assessment of liver fat including precise quantification and fat mapping of the entire liver, and (5) comparison of colocalized ROIs for fat changes in each of nine liver segments between week 0 and week 24. Despite these strengths of our study, we recognize several limitations. This study was performed at a single site, the UCSD NAFLD Translational Research Unit, and validation at other clinical sites will be needed for wide generalizability. Despite the excellent precision and sensitivity of MRI-PDFF to detect hepatic steatosis, its utility in future clinical trials especially those including patients with advanced fibrosis may be limited and require further investigation in this specific population [Idilman et al. 2013]. Although histologic responders in the study experienced reduction in lobular inflammation scores, they did not reach statistical significance and it remains to be validated whether a 29% reduction in liver fat by MRI-PDFF could serve as marker of treatment response in NASH. In the current state, the application of MRI-PDFF will be limited by its cost and requirement of expert analysis. Furthermore, our study is limited by a small sample size and our results would need further validation by larger, multicenter cohort studies prior to utilization of MRI-PDFF technique in future NASH clinical trials.
Implications
MRI-PDFF is a novel and accurate biomarker to quantify liver fat. These data suggest that a relative reduction of 29% in MRI-PDFF may be associated with a clinically important improvement in liver histology in the setting of a clinical trial in NASH. MRI-PDFF has the potential to be a cost-effective and convenient method for liver fat quantification in future clinical trials, as it only requires a single 20-second breath hold and an estimated time of about 5 minutes in an MRI scanner suite. This study has implications for the design of future NASH clinical trials.
Footnotes
Acknowledgements
Janki Patel provided analysis and interpretation of data, statistical analysis, drafting of the manuscript, critical revision of the manuscript, and approved the final submission.
Ricki Bettencourt provided data analysis, critical revision of the manuscript, and approved the final submission.
Jeffrey Cui, Hamed Aryafar and Mark Valasek provided a critical revision of the manuscript and approved the final submission.
Jonathan Hooker, William Haufe and Catherine Hooker conducted analysis of MRI data, critically reviewed the manuscript, and approved the final submission.
Joanie Salotti, Archana Bhatt, Carolyn Hernandez, Lisa Richards and Phirum Nguyen conducted patient visits, critically reviewed the manuscript, and approved the final submission.
Claude Sirlin provided analysis and interpretation of data, drafting of the manuscript, MRI analysis, critical revision of the manuscript, and approved the final submission.
Rohit Loomba provided the study concept and design, analysis and interpretation of data, drafting of the manuscript, critical revision of the manuscript, obtained funding, study supervision, and approved the final submission.
Funding
This work was supported by an investigator-initiated study grant to Dr Rohit Loomba by Merck & Co. Inc., USA. The study was conducted at the Clinical and Translational Research Institute, UCSD. Dr Rohit Loomba is supported, in part, by the American Gastroenterological Association (AGA) Foundation, Sucampo, ASP Designated Research Award in Geriatric Gastroenterology and by a T. Franklin Williams Scholarship Award. Funding was provided by Atlantic Philanthropies, Inc., the John A. Hartford Foundation, OM, the Association of Specialty Professors, and the American Gastroenterological Association and the National Institutes of Health grant, K23-DK090303.
Conflict of interest statement
The study sponsors had no role in the study design, collection, analysis, interpretation of the data or drafting of the manuscript. All authors report that no conflicts of interest exist.
References
Supplementary Material
Please find the following supplemental material available below.
For Open Access articles published under a Creative Commons License, all supplemental material carries the same license as the article it is associated with.
For non-Open Access articles published, all supplemental material carries a non-exclusive license, and permission requests for re-use of supplemental material or any part of supplemental material shall be sent directly to the copyright owner as specified in the copyright notice associated with the article.
