If you don't remember your password, you can reset it by entering your email address and clicking the Reset Password button. You will then receive an email that contains a secure link for resetting your password
If the address matches a valid account an email will be sent to __email__ with instructions for resetting your password
Division of Gastroenterology, Centre for Autoimmune Liver Diseases, Department of Medicine and Surgery, University of Milano-Bicocca, Via Cadore 48, Monza, MB 20900, ItalyEuropean Reference Network on Hepatological Diseases (ERN RARE-LIVER), San Gerardo Hospital, Via Gian Battista Pergolesi 33, Monza 20900, ItalyBicocca Bioinformatics Biostatistics and Bioimaging Centre-B4, School of Medicine and Surgery, University of Milan-Bicocca, Via Cadore 48, Monza, MB 20900, Italy
Bicocca Bioinformatics Biostatistics and Bioimaging Centre-B4, School of Medicine and Surgery, University of Milan-Bicocca, Via Cadore 48, Monza, MB 20900, Italy
Division of Gastroenterology, Centre for Autoimmune Liver Diseases, Department of Medicine and Surgery, University of Milano-Bicocca, Via Cadore 48, Monza, MB 20900, ItalyEuropean Reference Network on Hepatological Diseases (ERN RARE-LIVER), San Gerardo Hospital, Via Gian Battista Pergolesi 33, Monza 20900, Italy
Division of Gastroenterology, Centre for Autoimmune Liver Diseases, Department of Medicine and Surgery, University of Milano-Bicocca, Via Cadore 48, Monza, MB 20900, ItalyEuropean Reference Network on Hepatological Diseases (ERN RARE-LIVER), San Gerardo Hospital, Via Gian Battista Pergolesi 33, Monza 20900, Italy
Division of Gastroenterology, Centre for Autoimmune Liver Diseases, Department of Medicine and Surgery, University of Milano-Bicocca, Via Cadore 48, Monza, MB 20900, ItalyEuropean Reference Network on Hepatological Diseases (ERN RARE-LIVER), San Gerardo Hospital, Via Gian Battista Pergolesi 33, Monza 20900, Italy
Division of Gastroenterology, Centre for Autoimmune Liver Diseases, Department of Medicine and Surgery, University of Milano-Bicocca, Via Cadore 48, Monza, MB 20900, ItalyEuropean Reference Network on Hepatological Diseases (ERN RARE-LIVER), San Gerardo Hospital, Via Gian Battista Pergolesi 33, Monza 20900, Italy
Division of Gastroenterology, Centre for Autoimmune Liver Diseases, Department of Medicine and Surgery, University of Milano-Bicocca, Via Cadore 48, Monza, MB 20900, ItalyEuropean Reference Network on Hepatological Diseases (ERN RARE-LIVER), San Gerardo Hospital, Via Gian Battista Pergolesi 33, Monza 20900, Italy
Division of Gastroenterology, Centre for Autoimmune Liver Diseases, Department of Medicine and Surgery, University of Milano-Bicocca, Via Cadore 48, Monza, MB 20900, ItalyEuropean Reference Network on Hepatological Diseases (ERN RARE-LIVER), San Gerardo Hospital, Via Gian Battista Pergolesi 33, Monza 20900, Italy
Department of Diagnostic Radiology, San Gerardo Hospital, ASST Monza, Via Gian Battista Pergolesi 33, Monza, MB 20900, ItalySchool of Medicine, University of Milano-Bicocca, via Cadore 48, Monza, MB 20900, Italy
Division of Gastroenterology, Centre for Autoimmune Liver Diseases, Department of Medicine and Surgery, University of Milano-Bicocca, Via Cadore 48, Monza, MB 20900, ItalyEuropean Reference Network on Hepatological Diseases (ERN RARE-LIVER), San Gerardo Hospital, Via Gian Battista Pergolesi 33, Monza 20900, Italy
Bicocca Bioinformatics Biostatistics and Bioimaging Centre-B4, School of Medicine and Surgery, University of Milan-Bicocca, Via Cadore 48, Monza, MB 20900, Italy
Bicocca Bioinformatics Biostatistics and Bioimaging Centre-B4, School of Medicine and Surgery, University of Milan-Bicocca, Via Cadore 48, Monza, MB 20900, Italy
Department of Radiology, ASST Papa Giovanni XXIII, piazza OMS, 1, Bergamo, BG 24127, ItalySchool of Medicine, University of Milano-Bicocca, via Cadore 48, Monza, MB 20900, Italy
Division of Gastroenterology, Centre for Autoimmune Liver Diseases, Department of Medicine and Surgery, University of Milano-Bicocca, Via Cadore 48, Monza, MB 20900, ItalyEuropean Reference Network on Hepatological Diseases (ERN RARE-LIVER), San Gerardo Hospital, Via Gian Battista Pergolesi 33, Monza 20900, Italy
Corresponding author at: Center of Autoimmune Liver Disease, University of Milan Bicocca, ERN Rare Liver. Via Cadore 48, 20900, Monza, Italy. Telephone number: +390392334515.
Division of Gastroenterology, Centre for Autoimmune Liver Diseases, Department of Medicine and Surgery, University of Milano-Bicocca, Via Cadore 48, Monza, MB 20900, ItalyEuropean Reference Network on Hepatological Diseases (ERN RARE-LIVER), San Gerardo Hospital, Via Gian Battista Pergolesi 33, Monza 20900, Italy
Magnetic resonance cholangiopancreatography (MRCP) is the gold standard for diagnosis of patients with primary sclerosing cholangitis (PSC). The semi-quantitative MRCP-derived Anali scores proposed for risk stratification, have poor-to-moderate inter-reader agreement.
Aims
To evaluate the prognostic performance of quantitative MRCP metrics in PSC.
Methods
This is a retrospective study of PSC patients undergoing MRCP. Images were processed using MRCP+ software (Perspectum Ltd, Oxford) that provides quantitative biliary features, semi-automatically extracted by artificial intelligence-driven analysis of MRCP-3D images. The prognostic value of biliary features has been assessed for all hepato-biliary complications.
Results
87 PSC patients have been included in the analysis. Median follow-up from MRCP to event/censoring of 30.9 months (Q1-Q3=13.6–46.6). An adverse outcome occurred in 27 (31.0%) patients. The number of biliary strictures (HR=1.05 per unit, 95%CI 1.02–1.08, p < 0.0001), spleen length (HR=1.16 per cm, 95%CI 1.01–1.34, p = 0.039), adjusted for height, age at MRCP, and time from diagnosis to MRCP predicted higher risk of hepatobiliary complications. These were incorporated into a the quantitative MRCP-derived PSC (qMRCP-PSC) score (C-statistic=0.80). After 3-fold cross-validation, qMRCP-PSC outperformed the Anali score in our cohort (C-statistic of 0.78 vs 0.64) and enabled the discrimination of survival of PSC patients (log-rank p < 0.0001).
Conclusions
The qMRCP-PSC score identified patients at higher risk of hepatobiliary complications and outperformed the available radiological scores. It represents a novel quantitative biomarker for disease monitoring and a potential surrogate endpoint for clinical trials.
Primary sclerosing cholangitis (PSC) is a rare, chronic immune-mediated disease of the biliary tree. The hallmark of disease is focal fibro-inflammatory strictures and secondary dilations of the biliary tree, which are detected using magnetic resonance cholangiopancreatography (MRCP). These can cause chronic cholestasis that, in the long term, may lead to biliary cirrhosis and increased risk of biliary and gallbladder neoplasia. To date, there is no disease-modifying therapy in PSC and, in case of biliary strictures not amenable to endoscopic therapy and causing recurrent bacterial cholangitis, end-stage liver disease, or intractable pruritus, the only therapeutic option is liver transplantation (LT) [
A major unmet need in PSC is the lack of established biomarkers of disease activity which enable risk stratification and evaluation of treatment benefit – this prevents an accurate evaluation of efficacy for several novel molecules under investigation. Prognostic models for PSC have been developed which combined markers of disease progression, but no single method can be recommended at present to predict individual patient prognosis [
]. Although alkaline phosphatase (ALP) elevation is characteristic in PSC and its reduction is often a major endpoint in clinical trials, its fluctuation over a short-to-medium time makes it an unreliable disease marker [
Inter- and Intra-individual variation, and limited prognostic utility, of serum alkaline phosphatase in a trial of patients with primary sclerosing cholangitis.
MRCP is essential for diagnosis, staging, and monitoring the disease. However, its prognostic role in PSC has been hampered by the qualitative or semi-quantitative assessment of the biliary tree, often subject to inter-observer variability. Two semi-quantitative scores based on magnetic resonance imaging (MRI) and MRCP (i.e. the Anali scores with and without gadolinium) showed good prognostic power in PSC [
]. These scores included only radiological variables selected to evaluate both biliary tree characteristics, portal hypertension signs and hepatic morphology, since biochemical markers of cholestasis (except for high serum bilirubin in Anali score without gadolinium) were not independently associated with the outcome. However, despite their good performance in outcome prediction, their use in clinical practice is hampered by a poor-to-moderate intra-observer agreement, as they were built relying on the qualitative nature of MRCP [
]. Semi-automated quantitative assessment of biliary features derived from MRCP images may be a more objective and reproducible approach to evaluate the radiological staging and progression of disease and may represent a promising non-invasive biomarker for outcome prediction in PSC.
A new software for the quantitative analysis of the biliary tree has been recently developed (MRCP+, Perspectum Ltd., Oxford) with this purpose. MRCP+ generates biliary metrics in a semi-automated way by post-processing conventional MRCP images. Artificial intelligence(AI)-driven pathfinding and tubular enhancement algorithms reconstruct a 3D model of the biliary tree and calculate a series of novel quantitative measures for the direct assessment of ductal anatomy. To date, quantitative cholangiographic metrics have never been assessed towards clinical endpoints.
The aims of this study were: (i) to identify quantitative cholangiographic metrics, obtained from routinely collected MRCP images using an AI-derived tool (MRCP+), that are predictive of clinical outcomes of PSC patients; (ii) to compare their prognostic performance with the current available MRCP-derived radiological score (Anali score).
2. Methods
2.1 Study design
This was a retrospective cohort study of PSC patients from two Italian tertiary liver referral centres (San Gerardo Hospital, Monza; ASST Papa Giovanni XXIII, Bergamo). Consecutive PSC patients with at least one MRCP with available MRCP-3D sequences performed in the referral center between Jan-2012 and Dec-2019 were enrolled in the study. The study is conformed to the ethical guidelines of the 1975 Declaration of Helsinki and the protocol was approved by the institutional review board with a waiver of written informed consent. Consent was verbally obtained from participants; however, they were not required to sign it. All radiological and clinical data have been anonymised before being analysed.
2.2 Participant selection
Inclusion criteria were the following: age at inclusion ≥ 16 years, diagnosis of large duct PSC according to the European guidelines [
]. The MRCP closest to the diagnosis of PSC for each patient was considered for analysis. Patients with overlap syndrome with autoimmune hepatitis (AIH) were included in the analysis. Only patients with follow up longer than 3 months after study entry were included in the study.
Exclusion criteria were as follows: small duct PSC, previous surgery on biliary tree or LT, MRCP performed on 1 Tesla machine, hepatic comorbidities (viral hepatitis, alcoholic liver disease, or non-alcoholic steatohepatitis), secondary sclerosing cholangitis, hepatic or biliary neoplasms, and decompensated cirrhosis or history of biliary complications (as defined below) at the time of MRCP acquisition.
2.3 MRCP acquisition
All the MRI exams of PSC patients performed using 1.5 or 3 Tesla MRI scanners (Ingenia 1.5T Philips, Achieva 1.5T Philips, Optima MR450w 1.5T GEM, Discovery MR750w 3T GEM) that comprises with T1 and T2-weighted and 3-dimensional isotropic MRCP sequences and good quality images, as defined by an expert biliary radiologist, were included in the analysis. All the patients attended the MRCP exam using the hospital protocol (same in both the centres) after at least 6 h fasting and administration of blueberry juice to reduce signal intensity of overlapping fluid withing the upper gastrointestinal organs. MRCP sequences of PSC patients were read by an expert biliary radiologist. For each patient, spleen maximum coronal diameter (cm) was measured, and presence or absence of gallbladder was derived.
Given the retrospective nature of the study, contrast media was different among patients (according to the radiologist preference or to patients’ age), and not always administered. For this reason, Anali score without gadolinium was the only score calculated and considered for the analysis in our cohort. MRCP were scored by an expert biliary radiologist blinded to clinical data according to Ruiz et al. [
Radiologic course of primary sclerosing cholangitis: assessment by three-dimensional magnetic resonance cholangiography and predictive features of progression.
MRCP-3D sequences were uploaded on a dedicated online portal to Perspectum Ltd. after a complete anonymization and were processed using MRCP+ software. A detailed table of the analytic steps of MRCP+ has been provided in the supplementary materials (Suppl. Table 1). Derivation of quantitative parametric 3D models of the biliary tree was performed after enhancement of tubular structures using multi-scale hessian analysis, intelligent search, and novel duct modeling algorithms. The measurements of duct diameters, duct length, biliary tree and gallbladder volumes were obtained semi-automatically by MRCP+ [
]. Candidate strictures and dilations were defined automatically by MRCP+ based on the absolute and relative changes in diameter along duct profile according to definitions reported in Fig. 1. Metrics were derived for individual ducts and for the whole biliary tree after excluding the cystic and pancreatic ducts (Fig. 1). Potential predictor MRCP+-metrics were chosen based on earlier studies and clinical experience [
Evaluation of quantitative MRCP (MRCP+) for risk stratification of primary sclerosing cholangitis: comparison with morphological MRCP, MR elastography, and biochemical risk scores.
Clinical data were collected retrospectively from patients records in each center. For all patients the following variables have been collected: date of birth, age at diagnosis, height, age at MRCP acquisition, date of diagnosis, date of last follow-up, presence and type of IBD, diagnosis of AIH overlap, diagnosis of cirrhosis (defined on histological specimen or on typical radiological characteristics), presence of gastro-oesophageal varices and disease phenotype (i.e., classic/small duct PSC).
2.6 Endpoints
Patients were observed from the time of the first available MRCP to a clinical endpoint or censoring. We assessed the performance of the MRCP+ metrics towards all the hepato-biliary complications, defined as the earliest occurring event among the following: (1) listing for LT; (2) liver-related death; (3) diagnosis of portal hypertensive complications such as ascites requiring hospital admission or diuretic therapy and hepatic encephalopathy requiring hospital admission; (4) gastroesophageal bleeding; (5) diagnosis of biliary complications (biliary strictures requiring dilation, stenting or external drainage, or hospitalization for acute bacterial cholangitis); (6) diagnosis of cholangiocarcinoma.
2.7 Statistical analysis
Categorical data are presented as numbers (percentages) while continuous variables are expressed as medians, interquartile ranges(Q1-Q3) unless otherwise stated. Spearman correlation coefficient was used to measure association between continuous variables. All tests were 2-sided with a level of significance of 0.05 was considered.
Cox proportional hazards regression was applied to examine associations between covariates and endpoints defined above. The results were expressed as hazard ratios(HR) and 95% confidence intervals(CI).
For the multivariable model, selection of covariates was guided both by results of the univariate analysis and by known clinical importance.
Spline functions have been used to explore the shape of the HR of clinical and radiologic continuous variables. The Proportional Hazards (pH) assumption for all variables was also checked using plots of scaled Schoenfeld residuals against time. The discriminative ability of single MRCP+-derived metrics and of multivariable models to identify individuals at higher risk of developing each endpoint was assessed using Harrel's C statistic internally validated using k-fold cross-validation.
A time-dependent receiver operating characteristic (TD-ROC) curve was estimated to assess the ability of the score, derived from the multivariable Cox model, to predict events within 24 months of follow-up. The optimal cut-off was chosen maximizing the Youden Index. To visualize the discrimination ability provided by the predictive score, Kaplan-Meier survival rates were calculated and compared among two groups defined according to the cut-off chosen using the TDROC curve. Log-Rank test has been used to compare the survival curves between the two groups. Statistical analysis was performed with R software version 4.1.2.
3. Results
3.1 Patients characteristics
One-hundred patients with PSC who performed at least one MRCP in the study period were enrolled. Among them, 13 patients were excluded due to low data quality. Therefore, 87(87.0%) patients were considered for the analysis. Demographic, clinical, and radiological characteristics of PSC patients are presented in Table 1. The median age at diagnosis and at MRCP was 32 (Q1-Q3 23–47) and 40 years (Q1-Q3 26–52), respectively, and 60.9% were male. The median time from diagnosis to first hepatobiliary complications/censoring was 78.3 months [Q1-Q3 37.7–132.6], while the median time from MRCP to first hepatobiliary complication/censoring was 30.9 months [Q1-Q3 13.6–46.6]. A hepatobiliary complication occurred in 27 patients (31.0%); of these, 8 (29.6%) underwent LT, 3 (11.1%) died due to a liver-related cause, 2 (7.4%) developed cirrhosis complications and, 14 (51.9%) had biliary complications/neoplasia (Table 1). Among patients who underwent LT, indications were in 4 jaundice secondary to end-stage biliary disease, in 3 end-stage liver disease and in 1 intractable pruritus. The quantitative cholangiographic metrics analysed and their distribution in the cohort are represented in Table 2.
Table 2Definition and statistical description of selected MRCP+ derived variables.
MRCP + variable
Definition
Median [Q1-Q3]
Biliary tree volume
Total measured biliary tree volume (ml)
10.30 [5.20, 21.95]
Duct number
Total number of modelled ducts (n)
60.00 [31.00, 141.00]
Candidate stricture
A local minimum in modelled duct widths, whose diameter is narrowed with respect to its neighbours (in diameter), by more than 1 mm in absolute terms and more than 30% in relative terms.
Number of strictures
The total number of areas of decreasing width (strictures)
Sum of severity score for strictures. Composite score are computed as the product of the relative severity (expressed as fraction) and the measured length in mm (Fig. 1)
28.07 [12.68, 51.18]
Candidate dilations
A bile duct dilation with at least 1 mm absolute increase and at least 30% relative increase in diameter, compared to its closest (in diameter) neighbouring local minimum.
Number of dilations
The total number of areas of increasing width (candidate dilatations).
The sum of the absolute severity of the dilations (Fig. 1)
21.77 [9.03, 55.85]
Sum of relative dilations’ severity (%)
The sum of relative dilations’ severity
981.21 [379.18, 2206.01]
Sum of dilations’ score
The sum of severity score for dilations. Composite score are computed as the product of the relative severity (expressed as fraction) and the measured length in mm (Fig. 1)
65.69 [30.04, 138.53]
Number of ducts with strictures AND dilations
The number of ducts containing both one or more candidate strictures and one or more candidate dilatations.
5.00 [2.00, 10.00]
Number of ducts with strictures OR dilations
The number of ducts containing one or more candidate abnormalities (strictures or dilatations)
15.00 [6.50, 32.50]
Sum of lengths of abnormal biliary tree tracts
The total length (mm) of detected candidate abnormalities (Sum of strictures’ lengths + Sum of dilations’ lengths
3.2.1 Prognostic performance for hepatobiliary complications
All the MRCP+ metrics had a strong correlation with each other (Spearman coefficient>0.8; Suppl. Fig. 1). Among the MRCP+ metrics, all showed a good prognostic performance at the univariable analysis when tested against all the hepato-biliary complications (Table 3). All MRCP+ metrics and available clinical variables were taken forward for multivariable modeling.
Table 3Cox regression analysis for all hepatobiliary complications in PSC cohort.
Despite the performance of all variables looking at different pathological aspects of biliary tree (e.g. biliary tree volume and number of dilations, Suppl. Table 2 and Suppl. Fig. 2), showing high degree of correlation, to avoid multicollinearity we have selected in the final model only the number of strictures. This variable was selected based on its higher accuracy in the multivariable model respect to the other quantitative biliary features, easier interpretation for clinicians and radiologists, and the robust evidence on its prognostic role [
Evaluation of quantitative MRCP (MRCP+) for risk stratification of primary sclerosing cholangitis: comparison with morphological MRCP, MR elastography, and biochemical risk scores.
In addition, since portal hypertension has largely been associated with outcome in PSC, we explored the prognostic role of spleen length. As this is significantly influenced by height, age and sex we adjusted the model accordingly [
Spleen size is significantly influenced by body height and sex: establishment of normal values for spleen size at us with a cohort of 1200 healthy individuals.
]; however considering the relatively low number of events and the scarce added value in terms of accuracy of the sex, we kept only the height and age at the time of MRCP for the adjustment. Finally, given that MRCP has not been performed at diagnosis for all the patients we adjusted the model with the ‘time from diagnosis to MRCP’.
The final multivariable Cox model, taking correlation structure among covariates and clinical interpretation of their effects into account, included the following variables: number of strictures in biliary tree (p < 0.001), spleen length (p = 0.029), height (p = 0.774), age at MRCP (p = 0.419) and time from diagnosis to MRCP (p = 0.724) (Table 3). Spline functions representing shape of the HR of number of strictures and spleen length are represented in Fig. 2. Scaled Schoenfeld residuals plots to evaluate the plausibility of pH assumption are represented in Supplementary Fig. 3.
Fig. 2Cox model HR by spline functions from the qMRCP-PSC score to explore the shape of the association between the number of strictures (A) and spleen length (B) and the endpoint. Despite the small sample size and the consequent large CI, the assumption of a positive linear association between each of the two covariates and the log-hazard of event is tenable.
3.2.2 Derivation of a quantitative MRCP-derived PSC score
We used the regression coefficients of the selected variables (Table 3) to develop a predictive score of all hepatobiliary complications for each patient according to the following formula:
Quantitative MRCP-derived PSC (qMRCP-PSC) score: Spleen length (cm) * 0.1552 + number of strictures (n) * 0.0509 + age at MRCP (years) * 0.0107 + height (10 cm) * (−0.067) + time from diagnosis to MRCP (months) * 0.0010.
The score demonstrated high discrimination ability with a Harrell's c of 0.80 (Table 3). It ranged from 0.74 to 4.53 with a median of 1.85, Q1-Q3 1.51–2.37, and a mean of 1.92.
To allow for a visual assessment of discriminative power of the qMRCP-PSC score across the study cohort, risk groups were built using as cut-off the one obtained by maximized Youden Index calculated on the time dependent ROC at 24months which was 2.08 (Suppl. Fig. 4). Fig. 3 offers an intuitive depiction of variation in prognosis between patients with different scores according to the derived cut-off (log-rank p < 0.0001).
Fig. 3Kaplan-Meier curves for hepatobiliary-complication-free survival A) in the overall cohort, B) by groups according to qMRCP-PSC score split using maximized Youden Index cut-off derived by time dependent ROC curve (Suppl. Fig. 4) (2.08). Log Rank < 0.0001.
3.3 Cross-validation of qMRCP-PSC score and comparison with Anali score
The qMRCP-PSC score was internally validated with a 3-fold cross-validation approach resulting in a C-statistic of 0.78. This outperformed the Anali score without gadolinium, used as dichotomous variable considering values higher than 2, demonstrating to identify high risk patients, which showed a prognostic performance against hepatobiliary complications (C-statistic=0.64, HR of 3.32 [CI 95% 1.53–7.19], p = 0.002) [
Finally, we explored the prognostic role of the qMRCP-PSC co-variates also toward hard endpoints (i.e., LT, liver-related death, and hepatic decompensation) without adjusting for height, age at MRCP, and time from diagnosis to MRCP due to the lower number of events [13(14.9%)]. The number of strictures and spleen length remain both independently associated with hard outcome development with HR=1.06 ([CI 95 1.02–1.10], p = 0.0009) and HR=1.34 ([CI 95 1.13–1.61], p = 0.004) with good discriminatory power (C-statistic=0.76)(Suppl. Table 3) also after 3-fold cross validation (C-statistic=0.72). When Anali without gadolinium using values higher than two was tested towards hard outcomes, prognostic prediction was increased with a C-statistic=0.76 (HR=11.67 [CI 95% 2.58–52.69], p = 0.001).
4. Discussion
We developed and internally validated a quantitative MRCP-based model for medium-term prediction of hepatobiliary complications in patients with PSC. The scoring system incorporates quantitative and reproducible metrics semi-automatically extracted from MRCP-3D sequences, and spleen length, adjusted for age, height, and time from diagnosis to MRCP.
The qMRCP-PSC score was superior to the Anali without gadolinium in predicting overall hepatobiliary complications with C-statistic of 0.80 and 0.78 in the derived model and after cross-validation, respectively. Several reasons might explain its stronger performance. Radiological variables were treated as continuous, overcoming the limitation of using qualitative or semi-quantitative variables both in terms of reproducibility and intra-observer agreement [
]. The qMRCP-PSC score incorporates not only a quantitative assessment of biliary tree, but quantifies portal hypertension with a continuous variable, i.e. the spleen length, which often enlarges in PSC even before cirrhosis develops, and has previously been shown as a major prognostic parameter [
We do acknowledge that the comparison of prediction accuracy of Anali score without gadolinium with that of qMRCP-PSC score might not be entirely fair since the former has been developed toward hard events and without considering biliary complications; however, to date this is the gold-standard as radiological score. Moreover, our model is comparable to the Anali score even if only LT, liver-related death and hepatic decompensation are considered.
To our knowledge, MRCP+ is the only software available for the quantitative analysis of biliary tree, strictures and dilations, using AI-derived sequences from MRCP-3D. It has been demonstrated to be accurate in distinguishing healthy controls from PSC patients and to have a good-to-excellent interobserver agreement for strictures and dilations metrics [
There are no studies exploring the prognostic role of MRCP+ metrics toward clinical outcomes in PSC. Two studies explored the relation between MRCP+ metrics and estimates of outcomes by prognostic scores (e.g. Mayo Risk Score[MRS] and Amsterdam-Oxford model[AOM]) [
]. Selvaraj et al. identified a good positive correlation between the severity of intrahepatic dilations and surrogate markers of disease severity. Differently with respect to our study, the number and severity of strictures were not found to be positively correlated with disease severity nor to have significant risk stratification ability. Conversely, Ismail et al. showed a strong correlation of the total number of segmental strictures with MRS [
Evaluation of quantitative MRCP (MRCP+) for risk stratification of primary sclerosing cholangitis: comparison with morphological MRCP, MR elastography, and biochemical risk scores.
]. The difference with the former study may rely on the distinct analytical approach. The Oxford group has normalized the biliary metrics by the total length of the biliary tree, to account for variable lengths of biliary tree among different patients. This might have introduced a bias; indeed, the presence of a stricture could lead to a dilation of the upstream bile ducts, resulting in an increase of the whole biliary tree visualized at MRCP. Indeed, we found in our cohort a linear correlation between the sum of bile duct lengths and the number and severity of both strictures and dilations (Suppl. Fig. 5).
The key variables included in the model were the number of strictures and spleen length. The former is a marker of the biliary damage, which shows a strong correlation with the other biliary metrics explored by MRCP+, reflecting that they quantify all the same pathological process. Similarly to our findings, in AOM, which is based on biliary tree aspect visualised during endoscopic cholangiography, the presence of strictures of intrahepatic and extrahepatic ducts were associated with a worst prognosis over time [
The role of magnetic resonance imaging and endoscopic retrograde cholangiography in the evaluation of disease activity and severity in primary sclerosing cholangitis.
]. In our cohort, the spleen length is independently associated with outcome (Table 3) and it remains significant also in multivariable models using other MRCP-derived metrics as co-variates (Suppl. Table 2, Suppl. Fig. 2).
Patients with PSC tend to develop end-stage liver disease within 10–20 years from diagnosis [
]. The long time for development of hard outcomes (i.e. LT or liver-related mortality) hampers drug development. In any clinical trial with a therapy aimed at modifying the clinical course of PSC, intermediate events need to be included [
]. Biliary complications requiring endoscopic treatment and recurrent biliary infections requiring hospitalization compete with liver fibrosis-related complications in determining future liver outcomes. In addition, patients at early stage of disease may present with an insignificant amount of fibrosis respect to the evidence of advanced biliary strictures and dilations at MRCP. Therefore, in this study we chose to explore together with hard endpoints (i) portal hypertensive complications and (ii) biliary complications. By using only MRCP-derived variables, it is possible to stratify patients into high and low risk of development of all hepatobiliary complications in a medium to short-term follow-up (Fig. 3).
Finally, MRCP+ might have a role also in the early detection of CCA, one of the most worrisome complications of PSC. Unfortunately, only 2 patients in our cohort developed CCA (one intra-hepatic and the other peri‑hilar). Both showed a qMRCP-PSC score higher than 2.08 (3.21 and 2.23), however, considering the heterogeneity of the disease and of CCA phenotype and presentation in PSC patients, a more appropriate study, designed with this aim, might be more helpful in evaluating this potential application of the software.
Our study has some limitations. The median follow-up from the MRCP to event/censoring is relatively short with 30.9 months (Table 1). However, considering that biliary complications occur earlier in PSC natural history and tend to re-occur becoming a LT indication, a score estimating medium-term outcomes might be useful both in clinical practice and clinical trials to predict intermediate event. Our study is retrospective and performed in two hospitals using two different MRI scanners and, despite the same acquisition protocols, this might have introduced a bias. However, the MRCP-derived metrics of the two groups when compared did not reveals significant differences (data not shown). Indeed, it might be seen as MRCP+ software strength which could analyze images performed from different MRI scans and, thus, more adaptable to clinical practise. We did not include biochemical parameters to adjust our model since, given the retrospective nature of the study, many of the laboratory investigations were not exactly correspondent to the time of the scanning. However, considering the weakness of liver function tests in prediction in PSC and their fluctuation over time [
Inter- and Intra-individual variation, and limited prognostic utility, of serum alkaline phosphatase in a trial of patients with primary sclerosing cholangitis.
], we anticipate that they would not have a major impact on the discriminative power of the MRCP-based model. Finally, our study does not have an external validation cohort; however, by using a cross-validation approach we are confirm the qMRCP-PSC demonstrates promising out-of-sample generalisability.
In conclusion, the qMRCP PSC model, which includes the number of strictures and the spleen length, can identify PSC patients at higher risk of adverse outcome and outperforms the available radiological score. This represents a novel biomarker for disease monitoring and a potential surrogate endpoint for clinical trials. Further studies, with prospective, standardized data collection and larger patient's cohorts, are needed for confirmation.
Grant support
This research was partially supported by the Italian Ministry of University and Research (MIUR) - Department of Excellence project PREMIA (PREcision MedIcine Approach: bringing biomarker research to clinic) and PRIN 2020 DEMO:Diseases modeling and early therapeutic tools in chronic inflammatory cholangiopathies and primary liver cholangiocarcinoma.
CRediT authorship contribution statement
Laura Cristoferi: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Resources, Visualization, Writing – original draft. Marco Porta: Conceptualization, Data curation, Writing – review & editing. Davide Paolo Bernasconi: Formal analysis, Methodology, Visualization, Supervision, Writing – review & editing. Filippo Leonardi: Resources. Alessio Gerussi: Data curation, Writing – review & editing. Giacomo Mulinacci: Data curation, Writing – review & editing. Andrea Palermo: Data curation, Writing – review & editing. Camilla Gallo: Data curation, Writing – review & editing. Miki Scaravaglio: Data curation, Writing – review & editing. Eliana Stucchi: Data curation, Writing – review & editing. Cesare Maino: Data curation, Writing – review & editing. Davide Ippolito: Data curation, Writing – review & editing. Daphne D'Amato: Data curation, Writing – review & editing. Carlos Ferreira: Software. Alessandra Nardi: Supervision, Formal analysis. Rajarshi Banerjee: Software. Maria Grazia Valsecchi: Supervision, Writing – review & editing. Laura Antolini: Supervision, Writing – review & editing. Rocco Corso: Supervision, Writing – review & editing. Sandro Sironi: Supervision, Writing – review & editing. Stefano Fagiuoli: . Pietro Invernizzi: . Marco Carbone: Conceptualization, Formal analysis, Investigation, Methodology, Visualization, Supervision, Writing – review & editing.
Declaration of Competing Interest
RB is CEO at Perspectum Ltd, CF is employed by Perspectum Ltd. MC is a consultant without fee at Perspetum Ltd. LC, MP, DPB, FL, GM, AP, CG, MS, ES, CM, DI, DD, AN, MGV, LA, RC, SS, SF and PI have nothing to disclose.
Inter- and Intra-individual variation, and limited prognostic utility, of serum alkaline phosphatase in a trial of patients with primary sclerosing cholangitis.
Radiologic course of primary sclerosing cholangitis: assessment by three-dimensional magnetic resonance cholangiography and predictive features of progression.
Evaluation of quantitative MRCP (MRCP+) for risk stratification of primary sclerosing cholangitis: comparison with morphological MRCP, MR elastography, and biochemical risk scores.
Spleen size is significantly influenced by body height and sex: establishment of normal values for spleen size at us with a cohort of 1200 healthy individuals.
The role of magnetic resonance imaging and endoscopic retrograde cholangiography in the evaluation of disease activity and severity in primary sclerosing cholangitis.