Shengping Yang PhD, Gilbert Berdine MD
Corresponding author: Shengping Yang
Contact Information: Shengping.Yang@pbrc.edu
DOI: 10.12746/swjm.v13i55.1445
I am planning a network meta-analysis of four treatment options for asthma. Given the multiple available treatments, I intend to use a network meta-analytic approach and am exploring the use of the Surface Under the Cumulative Ranking Curve (SUCRA) to rank their relative effectiveness. Is SUCRA considered a standard approach in this context?
Network meta-analysis (NMA) is a powerful method for comparing multiple treatments simultaneously, especially when direct head-to-head comparisons are limited. The Surface Under the Cumulative Ranking Curve (SUCRA) is a widely accepted and standard approach for ranking treatments in NMA. It is a metric used to summarize the relative effectiveness or safety of competing interventions.
In evidence-based medicine, treatment comparisons often involve multiple competing interventions. Traditional pairwise meta-analyses are limited because they only compare two treatments at a time, whereas clinical decision-making typically requires evaluating all available options simultaneously.
To address this, NMA was developed. It integrates both direct evidence (from head-to-head trials) and indirect evidence (e.g., trials comparing A vs. B and B vs. C to infer A vs. C) into a unified framework. This allows for the coherent ranking of all treatments, even when some pairs have never been directly compared.1
As NMAs became more common, researchers sought intuitive ways to summarize treatment hierarchies. A metric allows comparison of a property between two objects at the same time or the same object at different times. A metric is a numeric scale such that if two objects have the same value for the metric, the two objects are considered equal in terms of the property being considered. If two objects have different values for the metric, the values can be used to compare, rank, or sort the objects. Simple metrics like “probability of being best” were limited because they ignored the full distribution of rankings (e.g., how often a treatment ranks second or third). The SUCRA method was developed to address this need and was first proposed by Salanti et al.2 It has since become a standard for summarizing treatment rankings in NMAs.
SUCRA is a quantitative metric derived from the cumulative ranking probabilities of treatments in a NMA. Unlike traditional effect-size metrics, such as mean differences or odds ratios, SUCRA captures the entire ranking distribution, providing a broader perspective on a treatment’s performance relative to others, in Bayesian or frequentist NMAs.4
Mathematical definition:
For a treatment i in an NMA comparing K competing treatments:
where Fi(r) = P(Rank ≤ r), is the cumulative probability that treatment i ranks at or above position r.3 Following are the steps for calculating SUCRA.
For each treatment, based on the NMA results (e.g., relative risks or odds ratios), derive the probability that the treatment ranks at each possible position.
For each treatment, compute the cumulative probability Fi(r), i.e., Rank ≤ r. In Bayesian NMAs, these probabilities can be derived from posterior MCMC samples. In frequentist NMAs, they can be estimated via bootstrapping or analytical approximations. For example, if there are 4 treatments (A, B, C, and D), treatment D might have the following cumulative probabilities based on the data: FD(1) = 0.05, FD(2) = 0.25, and FD(3) = 0.5, while FD(4) is always 1.
For the treatment D, SUCRAD is calculated using the formula above. For instance, in the example K = 4:
Note that for any of the treatments, SUCRAi represents the normalized area under the cumulative ranking curve for treatment i. A higher SUCRAi value indicates a better overall ranking of treatment i relative to competing treatments. A SUCRAi of 100% means treatment i is almost always ranked the best, while a SUCRAi of 0% means it is almost always ranked the worst. Meanwhile, a SUCRAi of, for example, 0.8 (80%) means the treatment has an 80% probability of being among the top options (when compared to all other treatments).
Figure 1 provides a graphical illustration of the example above. Since Fi(4) is always equal to 1 for any treatment, it is excluded from the SUCRA calculation and is also not shown in the figure. The SUCRA values for treatments A, B, C, and D are 0.83, 0.52, 0.38, and 0.27, respectively. Based on these values, treatment A is considered the most favorable option in this analysis.
Figure 1. Example SUCRA curves for four treatments. The shaded areas under each cumulative ranking curve correspond to the SUCRA values for the respective treatments.
SUCRA is a key metric derived from NMA, designed to simplify the interpretation of complex comparative effectiveness data. By converting multidimensional ranking distributions into a single numerical value (ranging from 0 to 1), SUCRA allows users to efficiently prioritize treatment options. Its primary applications include.
SUCRA helps translate NMA results into actionable recommendations by quantifying the likelihood that a treatment outperforms its competitors. For example, a guideline panel may favor Treatment A (SUCRA = 0.90) over Treatment B (SUCRA = 0.60) when efficacy is the primary concern.
In HTA, SUCRA complements traditional cost-effectiveness analyses (e.g., cost-per-QALY) by integrating multiple dimensions:
SUCRA is particularly valuable in public health settings, such as during a pandemic, where direct head-to-head comparisons are limited. It helps identify and prioritize promising interventions (e.g., vaccines or preventive measures) using both direct and indirect evidence.
SUCRA also aids formulary committees in selecting therapies for inclusion by emphasizing relative effectiveness across treatments, rather than absolute outcomes alone.
SUCRA values are commonly presented alongside visual tools to enhance interpretability:5 For example, rankograms illustrate the probability distribution of each treatment’s rank (e.g., “Treatment A has a 70% probability of being ranked 1st”); league tables show pairwise comparisons (e.g., odds ratios) with SUCRA rankings for context; and forest plots may include SUCRA values with confidence intervals, presenting both rankings and associated uncertainty.
While the utility of SUCRA is well recognized, several important considerations, both strengths and limitations, should guide its application and interpretation.
SUCRA offers several notable advantages:
Despite its strengths, SUCRA also has several limitations that warrant caution:
SUCRA is a valuable tool in the analysis and interpretation of network meta-analyses. By summarizing a treatment’s ranking probabilities into a single, easy-to-interpret value, SUCRA supports more informed decision-making in contexts where multiple interventions are compared.
However, as with any statistical summary, SUCRA is not without limitations. It should be used in conjunction with other outputs – such as effect sizes, credible intervals, GRADE assessments, and clinical considerations – to ensure comprehensive and nuanced interpretation. When applied appropriately, SUCRA can be a powerful aid in the synthesis of complex evidence landscapes.
Article citation: Yang S, Berdine G. The surface under the cumulative ranking curve. The Southwest Journal of Medicine 2025;13(55):35–38
From: Department of Biostatistics (SY), Pennington Biomedical Research Center, Baton Rouge, LA; Department of Internal Medicine (GB), Texas Tech University Health Sciences Center, Lubbock, Texas
Submitted: 4/14/2025
Accepted: 4/16/2025
Conflicts of interest: none
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.