U.S. flag

An official website of the United States government

NCBI Bookshelf. A service of the National Library of Medicine, National Institutes of Health.

Guidelines for the Prevention, Care and Treatment of Persons with Chronic Hepatitis B Infection. Geneva: World Health Organization; 2015 Mar.

Cover of Guidelines for the Prevention, Care and Treatment of Persons with Chronic Hepatitis B Infection

Guidelines for the Prevention, Care and Treatment of Persons with Chronic Hepatitis B Infection.

Show details

4RECOMMENDATIONS: NON-INVASIVE ASSESSMENT OF LIVER DISEASE STAGE AT BASELINE AND DURING FOLLOW UP

Recommendations

APRI (aspartate aminotransferase [AST]-to-platelet ratio index) is recommended as the preferred non-invasive test (NIT) to assess for the presence of cirrhosis (APRI score >2 in adults) in resource-limited settings. Transient elastography (e.g. FibroScan) or FibroTest may be the preferred NITs in settings where they are available and cost is not a major constraint. (Conditional recommendation, low quality of evidence)

a

This recommendation was formulated assuming that liver biopsy is not a feasible option.

4.1. Background

The spectrum of liver disease in persons with CHB ranges from minimal fibrosis to cirrhosis and HCC. Compensated cirrhosis may progress over time to decompensated cirrhosis, which is associated with the potentially life-threatening complications of ascites and spontaneous bacterial peritonitis, oesophageal varices and bleeding, hepatic encephalopathy, sepsis and renal failure. Persons with cirrhosis, including those with clinical decompensation, need antiviral therapy as a priority in order to prevent further disease progression. While the diagnosis of decompensated cirrhosis is based on clinically obvious features, this is not always the case for compensated cirrhosis. Identifying persons with cirrhosis or advanced CHB in need of treatment is generally based on a combined assessment of clinical features (including hepatomegaly and splenomegaly), the level and ratio of aminotransferases, and other relevant tests, such as albumin and platelet counts, HBV DNA viral load, the degree of fibrosis and/or necroinflammation on liver biopsy or NITs, and liver imaging.

Liver biopsy: Liver biopsy is considered the gold standard method to stage liver disease and assess for the degree of fibrosis, but it is not widely used in resource-limited settings because of its high cost, invasiveness, patient discomfort, risk of complications, sampling error, as well as the need for expert histological interpretation. Several liver biopsy scoring systems have been developed, of which the METAVIR system (Table 4.1), Knodell and Ishak scores (1) are the most widely used.

TABLE 4.1. METAVIR liver-biopsy scoring system.

TABLE 4.1

METAVIR liver-biopsy scoring system.

Non-invasive tests (NITs): Several non-invasive fibrosis tests based on blood or serum indices (APRI, FIB-4 and a commercial assay – FibroTest,) or ultrasound principles (transient elastography [e.g. FibroScan]) (Table 4.2) are now available and increasingly used for evaluating and staging liver fibrosis, which reduces the need for liver biopsy in persons with an established cause of liver disease. The use of accurate and validated NITs in resource-limited settings could help with the optimal selection of persons with CHB for antiviral therapy.

TABLE 4.2. Selected non-invasive tests to assess for stage of liver fibrosis.

TABLE 4.2

Selected non-invasive tests to assess for stage of liver fibrosis.

Blood tests such as the APRI and FIB-4 scores consist of indirect markers of fibrosis such as ALT, AST and platelet count (Figure 4.1), which are more readily available in LMICs, are associated with lower costs, do not require particular expertise in their interpretation, and can be performed in an outpatient setting. Other serum tests such as FibroTest are patented and must be performed in laboratories that meet certain quality standards, and are therefore more expensive and less readily available. Not all of these tests can assess all stages of fibrosis/cirrhosis. For example, APRI has been validated for the diagnosis of both significant fibrosis and cirrhosis, while FIB-4 has not been validated for the diagnosis of cirrhosis. These markers of fibrosis have a high specificity but low sensitivity for significant fibrosis and cirrhosis at their specific cut-off ranges and, therefore, many persons with advanced fibrosis and cirrhosis are missed.

FIGURE 4.1. APRI and FIB-4 calculations.

FIGURE 4.1

APRI and FIB-4 calculations. For APRI, ULN signifies the upper limit of normal for AST in the laboratory where these investigations were undertaken. For example, in a patient with an AST of 82 IU/L (where laboratory ULN for AST is 40 IU/L) and a platelet (more...)

More recently, new techniques that measure liver stiffness have been developed based on ultrasound technology. Of such tests, transient elastography performed with FibroScan (Echosens, Paris) has been the most widely evaluated (Figure 4.2). It is non-invasive, takes less than 10 minutes to perform, can be undertaken in outpatient or community settings, and health-care staff can be easily trained in its use. Factors that limit the use of transient elastography include the high cost of the equipment, the need for preventive and corrective maintenance (regular service/recalibration) and trained operators, and the lack of extensively validated cut-off values for specific stages of fibrosis. Other elastography techniques include 2-D acoustic radiation force impulse imaging (ARFI) and shear-wave elastography. ARFI and shear-wave elastography are similar in principle to transient elastography, and have been incorporated into new ultrasound imaging machines. However, they require more operator training and expertise than FibroScan.

FIGURE 4.2. Operation of transient elastography (FibroScan).

FIGURE 4.2

Operation of transient elastography (FibroScan). Source: http://www.myliverexam.com/en/lexamen-fibroscan.html

4.2. Summary of the evidence

Question: The purpose of the evidence review (see Web appendix 2: SR4) was to compare the diagnostic accuracy and performance of different NITs (APRI, FIB-4, FibroTest and transient elastography [e.g. FibroScan]) in diagnosing cirrhosis and significant liver fibrosis in persons with CHB compared to liver biopsy as the reference standard. Outcomes were the sensitivity, specificity, and positive and negative predictive values of NITs, using defined index test cut-off points for the detection of cirrhosis (stage F4) and significant fibrosis (stage ≥F2) based on the METAVIR staging system. As the presence of cirrhosis was considered a priority criterion for initiation of antiviral therapy, the primary outcome assessment in the review was for diagnosis of cirrhosis (F4).

NIT cut-off values for the detection of cirrhosis and significant fibrosis

The optimal cut-off values for different NITs that correlate with specific stages of liver fibrosis have been derived and (in the case of APRI and FIB-4) also validated. APRI and FIB-4 use two cut-off points for diagnosing specific fibrosis stages, as the use of a single cut-off would result in suboptimal sensitivity and specificity. A high cut-off with high specificity (i.e. fewer false-positive results) is used to diagnose persons with fibrosis (i.e. greater than or equal to a particular stage [e.g. ≥F2]), and a low cut-off with high sensitivity (i.e. fewer false-negative results) to rule out the presence of a particular stage of fibrosis. Some persons will fall in the indeterminate range of test results (i.e. their score will be between the low and the high cut-off) and will need future re-testing and evaluation. Transient elastography (FibroScan) has a range of values between 0 and 75 kPa, and although there are no uniformly established and validated cut-offs for specific stages of fibrosis, it uses a single cut-off. Table 4.3 shows the established high and low cut-off values of APRI, FIB-4, FibroTest, and a range of the most commonly reported cut-offs for transient elastography (FibroScan) for diagnosing cirrhosis (F4) and significant fibrosis (≥F2).

TABLE 4.3. Cut-off values of non-invasive tests for the detection of significant fibrosis and cirrhosis.

TABLE 4.3

Cut-off values of non-invasive tests for the detection of significant fibrosis and cirrhosis.

Separate meta-analyses were performed to evaluate the diagnostic performance of low and high cut-offs of different NITs (APRI, FIB-4, FibroTest and FibroScan) and for each METAVIR stage (F2–F4). There were data from 79 studies (280), which included 38 studies from South-East Asia, two from sub-Saharan Africa and the remainder from various countries and geographical regions (see Web appendix 2: SR4). There were two studies in HBV/HIV-coinfected persons (44,80), one in children (61) but none in adolescents or pregnant women. Overall, the quality of evidence was rated as low, because of bias due to the absence of predetermined index test cut-offs, and selection bias in study populations.

Diagnostic accuracy and performance of NITs

Table 4.4 presents the summary sensitivity, specificity, and positive and negative predictive values for the detection of cirrhosis (F4 stage) and significant fibrosis (≥F2 stage) for APRI, FibroTest and transient elastography (FibroScan). Additional data on all NITs, including FIB-4 (not used for diagnosing F4) and FibroTest are available in Web appendix 2: SR4. For the diagnosis of cirrhosis (F4), FibroScan had similar sensitivity (86%) to FibroTest (88%), but significantly better sensitivity than the APRI low or high cut-offs (65% and 35%, respectively). FibroScan had similar specificity (87%) to the APRI high cut-off (89%), but significantly better specificity than the FibroTest (73%).

TABLE 4.4. Summary of sensitivity, specificity, and positive and negative predictive values of APRI, FibroTest and transient elastography (FibroScan) for the detection of cirrhosis (F4) and significant fibrosis (≥F2).

TABLE 4.4

Summary of sensitivity, specificity, and positive and negative predictive values of APRI, FibroTest and transient elastography (FibroScan) for the detection of cirrhosis (F4) and significant fibrosis (≥F2).

The positive and negative predictive value, number of true-positive, false-positive, true-negative and false-negative results of NITs for the diagnosis of cirrhosis (F4) were also assessed (Tables 4.4 and 4.5). For this analysis, APRI and FibroScan only were selected, as FIB-4 is not used for diagnosing cirrhosis, and FibroTest is less accurate than FibroScan for diagnosing cirrhosis. The prevalence of cirrhosis and fibrosis in the population under evaluation is a major determinant of the predictive value of these tests in practice. The median prevalence (interquartile range) of fibrosis stages F2–F4 in included studies was: for F4 17% (12–25%) and ≥F2 49% (34–62%), but this was based on a highly selected population who had liver biopsy because of various clinical and laboratory indications. The true prevalence in a clinic setting or at a community level will be lower. Table 4.5 presents the number of true- and false-positive and true- and false-negative results, using APRI (low, high or combined cut-offs) and FibroScan for the detection of cirrhosis (F4) in 1000 persons, assuming a prevalence of 10%.

TABLE 4.5. Number of true- and false- positive and -negative results, and indeterminate results using APRI (low, high or combined cut-offs) and transient elastography (FibroScan) for the detection of cirrhosis (F4) in 1000 persons, assuming a prevalence of 10%.

TABLE 4.5

Number of true- and false- positive and -negative results, and indeterminate results using APRI (low, high or combined cut-offs) and transient elastography (FibroScan) for the detection of cirrhosis (F4) in 1000 persons, assuming a prevalence of 10%. (more...)

The positive predictive value (PPV) was low (less than 50%) for all NITs, but FibroScan had a higher PPV (42%) than APRI using either a high or low cut-off (26% and 22%) (Table 4.4). Although using a low APRI cut-off has a much higher sensitivity than the high cut-off, it results in many more false-positive results compared to the high cut-off (225 versus 99 in 1000 persons tested) (Table 4.5). Overall, there would be no significant difference in the number of false-positive and false-negative results between persons tested with FibroScan and those tested using the combined cut-offs of APRI.

Other fibrosis stages

For the diagnosis of fibrosis stages ≥F2, the summary sensitivities of APRI (low cut-off), FibroTest and transient elastography (FibroScan) were 78%, 68% and 76%, respectively, while the summary specificities of APRI (high cut-off), FibroTest and FibroScan were 92%, 92% and 82%, respectively. There were no significant differences between the accuracy of FibroScan and FibroTest in the diagnosis of stages ≥F2 and ≥F3. For the diagnosis of stages ≥F2, the APRI low cut-off had a similar sensitivity and APRI high cut-off had a significantly better specificity than FibroScan.

Overall, there were also no differences in the diagnostic accuracy of the evaluated NITs in relation to ethnicity (South-East Asia versus other ethnicities), but only one study was conducted in sub-Saharan Africa and none in Latin America.

4.3. Rationale for the recommendations

Balance of benefits and harms

The Guidelines Development Group recommended the use of NITs to assist in the assessment of stage of liver disease and diagnosis of cirrhosis, to help prioritize those at greatest risk of morbidity and mortality for antiviral therapy. This avoids the use of liver biopsy, which is an expensive and invasive procedure associated with patient discomfort, carries a small risk of serious bleeding and requires specialist histological interpretation for accurate staging. Based on evidence from the systematic review, the Guidelines Development Group considered that transient elastography (FibroScan) (where resources permit) and APRI were the most useful tests for the assessment of cirrhosis in LMICs. However, the recommendation was conditional because the PPV for detection of cirrhosis was low for all NITs, and in particular for APRI (detecting only one third of persons with cirrhosis), and there has been very limited evaluation of their use in sub-Saharan Africa. FIB-4 was not considered or recommended because it has been developed and validated for the detection of fibrosis stages ≥F3 and not cirrhosis. FibroTest is a commercial assay and less accurate than transient elastography (FibroScan) for diagnosing cirrhosis. Standard ultrasound was also not considered as it only detects advanced cirrhosis, and therefore its use would result in an unacceptably high number of false-negative results.

Potential harms from the use of NITs include treatment decisions based on either false-positive or false-negative APRI test results. A false-positive test result may lead to a patient being treated unnecessarily or prematurely, which would expose them to the inconvenience of long-term treatment, potential drug resistance as well as a small risk of drug toxicities. Conversely, a false-negative result means that a person with cirrhosis would not be identified by NITs, and may therefore not receive prompt antiviral therapy, which might prevent progression to decompensation or decrease the risk of developing HCC.

APRI is based on two indirect markers of fibrosis (AST and platelet count), which are readily available in resource-limited settings. An approach that combined a high and a low cut-off value of APRI would be optimal (a high cut-off with high specificity [i.e. fewer false-positive results] and a low cut-off with high sensitivity [i.e. fewer false-negative results]). However, the Guidelines Development Group recommended the use of a single high cut-off >2 for identifying adults with cirrhosis (F4) and in need of antiviral therapy, and those ≤2 without cirrhosis for several reasons.

  1. Although in adults an APRI score of >2 would detect only one third of persons with cirrhosis, this high cut-off of >2 was used, because the low cut-off would result in an unacceptibly high number of false-positive test results (approximately one quarter of those tested).
  2. It is also likely that adults with cirrhosis not detected using an APRI score >2 would be identified as being in need of antiviral therapy because of other eligibility criteria (such as persistently abnormal ALT levelsa as well as evidence of ongoing HBV replication (HBV DNA >20 000 IU/mL) (see also Chapter 5: Who to treat and not to treat).
  3. It is also simpler and more feasible to use a single cut-off in resource-limited settings.

Clinical evidence of cirrhosis or an APRI score >2 are recommended in these guidelines as key criteria for prioritizing initiation of antiviral therapy among adults in resource-limited settings. Conversely, treatment can be deferred in those without clinical features of cirrhosis (or based on APRI score ≤2), who also have persistently normal ALT concentrations and low levels of HBV replication (HBV DNA <2000 IU/mL), and who can be re-evaluated at subsequent visits. For those with an APRI score ≤2, a proportion will fulfil other criteria for treatment such as persistently abnormal ALT or raised HBV DNA levels. Adults with indeterminate APRI scores (i.e. between 1 and 2 based on the combined APRI cut-off) in particular would need retesting and evaluation every one or two years.

Caveats in the use of NITs: Overall, the Guidelines Development Group considered that the benefits of using NITs outweighed these potential harms. The benefits included the potential increase in treatment availability resulting from access to non-invasive monitoring, and reduced risk of adverse events from liver biopsy.

However, a number of very important caveats were noted in the use of NITs. Overall, the PPV of all NITs for the diagnosis of cirrhosis was low, especially for APRI, and many cases of cirrhosis will be missed using NITs alone. It is therefore important that NITs are used alongside clinical criteria and other laboratory criteria (ALT and HBV DNA levels) to identify those in need of treatment. Second, the results of NITs may be impacted by intercurrent diseases that may falsely increase or decrease the scores. For example, heavy alcohol intake (due to AST elevation from alcoholic hepatitis), and conditions such as malaria or HIV (due to a decrease in platelet count), or use of drugs and traditional herbal medicines may also cause falsely high APRI scores. Hepatitis flares or acute hepatitis, congestive heart failure or a recent meal may also cause a high liver stiffness measurement on elastography (81). The impact of different comorbidities on the diagnostic accuracy of the APRI score has not been fully evaluated and, in particular, there has been no evaluation of NITs, particularly APRI in people from sub-Saharan Africa or among children.

Limitations with transient elastography (FibroScan) include the following: it uses a single cut-off and therefore reported sensitivities and specificities of FibroScan may be overestimated across fibrosis stages; there are no uniformly established and validated cut-offs for specific stages of fibrosis; accuracy is diminished in obese persons, in the presence of moderate/severe necroinflammation, right-sided heart failure, and food intake. Examination is not feasible in the presence of ascites and is contraindicated in pregnant women. Data on its use in children are limited, and special probes are required.

Although the data on HBV/HIV coinfection were limited, the performance of NITs in such persons is unlikely to be significantly different from that in HBV-monoinfected persons.

Values and preferences

From a patient’s perspective, the Guidelines Development Group felt that the APRI test would be acceptable, as it requires only phlebotomy, is routinely available and can be undertaken by untrained staff. Similarly, transient elastography (FibroScan) is non-invasive, takes less than 10 minutes to perform, can be undertaken in outpatient or community settings, and health-care staff can be easily trained in its use. Factors that limit the use of transient elastography include the high cost of the equipment, the need for preventive and corrective maintenance, regular service/recalibration, trained operators, and the lack of extensively validated cut-off values for specific stages of fibrosis.

Resource use

The lower cost of the blood-based NITs compared to transient elastography was a key factor in the recommendation for the use of APRI as the preferred NIT. The blood tests that are needed to calculate APRI score are routinely available at most health-care facilities, even in LMICs, and are inexpensive (less than a few dollars each). The results of APRI using a high cut-off of >2 in adults to diagnose cirrhosis are also relatively straightforward to interpret. Cost considerations were a concern with the FibroTest. It is a patented test that is costly (approximately US$ 73/test) and requires a certified laboratory or the processing of specimens at a centralized laboratory in France.

In contrast to APRI, the cost of acquiring, running and maintaining (requires regular service/recalibration) a transient elastography machine such as the FibroScan is high; the machine costs US$ 50 000 (or US$ 34 000 for the portable machine), and yearly maintenance is US$ 8500/year. However, consumable costs are minimal for FibroScan, and the cost per test could be less than US$ 10 in some settings. FibroScan also requires a trained operator, and the interpretation of the results needs an understanding of the indications and limitations of the method, especially given the lack of well-validated cut-off values for specific stages of fibrosis. However, the training process is relatively straightforward and the inter- and intra-observer variability of the test is low (81). FibroScan in children requires a specially designed probe and a different specific probe for those with a body mass index (BMI) >30 kg/m2. For these reasons, the use of transient elastography and FibroTest was considered to be less feasible in most LMICs.

Research gaps

  • Conduct comparative assessments of NITs for use in high-prevalence resource-limited settings, i.e. APRI, FIB-4, transient elastography, as well as other elastography techniques (e.g. ARFI) to identify persons with cirrhosis and advanced fibrosis (requiring treatment) as well as those with minimal disease (not requiring treatment).
  • Evaluate the performance of NITs, especially in populations from sub-Saharan Africa and Latin America, and in other underresearched populations, including persons with HBV/HIV coinfection, HBV/HDV coinfection, pregnant women, children and adolescents, and those with non-alcoholic fatty liver disease. Conduct studies on the cost–effectiveness of NITs in the context of LMICs.
  • Evaluate the impact of hepatitis flares and other factors on the diagnostic accuracy and performance of the APRI score.
  • Establish and validate FIB-4 cut-offs for the diagnosis of cirrhosis and advanced fibrosis.

Footnotes

a

ALT levels fluctuate in persons with chronic hepatitis B and require longitudinal monitoring to determine the trend. Upper limits for normal ALT have been defined as below 30 U/L for men and 19 U/L for women, although local laboratory normal ranges should be applied. Persistently abnormal or normal may be defined as three ALT determinations above or below the upper limit of normal, made at unspecified intervals during a 6–12-month period or predefined intervals during a 12-month period.

Copyright © World Health Organization 2015.

All rights reserved. Publications of the World Health Organization are available on the WHO website (www.who.int) or can be purchased from WHO Press, World Health Organization, 20 Avenue Appia, 1211 Geneva 27, Switzerland (tel.: +41 22 791 3264; fax: +41 22 791 4857; e-mail: tni.ohw@sredrokoob).

Requests for permission to reproduce or translate WHO publications – whether for sale or for non-commercial distribution – should be addressed to WHO Press through the WHO website (www.who.int/about/licensing/copyright_form/en/index.html).

Bookshelf ID: NBK305554

Views

  • PubReader
  • Print View
  • Cite this Page
  • PDF version of this title (3.0M)

Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...