U.S. flag

An official website of the United States government

NCBI Bookshelf. A service of the National Library of Medicine, National Institutes of Health.

Simpkin AJ, Rooshenas L, Wade J, et al. Development, validation and evaluation of an instrument for active monitoring of men with clinically localised prostate cancer: systematic review, cohort studies and qualitative study. Southampton (UK): NIHR Journals Library; 2015 Jul. (Health Services and Delivery Research, No. 3.30.)

Cover of Development, validation and evaluation of an instrument for active monitoring of men with clinically localised prostate cancer: systematic review, cohort studies and qualitative study

Development, validation and evaluation of an instrument for active monitoring of men with clinically localised prostate cancer: systematic review, cohort studies and qualitative study.

Show details

Chapter 5Comparison of prostate-specific antigen markers used to alert men to worsening prostate cancer

Aims and background

This part of the project involved:

  • investigating the prediction of PCa outcomes
  • validating the ability of a PSARR to predict PCa outcomes
  • testing the PSARR method against the commonly used methods of PSADT and PSAv.

Chapter 4 above has shown that the model of age-related PSA change offers reasonably accurate predictions of future PSA levels in modern monitoring cohorts. In this chapter, we compare PSARRs with PSADT and PSAv in retrospective analyses of three historical cohorts where several measures of PCa progression are available.

Methods

Prostate-specific antigen markers

Prostate-specific antigen doubling time is the most popular PSA kinetic measure used to identify those patients on AM who need clinical review. PSADT is calculated from a model of log(PSA) against time:

log(PSA)=α+β×time.
(6)

The doubling time is then calculated as log(2)/β and is interpreted as the estimated time for a man’s PSA to double. Thus, low values of doubling time coincide with rapidly rising PSA and are a cause for concern. Conversely, high values correspond to a steady trend of PSA, and those patients above a threshold of PSADT are advised to remain on monitoring. For a patient’s doubling time to be calculated using the regression method, at least three PSA test results are required. We consider a PSADT of < 3 years to be a marker for rapidly rising PSA.79,84,86,89

An alternative to PSADT is PSAv. This explicitly measures the rate of change in PSA over time calculated as the slope term (β) from a model of PSA against time on monitoring:

PSA=α+β×time.
(7)

Given that PSAv is the rate of increase of PSA, low positive values refer to steady PSA change and high PSAv equates to sharply increasing PSA. Several studies have used a threshold of PSAv > 1 ng/ml/year22,71,100 as a trigger for clinical review during monitoring. For this paper, we have calculated each man’s PSAv at each visit beyond their third (calculation of PSAv requires at least three observations of PSA). We have used PSAv > 1 ng/ml/year as a marker for rapidly rising PSA.

Prostate-specific antigen reference ranges have been established as alternatives to these two PSA kinetic measures.33 A model for such reference ranges has been previously developed93 on a cohort of 512 men enrolled in the AM arm of the ProtecT trial.15 For example, a 95% reference range, above which the fastest increasing 5% of men with clinically localised PCa lie, is constructed as:

PSA^+1.68var(PSA^).
(8)

where PSA^ and var(PSA^) are found using the ProtecT trial model described in Chapter 3. If subsequent PSA test results fall above this reference range, this implies that the patient’s PSA growth is above what would be expected as a result of normal ageing, such that a recommendation for clinical review could be warranted (e.g. Figure 8). Using the reference range model, we calculate whether or not a man’s PSA result is inside the reference range at each visit; if it falls outside the reference range, this suggests an increasing PSA beyond expected normal changes and that further clinical review may be necessary.

FIGURE 8. An example of a PSARR.

FIGURE 8

An example of a PSARR. The PSA result represented by a triangle would lead to a recommendation for clinical review.

Study populations

The PSA markers were tested on the RMH AS cohort and the SPCG4 and UCHC WW cohorts described in Chapter 4, Methods, Study populations (see also Table 24). The data were again restricted to remove any single PSA test result > 50 ng/ml and any man with a PSA at diagnosis of > 20 ng/ml. Hence, the sample sizes differ (Table 12 and see Table 25).

TABLE 12

TABLE 12

Summary of PCa outcomes used for validation in each cohort, with the percentage positive for each outcome during cohort follow-up

Outcomes

All-cause mortality data from UCHC are not used in this analysis to validate reference ranges, because they probably contain many non-PCa-related deaths. Although the data from RMH include PCSM, no such deaths are reported. To exploit these cohorts fully, we introduce outcomes pertaining to a PSA milestone, namely whether or not a man reaches a PSA of 20 ng/ml during follow-up. The outcomes used for validation, along with the proportion of men experiencing these outcomes, are given in Table 12. Given that it is not clinically useful to have an alert (through PSADT, PSAv or PSARRs) close to, say, PCSM, we have only included those alerts occurring at least 5 years prior to PCSM or metastases and 2 years prior to reaching a PSA of 20 ng/ml.

Statistical analysis

Statistical analyses were performed in Stata 12.70 We report results solely for men with Gleason score of 3 + 3, because this low-risk group represents the vast majority of modern AM cohorts.17 We first compared 90%, 95% and 99% PSARRs to choose the optimal cut-off for rapidly rising PSA using the area under the receiver operator characteristic (ROC) curve statistic for onset of metastases and PCSM in SPCG4. Cox proportional hazards (Cox PH) models105 were used to estimate the association between markers and time-to-event outcomes. Time zero was set to be the date of beginning of monitoring, whereas date of an event or end of monitoring was the time at risk. For each cohort we fit a base model controlling for T-stage (where applicable), baseline PSA and baseline age. We report the HRs for each PSA marker and outcome, together with their 95% CIs. Finally, we compare PSADT, PSAv and PSARR using the difference of deviances from the Cox models using each of these in turn. A better model will have a lower deviance (see Table 16).

TABLE 16

TABLE 16

Differences in deviance (predictive ability) between the three PSA marker models

The ability of each PSA marker to discriminate between higher- and lower-risk men is calculated using Harrell’s concordance statistic (or c-statistic) from each Cox PH model.105,106 This is equivalent to an area under the ROC curve for time-to-event responses, such as those of interest here. The c-statistic estimates the probability that, of two randomly chosen men, the man with the higher-predicted probability of an outcome will reach an outcome before the patient with the lower-risk score. By adding each PSA marker in turn to the baseline models, we can compare the change caused by the marker, by looking at the difference in c-statistic from base model to marker model.

Results

For onset of metastases, using 95%, 97.5% and 99.5% reference ranges led to an area under the curve (AUC) of 0.58, 0.59 and 0.55, respectively. For the PCSM outcome, using 95%, 97.5% and 99.5% reference ranges resulted in AUCs of 0.59, 0.56 and 0.54 respectively. Given these results, we chose to proceed using the 95% reference ranges.

Metastases

Using an alert based on PSARR alone to recommend clinical review would miss 15 out of 30 future metastatic cancers (sensitivity 50%, 95% CI 31% to 69%) while causing 38 unnecessary rebiopsies to be performed in the 107 men with non-life-threatening disease (specificity 65%,95% CI 55% to 74%). The PSADT < 3-year marker would lead to 13 missed metastatic cancers out of 30 (sensitivity 57%, 95% CI 37% to 75%; Table 13) and would unnecessarily biopsy 42 of the 107 men with disease that would not progress to metastases during the follow-up of the cohort (specificity 61%, 95% CI 51% to 70%). Finally, the PSAv > 1 ng/ml/year marker alone would miss just eight of the 30 future metastatic cancers (sensitivity 73%, 95% CI 54% to 88%) but would cause 70 unnecessary biopsies (specificity 35%, 95% CI 26% to 45%).

TABLE 13

TABLE 13

Sensitivity and specificity comparing the PSA markers in alerting men with onset of metastases

Initial age, PSA and T-stage were not useful to identify those men developing metastases (c-statistic = 0.58, see Table 15). There was weak evidence of positive associations between the PSADT marker (HR 2.1, 95% CI 0.97 to 4.4) and PSARR (HR 2.4, 95% CI 0.9 to 6.7) and development of metastases (see Table 15), and no evidence that men alerted by the PSAv > 1 ng/ml/year marker were more likely to develop metastases in the future (HR 1.1, 95% CI 0.5 to 2.7; see Table 15). However, the PSA markers slightly improved the discriminative ability of the baseline model, with increases in c-statistic of 9% (PSADT), 7% (PSARRs) and 2% (PSAv). Comparing the PSA marker models, there was some weak evidence of a better predictive ability using doubling time over velocity (p = 0.06; see Table 16).

TABLE 15

TABLE 15

Hazard ratio and c-statistic improvements for the three PSA markers on each of the four outcomes

Prostate cancer-specific mortality

The PSARR marker led to 56% sensitivity (95% CI 31% to 79%; Table 14) and 61% specificity (95% CI 52% to 70%) when used alone to determine which men would die from PCa. This would mean that 8 out of 18 men with lethal cancer would be missed by the PSARR, and 48 unnecessary biopsies would be performed. Using the PSADT marker alone would miss 6 out of 18 lethal cancers (sensitivity 67%, 95% CI 41% to 87%) and cause 52 unnecessary rebiopsies to be performed in the 124 men without lethal cancer (specificity 58%, 95% CI 49% to 67%). If the PSAv marker were used as a standalone recommendation for clinical review, just three of the 18 lethal cancers would be missed (sensitivity 83%, 95% CI 59% to 96%), but 83 unnecessary biopsies would be performed in the 124 men with non-life-threatening disease (specificity 33%, 95% CI 25 to 42%).

TABLE 14

TABLE 14

Sensitivity and specificity comparing the PSA markers in alerting men with PCa specific mortality

A model using baseline characteristics of PSA, T-stage and age was not useful to discriminate between those who died as a result of PCa and those who did not (c-statistic = 0.53; Table 15). Those men with a PSADT < 3 years were more likely to die from PCa (HR 3.2, 95% CI 1.1 to 9.2) and this improved the c-statistic from the baseline model by 0.13 (25%). There was some weak evidence for an association between PSARR (HR 3.3, 95% CI 0.9 to 12.1) and little evidence for an association between PSAv (HR 2.1, 95% CI 0.6 to 7.7) and death from PCa. There was some weak evidence that the PSADT model was a better predictor of outcome than the PSAv (p = 0.05; Table 16) model.

Reaching a prostate-specific antigen of 20 ng/ml during surveillance

In the RMH cohort, baseline age, PSA and T-stage had a moderate discriminative ability in separating men who would reach a PSA of 20 ng/ml (c-statistic = 0.74; see Table 15). There was no evidence that those men with a PSA outside their reference ranges were more likely to reach a PSA of 20 ng/ml (HR 1.5, 95% CI 0.5 to 4.1), and including PSARR only trivially improved the c-statistic. There was some evidence that men with a PSADT < 3 years were more likely to reach a PSA of 20 ng/ml (HR 2.2, 95% CI 1.0 to 4.6); including PSADT improved the c-statistic by 0.03 (5%). Those who would have been alerted by a PSAv > 1 ng/ml/year were more likely to reach a PSA of 20 ng/ml (HR 10.8, 95% CI 1.5 to 80.8). The PSAv marker led to an increase of 0.05 (7%) in the c-statistic from the baseline model. Comparing these models, the PSAv marker model was preferred to those including PSARR (p = 0.001; see Table 16) and PSADT (p = 0.006), with some weak evidence that the PSADT model improved on the PSARR model (p = 0.06).

For the UCHC data, baseline age and PSA were highly discriminative of reaching PSA 20 ng/ml in at least 2 years, with a c-statistic for the baseline model of 0.82 (see Table 15). Men with a PSA result over their PSARR were more likely to reach 20 ng/ml (HR 12, 95% CI 5.4 to 26.9; see Table 15). Given that there are very few men in the Gleason score of 3 + 3 category who reach a PSA of 20 ng/ml, the CI is very imprecise. Including PSARR led to an increase in c-statistic of 0.03 (4%) over the baseline model. Associations were found between both PSADT (HR 6.0, 95% CI 2.8 to 12.8) and PSAv (HR 4.3, 95% CI 2.5 to 7.5) and reaching a PSA of 20 ng/ml in the UCHC cohort. The inclusion of each of these markers led to increases of 0.02 over the baseline model, or 3%. There was strong evidence that the PSARR model had a higher predictive ability than both the PSAv and PSADT models (both p < 0.0001; see Table 16).

Discussion

Comparing the ability of the three PSA markers to alert men to progression of their disease, the findings here are mixed. There is some weak evidence that PSADT offers better prediction of metastases and PCSM compared with PSAv, but there was no evidence that PSADT is to be preferred to PSARRs. For classifying patients who may reach a PSA milestone of 20 ng/ml, the evidence provided here is marginally in favour of using PSAv for a modern AS cohort in RMH and strongly in favour of PSARR in data on an older pre-PSA cohort from UCHC.

Using the PSARR marker results in good specificity (i.e. leads to fewer unnecessary biopsies) in the SPCG4 men. However, this marker also misses some lethal cancers in the SPCG4 cohort (lower sensitivity). The PSAv marker has the reverse effect, with few lethal cancers missed but many unnecessary biopsies performed.

Using baseline characteristics of age, PSA and T-stage was not useful to discriminate men who go on to experience either metastases (c-statistic = 0.58) or PCSM (c-statistic = 0.53). PSADT and PSARRs increased the c-statistic of the PCSM model by 0.13 (21%) and 0.11 (25%). An important finding of this analysis is that using baseline age, PSA and T-stage can discriminate those men who are likely to reach a PSA of 20 ng/ml (c-statistics of 0.74 for RMH and 0.82 for UCHC). For this outcome, there were only marginal improvements to these c-statistics by any of the three PSA markers.

Using the PSARR model to track men on monitoring has several advantages compared with PSA kinetics. First, it has been shown that alerts based on PSA kinetic measures are heavily associated with the number of PSA tests.33 For instance, one high PSA result following four unremarkable tests will lead to a small value of PSADT (and possible alert to clinical review), whereas this measure would be relatively unaffected if the high result followed 20 unremarkable PSA tests. Using the PSARR is less sensitive to the number of test results available. Second, PSA modelling is based on empirical data on the natural history of age-related PSA increase in men with clinically localised PCa, rather than a mathematical model. This model is, therefore, more robust to PSA changes (i.e. has better ability to spot an abnormal test result) than using PSA kinetic estimates obtained from a single series of PSA data. Third, using one model, such as the ProtecT trial model proposed here, to predict PSA avoids the problem of inconsistent calculation of PSA kinetics. Fourth, the calculations are all done at the outset, with no further calculations required of the clinician as further PSA results are considered.

Outcomes from the SPGC4 cohort were useful for comparison, because the number of events was relatively large and many serial measurements of PSA were taken some time before these outcomes occurred. However, in the other cohorts, a lack of clinical outcomes, such as PCSM or metastases (i.e. UCHC) or a lack of events (i.e. RMH), diminishes our ability to validate the PSARR method or to compare it with PSA dynamics. We created outcomes involving PSA milestones, which may be clinically useful. However, stronger conclusions would be drawn using PCSM. Given that men enrolling in AM are the lowest-risk group, it is unlikely that many such events will be observed until many more years of follow-up pass or many more men are enrolled in such studies.

Copyright © Queen’s Printer and Controller of HMSO 2015. This work was produced by Simpkin et al. under the terms of a commissioning contract issued by the Secretary of State for Health. This issue may be freely reproduced for the purposes of private research and study and extracts (or indeed, the full report) may be included in professional journals provided that suitable acknowledgement is made and the reproduction is not associated with any form of advertising. Applications for commercial reproduction should be addressed to: NIHR Journals Library, National Institute for Health Research, Evaluation, Trials and Studies Coordinating Centre, Alpha House, University of Southampton Science Park, Southampton SO16 7NS, UK.

Included under terms of UK Non-commercial Government License.

Bookshelf ID: NBK305578

Views

  • PubReader
  • Print View
  • Cite this Page
  • PDF version of this title (2.6M)

Other titles in this collection

Recent Activity

Your browsing activity is empty.

Activity recording is turned off.

Turn recording back on

See more...