Do changes in health reveal the possibility of undiagnosed pancreatic cancer? Development of a risk-prediction model based on healthcare claims data

PLoS One. 2019 Jun 25;14(6):e0218580. doi: 10.1371/journal.pone.0218580. eCollection 2019.

Abstract

Background and objective: Early detection methods for pancreatic cancer are lacking. We aimed to develop a prediction model for pancreatic cancer based on changes in health captured by healthcare claims data.

Methods: We conducted a case-control study of 29,646 Medicare-enrolled patients aged 68 years and above with pancreatic ductal adenocarcinoma (PDAC) reported to the Surveillance, Epidemiology, and End Results (SEER) tumor registries program in 2004-2011 and 88,938 age- and sex-matched controls. We developed a prediction model using multivariable logistic regression on Medicare claims for 16 risk factors and pre-diagnostic symptoms of PDAC present within 15 months prior to PDAC diagnosis. Claims within 3 months of PDAC diagnosis were excluded in sensitivity analyses. We evaluated the discriminatory power of the model with the area under the receiver operating characteristic curve (AUC) and performed cross-validation by bootstrapping.
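
The evaluation step described above (an AUC with a bootstrap check) can be illustrated with a minimal sketch. It is not the authors' code: the function names `auc` and `bootstrap_auc_interval`, the synthetic scores, the 1:3 case-to-control ratio of the toy data, and the choice of a simple percentile bootstrap are all assumptions made for illustration, and the logistic regression fit itself is omitted (the sketch starts from predicted scores).

```python
import random


def auc(labels, scores):
    """AUC via the Mann-Whitney rank-sum identity, using average ranks for ties."""
    order = sorted(range(len(scores)), key=lambda i: scores[i])
    ranks = [0.0] * len(scores)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover every item tied with the score at position i.
        while j + 1 < len(order) and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1  # 1-based average rank of the tied group
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    rank_sum_pos = sum(r for r, y in zip(ranks, labels) if y == 1)
    return (rank_sum_pos - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)


def bootstrap_auc_interval(labels, scores, n_boot=200, seed=0):
    """Percentile 95% interval for the AUC from resampling subjects with replacement."""
    rng = random.Random(seed)
    n = len(labels)
    stats = []
    while len(stats) < n_boot:
        idx = [rng.randrange(n) for _ in range(n)]
        ys = [labels[i] for i in idx]
        if 0 < sum(ys) < n:  # a resample needs both cases and controls
            stats.append(auc(ys, [scores[i] for i in idx]))
    stats.sort()
    return stats[int(0.025 * n_boot)], stats[int(0.975 * n_boot) - 1]


# Synthetic, illustrative data only (NOT the study data): 300 "cases" and
# 900 "controls", with case scores shifted slightly upward.
data_rng = random.Random(42)
labels = [1] * 300 + [0] * 900
scores = [data_rng.gauss(0.6 if y == 1 else 0.0, 1.0) for y in labels]

point = auc(labels, scores)
low, high = bootstrap_auc_interval(labels, scores)
print(f"AUC = {point:.3f}, bootstrap 95% interval = ({low:.3f}, {high:.3f})")
```

An AUC of 0.5 corresponds to chance-level discrimination and 1.0 to perfect separation of cases from controls, so values in the 0.6 range indicate only modest separation.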

Results: The prediction model on all cases and controls reached an AUC of 0.68. Excluding the final 3 months of claims lowered the AUC to 0.58. Among new-onset diabetes patients, the prediction model reached an AUC of 0.73, which decreased to 0.63 when claims from the final 3 months were excluded. Performance measures of the prediction models were confirmed by internal validation using the bootstrap method.

Conclusion: Models based on healthcare claims for clinical risk factors, symptoms and signs of pancreatic cancer have limited ability to distinguish those who go on to be diagnosed with pancreatic cancer from those who do not, especially when claims that immediately precede the diagnosis of PDAC are excluded.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Adenocarcinoma / diagnosis*
  • Adenocarcinoma / epidemiology
  • Administrative Claims, Healthcare / statistics & numerical data*
  • Aged
  • Aged, 80 and over
  • Female
  • Health Status*
  • Humans
  • Male
  • Models, Statistical
  • Pancreatic Neoplasms / diagnosis*
  • Pancreatic Neoplasms / epidemiology
  • Undiagnosed Diseases / diagnosis*
  • Undiagnosed Diseases / epidemiology