Comparison of Natural Language Processing and Manual Coding for the Identification of Cross-Sectional Imaging Reports Suspicious for Lung Cancer

Roxanne Wadia; Kathleen Akgun; Cynthia Brandt; Brenda T Fenton; Woody Levin; Andrew H Marple; Vijay Garla; Michal G Rose; Tamar Taddei; Caroline Taylor

doi:10.1200/CCI.17.00069

Comparison of Natural Language Processing and Manual Coding for the Identification of Cross-Sectional Imaging Reports Suspicious for Lung Cancer

JCO Clin Cancer Inform. 2018 Dec:2:1-7. doi: 10.1200/CCI.17.00069.

Authors

Affiliation

¹ Roxanne Wadia, Kathleen Akgun, Cynthia Brandt, Brenda T. Fenton, Andrew H. Marple, Vijay Garla, Michal G. Rose, and Tamar Taddei, Yale University School of Medicine, New Haven; and Roxanne Wadia, Kathleen Akgun, Cynthia Brandt, Brenda T. Fenton, Woody Levin, Michal G. Rose, Tamar Taddei, and Caroline Taylor, Veterans Affairs Connecticut Healthcare System, West Haven, CT.

Abstract

Purpose: To compare the accuracy and reliability of a natural language processing (NLP) algorithm with manual coding by radiologists, and the combination of the two methods, for the identification of patients whose computed tomography (CT) reports raised the concern for lung cancer.

Methods: An NLP algorithm was developed using Clinical Text Analysis and Knowledge Extraction System (cTAKES) with the Yale cTAKES Extensions and trained to differentiate between language indicating benign lesions and lesions concerning for lung cancer. A random sample of 450 chest CT reports performed at Veterans Affairs Connecticut Healthcare System between January 2014 and July 2015 was selected. A reference standard was created by the manual review of reports to determine if the text stated that follow-up was needed for concern for cancer. The NLP algorithm was applied to all reports and compared with case identification using the manual coding by the radiologists.

Results: A total of 450 reports representing 428 patients were analyzed. NLP had higher sensitivity and lower specificity than manual coding (77.3% v 51.5% and 72.5% v 82.5%, respectively). NLP and manual coding had similar positive predictive values (88.4% v 88.9%), and NLP had a higher negative predictive value than manual coding (54% v 38.5%). When NLP and manual coding were combined, sensitivity increased to 92.3%, with a decrease in specificity to 62.85%. Combined NLP and manual coding had a positive predictive value of 87.0% and a negative predictive value of 75.2%.

Conclusion: Our NLP algorithm was more sensitive than manual coding of CT chest reports for the identification of patients who required follow-up for suspicion of lung cancer. The combination of NLP and manual coding is a sensitive way to identify patients who need further workup for lung cancer.

Publication types

Comparative Study
Research Support, N.I.H., Extramural
Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Aged
Algorithms
Clinical Coding / methods*
Connecticut
Female
Humans
Lung Neoplasms / diagnostic imaging*
Male
Middle Aged
Natural Language Processing*
Sensitivity and Specificity
Tomography, X-Ray Computed

Grants and funding

T15 LM007056/LM/NLM NIH HHS/United States