Controlling testing volume for respiratory viruses using machine learning and text mining

Mark V Mai; Michael Krauthammer

Controlling testing volume for respiratory viruses using machine learning and text mining

AMIA Annu Symp Proc. 2017 Feb 10:2016:1910-1919. eCollection 2016.

Authors

Mark V Mai¹, Michael Krauthammer²

Affiliations

¹ The Children's Hospital of Philadelphia, Philadelphia, PA.
² Yale School of Medicine, New Haven, CT.

PMID: 28269950
PMCID: PMC5333257

Abstract

Viral testing for pediatric inpatients with respiratory symptoms is common, with considerable associated charges. In an attempt to reduce testing volumes, we studied whether data available at the time of admission could aid in identifying children with low likelihood of having a particular viral origin of their symptoms, and thus safely forgo broad viral testing. We collected clinical data for 1,685 pediatric inpatients receiving respiratory virus testing from 2010-2012. Machine-learning on the data allowed us to construct pre-test models predicting whether a patient would test positive for a particular virus. Text mining improved the predictions for one viral test. Cost-sensitive models optimized for test sensitivity showed reasonable test specificities and an ability to reduce test volume by up to 46% for single viral tests. We conclude that diverse forms of data in the electronic medical record can be used productively to build models that help physicians reduce testing volumes.

MeSH terms

Adolescent
Child
Child, Preschool
Data Mining*
Decision Trees
Electronic Health Records*
Female
Hospitalization
Humans
Infant
Machine Learning*
Male
Models, Theoretical
ROC Curve
Respiratory Tract Infections / diagnosis*
Retrospective Studies
Sensitivity and Specificity
Virus Diseases / diagnosis*
Viruses / isolation & purification