Structural group auditing of a UMLS semantic type's extent

J Biomed Inform. 2009 Feb;42(1):41-52. doi: 10.1016/j.jbi.2008.06.001. Epub 2008 Jun 17.

Abstract

Each UMLS concept is assigned one or more semantic types (STs) from the Semantic Network. Given the size and complexity of the UMLS, errors in these assignments are unavoidable. We present two auditing methodologies for groups of semantically similar concepts. The straightforward procedure starts with the extent of an ST, which is the group of all concepts assigned this ST. We divide the extent into groups of concepts that have been assigned exactly the same set of STs. An algorithm then finds subgroups of suspicious concepts. These subgroups, whose concepts purportedly share the same semantics, are presented to a human auditor, so that concepts with wrong or missing ST assignments stand out from the rest of their subgroup. The dynamic procedure detects concepts that become suspicious in the course of the auditing process. Both procedures are applied to two semantic types. The results are compared with a comprehensive manual audit and show very high error recall with much higher precision.
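
The partitioning step described above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: the function names, the `assignments` mapping, and the placeholder concept and ST labels are all hypothetical. The abstract does not state how suspicious subgroups are selected, so the size-based filter in `small_groups` is an assumption made purely for illustration.

```python
from collections import defaultdict


def extent(assignments, st):
    """Return all concepts that are assigned the semantic type `st`."""
    return {concept for concept, sts in assignments.items() if st in sts}


def partition_by_st_set(assignments, st):
    """Divide the extent of `st` into groups of concepts that share
    exactly the same set of semantic types."""
    groups = defaultdict(set)
    for concept in extent(assignments, st):
        # frozenset makes the ST combination hashable, so it can be a dict key
        groups[frozenset(assignments[concept])].add(concept)
    return dict(groups)


def small_groups(groups, max_size):
    """ASSUMPTION (not from the abstract): treat groups with at most
    `max_size` concepts as candidates for closer review."""
    return {sts: members for sts, members in groups.items()
            if len(members) <= max_size}


# Hypothetical data: placeholder concept ids mapped to placeholder ST names.
sample = {
    "concept_1": {"ST_A"},
    "concept_2": {"ST_A"},
    "concept_3": {"ST_A", "ST_B"},
    "concept_4": {"ST_B"},
}

sample_groups = partition_by_st_set(sample, "ST_A")
for st_set, members in sample_groups.items():
    print(sorted(st_set), sorted(members))
```

Keying each group on a `frozenset` of STs means two concepts land in the same group only when their ST assignments match exactly, which mirrors the "exactly the same set of STs" condition in the text.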

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Abstracting and Indexing
  • Algorithms
  • Animals
  • Semantics*
  • Terminology as Topic
  • Unified Medical Language System*