Metadata-driven creation of data marts from an EAV-modeled clinical research database

Int J Med Inform. 2002 Nov 12;65(3):225-41. doi: 10.1016/s1386-5056(02)00047-3.

Abstract

Generic clinical study data management systems can record data on an arbitrary number of parameters in an arbitrary number of clinical studies without requiring modification of the database schema. They achieve this by using an Entity-Attribute-Value (EAV) model for clinical data. While very flexible for creating transaction-oriented systems for data entry and browsing of individual forms, EAV-modeled data is unsuitable for direct analytical processing, which is the focus of data marts. For this purpose, such data must be extracted and restructured appropriately. This paper describes how such a process, which is non-trivial and highly error-prone if performed using non-systematic approaches, can be automated by judicious use of the study metadata, that is, the descriptions of measured parameters and their higher-level grouping. The metadata, in addition to driving the process, is exported along with the data in order to facilitate its human interpretation.
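The restructuring described above can be illustrated with a minimal Python sketch. It is not the paper's implementation, which the abstract does not detail. The attribute names, the grouping, the `cast` functions, the sample rows, and the `build_marts` helper are all hypothetical. The sketch assumes that each attribute's metadata names a higher-level group and a datatype. The EAV rows are then pivoted into one wide table per group, and the column descriptions are returned alongside the data.

```python
from collections import defaultdict

# Hypothetical study metadata: each attribute has a name, a higher-level
# group (for example, a form), and a function that casts the stored text.
ATTRIBUTES = {
    1: {"name": "tumor_size_mm", "group": "pathology", "cast": float},
    2: {"name": "tumor_grade", "group": "pathology", "cast": int},
    3: {"name": "hemoglobin_g_dl", "group": "labs", "cast": float},
}

# Hypothetical EAV rows: (entity, attribute id, value stored as text).
EAV_ROWS = [
    ("patient-1/visit-1", 1, "14.5"),
    ("patient-1/visit-1", 2, "2"),
    ("patient-1/visit-1", 3, "12.9"),
    ("patient-2/visit-1", 1, "22.0"),
]


def build_marts(eav_rows, attributes):
    """Pivot EAV rows into one wide table per metadata group."""
    # The metadata, not the data, decides which columns each table has.
    columns = defaultdict(list)
    for attr_id in sorted(attributes):
        meta = attributes[attr_id]
        columns[meta["group"]].append(meta["name"])

    tables = defaultdict(dict)  # group -> entity -> {column: value}
    for entity, attr_id, raw in eav_rows:
        meta = attributes[attr_id]
        group = meta["group"]
        # Start each row with every column set to None, so that values
        # never recorded for an entity appear as explicit missing values.
        row = tables[group].setdefault(entity, dict.fromkeys(columns[group]))
        row[meta["name"]] = meta["cast"](raw)

    # Return the column descriptions together with the restructured data.
    return dict(columns), {g: dict(rows) for g, rows in tables.items()}


columns, marts = build_marts(EAV_ROWS, ATTRIBUTES)
```

In this sketch, adding a parameter to a study means adding an entry to `ATTRIBUTES`; the pivoting code itself does not change, which is the sense in which the process is metadata-driven.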

Publication types

  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Breast Neoplasms / pathology
  • Database Management Systems*
  • Databases as Topic / organization & administration*
  • Female
  • Humans
  • Information Storage and Retrieval / methods*
  • Medical Informatics Applications
  • Software*