Skip to main navigation Skip to search Skip to main content

Using electronic patient records to discover disease correlations and stratify patient cohorts

  • Francisco S Roque
  • , Peter B Jensen
  • , Henriette Schmock
  • , Marlene Dalgaard
  • , Massimo Andreatta
  • , Thomas Hansen
  • , Karen Søeby
  • , Søren Bredkjær
  • , Anders Juul
  • , Thomas Werge
  • , Lars J Jensen
  • , Søren Brunak*
  • *Corresponding author for this work

Research output: Contribution to journalArticleResearchpeer-review

Abstract

Electronic patient records remain a rather unexplored, but potentially rich data source for discovering correlations between diseases. We describe a general approach for gathering phenotypic descriptions of patients from medical records in a systematic and non-cohort dependent manner. By extracting phenotype information from the free-text in such records we demonstrate that we can extend the information contained in the structured record data, and use it for producing fine-grained patient stratification and disease co-occurrence statistics. The approach uses a dictionary based on the International Classification of Disease ontology and is therefore in principle language independent. As a use case we show how records from a Danish psychiatric hospital lead to the identification of disease correlations, which subsequently can be mapped to systems biology frameworks.

Original languageEnglish
Article numbere1002141
JournalPLoS Computational Biology
Volume7
Issue number8
DOIs
Publication statusPublished - Aug 2011

Keywords

  • Cluster Analysis
  • Cohort Studies
  • Comorbidity
  • Computational Biology/methods
  • Data Collection/methods
  • Data Mining/methods
  • Electronic Health Records
  • Humans
  • International Classification of Diseases
  • Reproducibility of Results

Fingerprint

Explore the research areas of 'Using electronic patient records to discover disease correlations and stratify patient cohorts'.

Cite this