Abstract
Electronic patient records remain a rather unexplored, but potentially rich data source for discovering correlations between diseases. We describe a general approach for gathering phenotypic descriptions of patients from medical records in a systematic and non-cohort dependent manner. By extracting phenotype information from the free-text in such records we demonstrate that we can extend the information contained in the structured record data, and use it for producing fine-grained patient stratification and disease co-occurrence statistics. The approach uses a dictionary based on the International Classification of Disease ontology and is therefore in principle language independent. As a use case we show how records from a Danish psychiatric hospital lead to the identification of disease correlations, which subsequently can be mapped to systems biology frameworks.
| Original language | English |
|---|---|
| Article number | e1002141 |
| Journal | PLoS Computational Biology |
| Volume | 7 |
| Issue number | 8 |
| DOIs | |
| Publication status | Published - Aug 2011 |
Keywords
- Cluster Analysis
- Cohort Studies
- Comorbidity
- Computational Biology/methods
- Data Collection/methods
- Data Mining/methods
- Electronic Health Records
- Humans
- International Classification of Diseases
- Reproducibility of Results
Fingerprint
Explore the research areas of 'Using electronic patient records to discover disease correlations and stratify patient cohorts'.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver