Abstract

Electronic health records (EHRs) are a rich source of data for researchers, but extracting meaningful information out of this highly complex data source is challenging. Phecodes represent one strategy for defining phenotypes for research using EHR data. They are a high-throughput phenotyping tool based on ICD (International Classification of Diseases) codes that can be used to rapidly define the case/control status of thousands of clinically meaningful diseases and conditions. Phecodes were originally developed to conduct phenome-wide association studies to scan for phenotypic associations with common genetic variants. Since then, phecodes have been used to support a wide range of EHR-based phenotyping methods, including the phenotype risk score. This review aims to comprehensively describe the development, validation, and applications of phecodes and suggest some future directions for phecodes and high-throughput phenotyping.

Download full-text PDF

Link Source
Download Source 1https://www.annualreviews.org/doi/10.1146/annurev-biodatasci-122320-112352Web Search
Download Source 2http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9307256PMC
Download Source 3http://dx.doi.org/10.1146/annurev-biodatasci-122320-112352DOI Listing

Publication Analysis

Top Keywords

electronic health
8
high-throughput phenotyping
8
phecodes
6
phecodes electronic
4
health record
4
record phewas
4
phewas phers
4
phers electronic
4
health records
4
records ehrs
4