A System for Phenotype Harmonization in the National Heart, Lung, and Blood Institute Trans-Omics for Precision Medicine (TOPMed) Program.

Title: A System for Phenotype Harmonization in the National Heart, Lung, and Blood Institute Trans-Omics for Precision Medicine (TOPMed) Program.
Publication Type: Journal Article
Year of Publication: 2021
Authors: Stilp AM, Emery LS, Broome JG, Buth EJ, Khan AT, Laurie CA, Wang FF, Wong Q, Chen D, D'Augustine CM, Heard-Costa NL, Hohensee CR, Johnson WC, Juarez LD, Liu J, Mutalik KM, Raffield LM, Wiggins KL, de Vries PS, Kelly TN, Kooperberg C, Natarajan P, Peloso GM, Peyser PA, Reiner AP, Arnett DK, Aslibekyan S, Barnes KC, Bielak LF, Bis JC, Cade BE, Chen M-H, Correa A, Cupples LA, de Andrade M, Ellinor PT, Fornage M, Franceschini N, Gan W, Ganesh SK, Graffelman J, Grove ML, Guo X, Hawley NL, Hsu W-L, Jackson RD, Jaquish CE, Johnson AD, Kardia SLR, Kelly S, Lee J, Mathias RA, McGarvey ST, Mitchell BD, Montasser ME, Morrison AC, North KE, Nouraie SM, Oelsner EC, Pankratz N, Rich SS, Rotter JI, Smith JA, Taylor KD, Vasan RS, Weeks DE, Weiss ST, Wilson CG, Yanek LR, Psaty BM, Heckbert SR, Laurie CC
Journal: Am J Epidemiol
Volume: 190
Issue: 10
Pagination: 1977-1992
Date Published: 2021 Oct 01
ISSN: 1476-6256
Abstract

Genotype-phenotype association studies often combine phenotype data from multiple studies to increase statistical power. Harmonization of the data usually requires substantial effort due to heterogeneity in phenotype definitions, study design, data collection procedures, and data-set organization. Here we describe a centralized system for phenotype harmonization that includes input from phenotype domain and study experts, quality control, documentation, reproducible results, and data-sharing mechanisms. This system was developed for the National Heart, Lung, and Blood Institute's Trans-Omics for Precision Medicine (TOPMed) program, which is generating genomic and other -omics data for more than 80 studies with extensive phenotype data. To date, 63 phenotypes have been harmonized across thousands of participants (recruited in 1948-2012) from up to 17 studies per phenotype. Here we discuss challenges in this undertaking and how they were addressed. The harmonized phenotype data and associated documentation have been submitted to National Institutes of Health data repositories for controlled access by the scientific community. We also provide materials to facilitate future harmonization efforts by the community, which include 1) the software code used to generate the 63 harmonized phenotypes, enabling others to reproduce, modify, or extend these harmonizations to additional studies, and 2) the results of labeling thousands of phenotype variables with controlled vocabulary terms.

DOI: 10.1093/aje/kwab115
Alternate Journal: Am J Epidemiol
PubMed ID: 33861317
PubMed Central ID: PMC8485147
Grant List:
75N95020D00003 / DA / NIDA NIH HHS / United States
N01HC65236 / HL / NHLBI NIH HHS / United States
N01HC65235 / HL / NHLBI NIH HHS / United States
75N95020D00004 / DA / NIDA NIH HHS / United States
R01 HL095080 / HL / NHLBI NIH HHS / United States
HHSN268201500003C / HL / NHLBI NIH HHS / United States
75N90020D00002 / CL / CLC NIH HHS / United States
HHSN268201800012C / HL / NHLBI NIH HHS / United States
R01 HL130733 / HL / NHLBI NIH HHS / United States
R01 HL104135 / HL / NHLBI NIH HHS / United States
N01HC95160 / HL / NHLBI NIH HHS / United States
R01 HL120393 / HL / NHLBI NIH HHS / United States
R01 HL046380 / HL / NHLBI NIH HHS / United States
K24 HL105780 / HL / NHLBI NIH HHS / United States
R03 HL141439 / HL / NHLBI NIH HHS / United States
U54 HG003067 / HG / NHGRI NIH HHS / United States
R01 HL083141 / HL / NHLBI NIH HHS / United States
R01 HL121007 / HL / NHLBI NIH HHS / United States
HHSN268201600002C / HL / NHLBI NIH HHS / United States
HHSN268201800004I / HL / NHLBI NIH HHS / United States
N01HC95163 / HL / NHLBI NIH HHS / United States
U01 HL080295 / HL / NHLBI NIH HHS / United States
HHSN268201500001C / HL / NHLBI NIH HHS / United States
UL1 TR001079 / TR / NCATS NIH HHS / United States
75N96020D00002 / ES / NIEHS NIH HHS / United States
HHSN268201600018C / HL / NHLBI NIH HHS / United States
R01 HL122684 / HL / NHLBI NIH HHS / United States
R01 HL092577 / HL / NHLBI NIH HHS / United States
R21 HL140385 / HL / NHLBI NIH HHS / United States
R01 HL068986 / HL / NHLBI NIH HHS / United States
75N93020D00002 / AI / NIAID NIH HHS / United States
U01 HL130114 / HL / NHLBI NIH HHS / United States
R01 HL087660 / HL / NHLBI NIH HHS / United States
U10 HL054464 / HL / NHLBI NIH HHS / United States
HHSN268200800007C / HL / NHLBI NIH HHS / United States
R01 HL085251 / HL / NHLBI NIH HHS / United States
R01 HL066216 / HL / NHLBI NIH HHS / United States
N01HC95169 / HL / NHLBI NIH HHS / United States
U01 HL120393 / HL / NHLBI NIH HHS / United States
R01 HL113338 / HL / NHLBI NIH HHS / United States
R01 DK117445 / DK / NIDDK NIH HHS / United States
75N95020D00002 / DA / NIDA NIH HHS / United States
75N99020D00003 / OF / ORFDO NIH HHS / United States
U01 HL089897 / HL / NHLBI NIH HHS / United States
N01HC95164 / HL / NHLBI NIH HHS / United States
N01HC55222 / HL / NHLBI NIH HHS / United States
N02HL64278 / HL / NHLBI NIH HHS / United States
HHSN268201800014C / HL / NHLBI NIH HHS / United States
R01 HL128914 / HL / NHLBI NIH HHS / United States
R01 HL139672 / HL / NHLBI NIH HHS / United States
N01HC85086 / HL / NHLBI NIH HHS / United States
N01HC65234 / HL / NHLBI NIH HHS / United States
N01HC95162 / HL / NHLBI NIH HHS / United States
U01 HL054464 / HL / NHLBI NIH HHS / United States
R01 HL119443 / HL / NHLBI NIH HHS / United States
N01HC95168 / HL / NHLBI NIH HHS / United States
R37 HL066289 / HL / NHLBI NIH HHS / United States
U01 HL089856 / HL / NHLBI NIH HHS / United States
75N90020D00003 / CL / CLC NIH HHS / United States
75N96020D00003 / ES / NIEHS NIH HHS / United States
U10 HL054457 / HL / NHLBI NIH HHS / United States
R01 HL142711 / HL / NHLBI NIH HHS / United States
R35 HL135818 / HL / NHLBI NIH HHS / United States
HHSN268201800003I / HL / NHLBI NIH HHS / United States
U10 HL054481 / HL / NHLBI NIH HHS / United States
P30 DK063491 / DK / NIDDK NIH HHS / United States
U01 HL072524 / HL / NHLBI NIH HHS / United States
75N99020D00002 / OF / ORFDO NIH HHS / United States
HHSN268201700002C / HL / NHLBI NIH HHS / United States
N01HC65233 / HL / NHLBI NIH HHS / United States
HHSN268201800007I / HL / NHLBI NIH HHS / United States
HHSN268201200036C / HL / NHLBI NIH HHS / United States
HHSN268201800001C / HL / NHLBI NIH HHS / United States
HHSN268201700001I / HL / NHLBI NIH HHS / United States
HHSN268201800013I / MD / NIMHD NIH HHS / United States
75N99020D00006 / OF / ORFDO NIH HHS / United States
N01HC65237 / HL / NHLBI NIH HHS / United States
HHSN268201600003C / HL / NHLBI NIH HHS / United States
U01 HL054457 / HL / NHLBI NIH HHS / United States
HHSN268201700004I / HL / NHLBI NIH HHS / United States
N01HC95165 / HL / NHLBI NIH HHS / United States
N01HC95159 / HL / NHLBI NIH HHS / United States
HHSN268201500001I / HL / NHLBI NIH HHS / United States
R21 HL129924 / HL / NHLBI NIH HHS / United States
75N95020D00007 / DA / NIDA NIH HHS / United States
N01HC95161 / HL / NHLBI NIH HHS / United States
UL1 TR001420 / TR / NCATS NIH HHS / United States
75N95020D00005 / DA / NIDA NIH HHS / United States
HHSN268201600004C / HL / NHLBI NIH HHS / United States
U01 HL072515 / HL / NHLBI NIH HHS / United States
HHSN268201800011C / HL / NHLBI NIH HHS / United States
KL2 RR024990 / RR / NCRR NIH HHS / United States
HHSN268201500003I / HL / NHLBI NIH HHS / United States
HHSN268201600001C / HL / NHLBI NIH HHS / United States
75N92021D00006 / HL / NHLBI NIH HHS / United States
R01 HL093093 / HL / NHLBI NIH HHS / United States
HHSN268201700005C / HL / NHLBI NIH HHS / United States
HHSN268201700001C / HL / NHLBI NIH HHS / United States
N01HC85082 / HL / NHLBI NIH HHS / United States
75N99020D00005 / OF / ORFDO NIH HHS / United States
N01HC95167 / HL / NHLBI NIH HHS / United States
HHSN268201700003C / HL / NHLBI NIH HHS / United States
75N99020D00007 / OF / ORFDO NIH HHS / United States
N01HC85083 / HL / NHLBI NIH HHS / United States
U01 HG004735 / HG / NHGRI NIH HHS / United States
N01HC25195 / HL / NHLBI NIH HHS / United States
R01 HL085571 / HL / NHLBI NIH HHS / United States
HHSN268201800015I / HB / NHLBI NIH HHS / United States
U01 HL054481 / HL / NHLBI NIH HHS / United States
75N92019D00031 / HL / NHLBI NIH HHS / United States
R01 MD012765 / MD / NIMHD NIH HHS / United States
HHSN268201700004C / HL / NHLBI NIH HHS / United States
UL1 TR000040 / TR / NCATS NIH HHS / United States
HHSN268201700002I / HL / NHLBI NIH HHS / United States
HHSN268201700005I / HL / NHLBI NIH HHS / United States
75N98020D00007 / OD / NIH HHS / United States
R01 HL117626 / HL / NHLBI NIH HHS / United States
U01 HG006379 / HG / NHGRI NIH HHS / United States
N01HC85079 / HL / NHLBI NIH HHS / United States
N01HC95166 / HL / NHLBI NIH HHS / United States
K23 HL130627 / HL / NHLBI NIH HHS / United States
R01 AG023629 / AG / NIA NIH HHS / United States
UL1 TR001881 / TR / NCATS NIH HHS / United States
R01 HL073410 / HL / NHLBI NIH HHS / United States
HHSN268201800005I / HL / NHLBI NIH HHS / United States
N01HC85080 / HL / NHLBI NIH HHS / United States
P01 HL132825 / HL / NHLBI NIH HHS / United States
HHSN268201700003I / HL / NHLBI NIH HHS / United States
HHSN268201800006I / HL / NHLBI NIH HHS / United States
R01 AG018728 / AG / NIA NIH HHS / United States
75N99020D00004 / OF / ORFDO NIH HHS / United States
K01 HL135405 / HL / NHLBI NIH HHS / United States
N01HC85081 / HL / NHLBI NIH HHS / United States