Title | Evaluation of microarray-based DNA methylation measurement using technical replicates: the Atherosclerosis Risk In Communities (ARIC) Study. |
Publication Type | Journal Article |
Year of Publication | 2014 |
Authors | Bose M, Wu C, Pankow JS, Demerath EW, Bressler J, Fornage M, Grove ML, Mosley TH, Hicks C, North K, Kao WHong, Zhang Y, Boerwinkle E |
Secondary Authors | Guan W |
Journal | BMC Bioinformatics |
Volume | 15 |
Pagination | 312 |
Date Published | 2014 Sep 19 |
ISSN | 1471-2105 |
Keywords | Atherosclerosis, Cluster Analysis, CpG Islands, DNA Methylation, Epigenomics, Female, Genetic Predisposition to Disease, Humans, Male, Middle Aged, Oligonucleotide Array Sequence Analysis, Reproducibility of Results, Residence Characteristics |
Abstract | BACKGROUND: DNA methylation is a widely studied epigenetic phenomenon; alterations in methylation patterns influence human phenotypes and risk of disease. As part of the Atherosclerosis Risk in Communities (ARIC) study, the Illumina Infinium HumanMethylation450 (HM450) BeadChip was used to measure DNA methylation in peripheral blood obtained from ~3000 African American study participants. Over 480,000 cytosine-guanine (CpG) dinucleotide sites were surveyed on the HM450 BeadChip. To evaluate the impact of technical variation, 265 technical replicates from 130 participants were included in the study. RESULTS: For each CpG site, we calculated the intraclass correlation coefficient (ICC) to compare variation of methylation levels within- and between-replicate pairs, ranging between 0 and 1. We modeled the distribution of ICC as a mixture of censored or truncated normal and normal distributions using an EM algorithm. The CpG sites were clustered into low- and high-reliability groups, according to the calculated posterior probabilities. We also demonstrated the performance of this clustering when applied to a study of association between methylation levels and smoking status of individuals. For the CpG sites showing genome-wide significant association with smoking status, most (~96%) were seen from sites in the high reliability cluster. CONCLUSIONS: We suggest that CpG sites with low ICC may be excluded from subsequent association analyses, or extra caution needs to be taken for associations at such sites. |
DOI | 10.1186/1471-2105-15-312 |
Alternate Journal | BMC Bioinformatics |
PubMed ID | 25239148 |
PubMed Central ID | PMC4180315 |
Grant List | HHSN268201100012C / HL / NHLBI NIH HHS / United States RC2 HL102419 / HL / NHLBI NIH HHS / United States HHSN268201100009I / HL / NHLBI NIH HHS / United States 5RC2HL102419 / HL / NHLBI NIH HHS / United States HHSN268201100010C / HL / NHLBI NIH HHS / United States HHSN268201100008C / HL / NHLBI NIH HHS / United States HHSN268201100005G / HL / NHLBI NIH HHS / United States HHSN268201100008I / HL / NHLBI NIH HHS / United States HHSN268201100005C / / PHS HHS / United States HHSN268201100007C / HL / NHLBI NIH HHS / United States HHSN268201100009C / / PHS HHS / United States HHSN268201100011I / HL / NHLBI NIH HHS / United States HHSN268201100011C / HL / NHLBI NIH HHS / United States HHSN268201100010C / / PHS HHS / United States HHSN268201100006C / HL / NHLBI NIH HHS / United States HHSN268201100008C / / PHS HHS / United States HHSN268201100012C / / PHS HHS / United States HHSN268201100005I / HL / NHLBI NIH HHS / United States HHSN268201100007C / / PHS HHS / United States HHSN268201100009C / HL / NHLBI NIH HHS / United States HHSN268201100011C / / PHS HHS / United States HHSN268201100005C / HL / NHLBI NIH HHS / United States HHSN268201100007I / HL / NHLBI NIH HHS / United States HHSN268201100006C / / PHS HHS / United States |