Suppr超能文献

在没有iDat文件的情况下,利用家系中的相关结构和重复测量对Illumina 450K甲基化数据进行质量控制。

Quality control for Illumina 450K methylation data in the absence of iDat files using correlation structure in pedigrees and repeated measures.

作者信息

LeBlanc Marissa, Nustad Haakon E, Zucknick Manuela, Page Christian M

机构信息

Oslo Centre for Biostatistics and Epidemiology, Oslo University Hospital, Klaus Torgårds vei 3, 0372, Oslo, Norway.

Department of Medical Genetics, Oslo University Hospital, Kirkeveien 166, 0450, Oslo, Norway.

出版信息

BMC Genet. 2018 Sep 17;19(Suppl 1):66. doi: 10.1186/s12863-018-0636-5.

Abstract

BACKGROUND

An important feature in many genomic studies is quality control and normalization. This is particularly important when analyzing epigenetic data, where the process of obtaining measurements can be bias prone. The GAW20 data was from the Genetics of Lipid Lowering Drugs and Diet Network (GOLDN), a study with multigeneration families, where DNA cytosine-phosphate-guanine (CpG) methylation was measured pre- and posttreatment with fenofibrate. We performed quality control assessment of the GAW20 DNA methylation data, including normalization, assessment of batch effects and detection of sample swaps.

RESULTS

We show that even after normalization, the GOLDN methylation data has systematic differences pre- and posttreatment. Through investigation of (a) CpGs sites containing a single nucleotide polymorphism, (b) the stability of breeding values for methylation across time points, and (c) autosomal gender-associated CpGs, 13 sample swaps were detected, 11 of which were posttreatment.

CONCLUSIONS

This paper demonstrates several ways to perform quality control of methylation data in the absence of raw data files and highlights the importance of normalization and quality control of the GAW20 methylation data from the GOLDN study.

摘要

背景

许多基因组研究中的一个重要特征是质量控制和标准化。在分析表观遗传数据时,这一点尤为重要,因为获取测量值的过程可能容易产生偏差。GAW20数据来自脂质降低药物和饮食网络遗传学(GOLDN)研究,该研究涉及多代家庭,在非诺贝特治疗前后测量了DNA胞嘧啶-磷酸-鸟嘌呤(CpG)甲基化。我们对GAW20 DNA甲基化数据进行了质量控制评估,包括标准化、批次效应评估和样本交换检测。

结果

我们表明,即使经过标准化,GOLDN甲基化数据在治疗前后仍存在系统差异。通过对(a)包含单核苷酸多态性的CpG位点、(b)甲基化育种值随时间点的稳定性以及(c)常染色体性别相关CpG的研究,检测到13个样本交换,其中11个是治疗后发生的。

结论

本文展示了在没有原始数据文件的情况下对甲基化数据进行质量控制的几种方法,并强调了对GOLDN研究中GAW20甲基化数据进行标准化和质量控制的重要性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ed67/6156833/71322f400a7d/12863_2018_636_Fig1_HTML.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验