Suppr超能文献

分析来自高度分化基因型的全基因组亚硫酸氢盐测序数据。

Analyzing whole genome bisulfite sequencing data from highly divergent genotypes.

机构信息

Center for Epigenetics, Johns Hopkins School of Medicine, 855 N. Wolfe St, Baltimore, MD 21205, USA.

Department of Computer Science, Johns Hopkins University, 3400 N. Charles St, Baltimore, MD 21218, USA.

出版信息

Nucleic Acids Res. 2019 Nov 4;47(19):e117. doi: 10.1093/nar/gkz674.

Abstract

In the study of DNA methylation, genetic variation between species, strains or individuals can result in CpG sites that are exclusive to a subset of samples, and insertions and deletions can rearrange the spatial distribution of CpGs. How to account for this variation in an analysis of the interplay between sequence variation and DNA methylation is not well understood, especially when the number of CpG differences between samples is large. Here, we use whole-genome bisulfite sequencing data on two highly divergent mouse strains to study this problem. We show that alignment to personal genomes is necessary for valid methylation quantification. We introduce a method for including strain-specific CpGs in differential analysis, and show that this increases power. We apply our method to a human normal-cancer dataset, and show this improves accuracy and power, illustrating the broad applicability of our approach. Our method uses smoothing to impute methylation levels at strain-specific sites, thereby allowing strain-specific CpGs to contribute to the analysis, while accounting for differences in the spatial occurrences of CpGs. Our results have implications for joint analysis of genetic variation and DNA methylation using bisulfite-converted DNA, and unlocks the use of personal genomes for addressing this question.

摘要

在 DNA 甲基化研究中,物种、品系或个体之间的遗传变异可能导致 CpG 位点仅存在于一部分样本中,并且插入和缺失会重新排列 CpG 的空间分布。在分析序列变异与 DNA 甲基化之间的相互作用时,如何解释这种变异尚不清楚,尤其是当样本之间的 CpG 差异数量很大时。在这里,我们使用两种高度分化的小鼠品系的全基因组亚硫酸氢盐测序数据来研究这个问题。我们表明,对齐个人基因组对于有效的甲基化定量是必要的。我们引入了一种在差异分析中包含品系特异性 CpG 的方法,并表明这可以提高功效。我们将我们的方法应用于人类正常-癌症数据集,并表明这提高了准确性和功效,说明了我们方法的广泛适用性。我们的方法使用平滑来推断品系特异性位点的甲基化水平,从而允许品系特异性 CpG 参与分析,同时考虑 CpG 空间出现的差异。我们的结果对使用亚硫酸氢盐转化 DNA 进行遗传变异和 DNA 甲基化的联合分析具有启示意义,并为使用个人基因组解决这个问题开辟了道路。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e59d/6821270/6b4c499073e9/gkz674fig1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验