Learning a weighted sequence model of the nucleosome core and linker yields more accurate predictions in Saccharomyces cerevisiae and Homo sapiens.

Affiliation

Department of Electrical Engineering, University of Washington, Seattle, Washington, USA.

Publication information

PLoS Comput Biol. 2010 Jul 8;6(7):e1000834. doi: 10.1371/journal.pcbi.1000834.

DOI: 10.1371/journal.pcbi.1000834

PMID: 20628623

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC2900294/

Abstract

DNA in eukaryotes is packaged into a chromatin complex, the most basic element of which is the nucleosome. The precise positioning of the nucleosome cores allows for selective access to the DNA, and the mechanisms that control this positioning are important pieces of the gene expression puzzle. We describe a large-scale nucleosome pattern that jointly characterizes the nucleosome core and the adjacent linkers and is predominantly characterized by long-range oscillations in the mono-, di- and tri-nucleotide content of the DNA sequence, and we show that this pattern can be used to predict nucleosome positions in both Homo sapiens and Saccharomyces cerevisiae more accurately than previously published methods. Surprisingly, in both H. sapiens and S. cerevisiae, the most informative individual features are the mono-nucleotide patterns, although the inclusion of di- and tri-nucleotide features results in improved performance. Our approach combines a much longer pattern than has been previously used to predict nucleosome positioning from sequence (301 base pairs, centered at the position to be scored) with a novel discriminative classification approach that selectively weights the contributions from each of the input features. The resulting scores are relatively insensitive to local AT-content and can be used to accurately discriminate putative dyad positions from adjacent linker regions without requiring an additional dynamic programming step and without the attendant edge effects and assumptions about linker length modeling and overall nucleosome density. Our approach produces the best dyad-linker classification results published to date in H. sapiens, and outperforms two recently published models on a large set of S. cerevisiae nucleosome positions. Our results suggest that in both genomes, a comparable and relatively small fraction of nucleosomes are well-positioned and that these positions are predictable based on sequence alone. We believe that the bulk of the remaining nucleosomes follow a statistical positioning model.
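
The abstract's idea of scoring a position from a 301-base-pair window of mono-, di- and tri-nucleotide features can be illustrated with a minimal sketch. This is not the authors' classifier: the function names, the use of whole-window k-mer frequencies, and the `weights` dictionary are all assumptions made for illustration. In particular, the paper's features describe patterns along the window, whereas this simplification ignores where in the window each k-mer occurs.

```python
from collections import Counter

WINDOW = 301  # window length stated in the abstract, centered on the scored position


def extract_window(seq, center, window=WINDOW):
    """Return the window centered at `center`, or None if it runs off either end."""
    half = window // 2
    start, end = center - half, center + half + 1
    if start < 0 or end > len(seq):
        return None
    return seq[start:end].upper()


def kmer_features(window_seq, ks=(1, 2, 3)):
    """Frequencies of mono-, di- and tri-nucleotides within the window (hypothetical feature set)."""
    feats = {}
    for k in ks:
        n = len(window_seq) - k + 1  # number of overlapping k-mers
        counts = Counter(window_seq[i:i + k] for i in range(n))
        for kmer, c in counts.items():
            feats[kmer] = c / n
    return feats


def linear_score(feats, weights, bias=0.0):
    """Weighted sum of features; `weights` stands in for values learned by a trained classifier."""
    return bias + sum(weights.get(name, 0.0) * value for name, value in feats.items())
```

A position would be scored by calling `extract_window`, passing the result to `kmer_features`, and combining the features with `linear_score`; positions closer than 150 bases to either end of the sequence return `None` and are skipped.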

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f546/2900294/28d0f0abab7b/pcbi.1000834.g001.jpg

Similar articles

Prediction of nucleosome occupancy in Saccharomyces cerevisiae using position-correlation scoring function.

Genomics. 2011 Nov;98(5):359-66. doi: 10.1016/j.ygeno.2011.07.008. Epub 2011 Aug 2.

Nucleosome positioning based on the sequence word composition.

Protein Pept Lett. 2012 Jan;19(1):79-90. doi: 10.2174/092986612798472811.

An analysis and prediction of nucleosome positioning based on information content.

Chromosome Res. 2013 Mar;21(1):63-74. doi: 10.1007/s10577-013-9338-z. Epub 2013 Feb 22.

Chemical map of Schizosaccharomyces pombe reveals species-specific features in nucleosome positioning.

Proc Natl Acad Sci U S A. 2013 Dec 10;110(50):20158-63. doi: 10.1073/pnas.1315809110. Epub 2013 Nov 25.

DNA structural patterns and nucleosome positioning.

J Biomol Struct Dyn. 1994 Oct;12(2):301-25. doi: 10.1080/07391102.1994.10508742.

Structure-based analysis of DNA sequence patterns guiding nucleosome positioning in vitro.

J Biomol Struct Dyn. 2010 Jun;27(6):821-41. doi: 10.1080/073911010010524947.

A deformation energy-based model for predicting nucleosome dyads and occupancy.

Sci Rep. 2016 Apr 7;6:24133. doi: 10.1038/srep24133.

Nucleosome positioning in yeasts: methods, maps, and mechanisms.

Chromosoma. 2015 Jun;124(2):131-51. doi: 10.1007/s00412-014-0501-x. Epub 2014 Dec 23.

A genomic code for nucleosome positioning.

Nature. 2006 Aug 17;442(7104):772-8. doi: 10.1038/nature04979. Epub 2006 Jul 19.

Cited by

NucleoMap: A computational tool for identifying nucleosomes in ultra-high resolution contact maps.

PLoS Comput Biol. 2022 Jul 14;18(7):e1010265. doi: 10.1371/journal.pcbi.1010265. eCollection 2022 Jul.

Nucleosome positioning sequence patterns as packing or regulatory.

PLoS Comput Biol. 2020 Jan 27;16(1):e1007365. doi: 10.1371/journal.pcbi.1007365. eCollection 2020 Jan.

The DNA-binding protein HTa from is an archaeal histone analog.

Elife. 2019 Nov 11;8:e52542. doi: 10.7554/eLife.52542.

Evidence of selection for an accessible nucleosomal array in human.

BMC Genomics. 2016 Jul 29;17:526. doi: 10.1186/s12864-016-2880-2.

Novel nucleosomal particles containing core histones and linker DNA but no histone H1.

Nucleic Acids Res. 2016 Jan 29;44(2):573-81. doi: 10.1093/nar/gkv943. Epub 2015 Sep 22.

Prediction of nucleosome rotational positioning in yeast and human genomes based on sequence-dependent DNA anisotropy.

BMC Bioinformatics. 2014 Sep 22;15(1):313. doi: 10.1186/1471-2105-15-313.

Apoptotic lymphocytes of H. sapiens lose nucleosomes in GC-rich promoters.

PLoS Comput Biol. 2014 Jul 31;10(7):e1003760. doi: 10.1371/journal.pcbi.1003760. eCollection 2014 Jul.

Regulation of the nucleosome repeat length in vivo by the DNA sequence, protein concentrations and long-range interactions.

PLoS Comput Biol. 2014 Jul 3;10(7):e1003698. doi: 10.1371/journal.pcbi.1003698. eCollection 2014 Jul.

Conserved substitution patterns around nucleosome footprints in eukaryotes and Archaea derive from frequent nucleosome repositioning through evolution.

PLoS Comput Biol. 2013;9(11):e1003373. doi: 10.1371/journal.pcbi.1003373. Epub 2013 Nov 21.

An analysis and prediction of nucleosome positioning based on information content.

Chromosome Res. 2013 Mar;21(1):63-74. doi: 10.1007/s10577-013-9338-z. Epub 2013 Feb 22.

References

Labile H3.3+H2A.Z nucleosomes mark 'nucleosome-free regions'.

Nat Genet. 2009 Aug;41(8):865-6. doi: 10.1038/ng0809-865.

H3.3/H2A.Z double variant-containing nucleosomes mark 'nucleosome-free regions' of active promoters and other regulatory regions.

Nat Genet. 2009 Aug;41(8):941-5. doi: 10.1038/ng.409. Epub 2009 Jul 26.

Intrinsic histone-DNA interactions are not the major determinant of nucleosome positions in vivo.

Nat Struct Mol Biol. 2009 Aug;16(8):847-52. doi: 10.1038/nsmb.1636. Epub 2009 Jul 20.

Modeling interactions between adjacent nucleosomes improves genome-wide predictions of nucleosome occupancy.

Bioinformatics. 2009 Jun 15;25(12):i348-55. doi: 10.1093/bioinformatics/btp216.

Comparative analysis of H2A.Z nucleosome organization in the human and yeast genomes.

Genome Res. 2009 Jun;19(6):967-77. doi: 10.1101/gr.084830.108. Epub 2009 Feb 26.

Poly(dA:dT) tracts: major determinants of nucleosome organization.

Curr Opin Struct Biol. 2009 Feb;19(1):65-71. doi: 10.1016/j.sbi.2009.01.004. Epub 2009 Feb 7.

The DNA-encoded nucleosome organization of a eukaryotic genome.

Nature. 2009 Mar 19;458(7236):362-6. doi: 10.1038/nature07667. Epub 2008 Dec 17.

Global analysis of the insulator binding protein CTCF in chromatin barrier regions reveals demarcation of active and repressive domains.

Genome Res. 2009 Jan;19(1):24-32. doi: 10.1101/gr.082800.108. Epub 2008 Dec 3.

Identifying positioned nucleosomes with epigenetic marks in human from ChIP-Seq.

BMC Genomics. 2008 Nov 13;9:537. doi: 10.1186/1471-2164-9-537.

Distinct modes of regulation by chromatin encoded through nucleosome positioning signals.

PLoS Comput Biol. 2008 Nov;4(11):e1000216. doi: 10.1371/journal.pcbi.1000216. Epub 2008 Nov 7.
