染色质可及性数据集显示出由于 DNase I 酶的序列特异性而产生的偏差。

Chromatin accessibility data sets show bias due to sequence specificity of the DNase I enzyme.

机构信息

Wellcome Trust Sanger Institute, Hinxton, Cambridge, United Kingdom.

出版信息

PLoS One. 2013 Jul 26;8(7):e69853. doi: 10.1371/journal.pone.0069853. Print 2013.

DOI:10.1371/journal.pone.0069853

PMID:23922824

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3724795/

Abstract

BACKGROUND

DNase I is an enzyme which cuts duplex DNA at a rate that depends strongly upon its chromatin environment. In combination with high-throughput sequencing (HTS) technology, it can be used to infer genome-wide landscapes of open chromatin regions. Using this technology, systematic identification of hundreds of thousands of DNase I hypersensitive sites (DHS) per cell type has been possible, and this in turn has helped to precisely delineate genomic regulatory compartments. However, to date there has been relatively little investigation into possible biases affecting this data.

RESULTS

We report a significant degree of sequence preference spanning sites cut by DNase I in a number of published data sets. The two major protocols in current use each show a different pattern, but for a given protocol the pattern of sequence specificity seems to be quite consistent. The patterns are substantially different from biases seen in other types of HTS data sets, and in some cases the most constrained position lies outside the sequenced fragment, implying that this constraint must relate to the digestion process rather than events occurring during library preparation or sequencing.

CONCLUSIONS

DNase I is a sequence-specific enzyme, with a specificity that may depend on experimental conditions. This sequence specificity is not taken into account by existing pipelines for identifying open chromatin regions. Care must be taken when interpreting DNase I results, especially when looking at the precise locations of the reads. Future studies may be able to improve the sensitivity and precision of chromatin state measurement by compensating for sequence bias.

摘要

背景

DNase I 是一种在很大程度上依赖其染色质环境的酶，可切割双链 DNA。与高通量测序 (HTS) 技术结合使用，它可用于推断开放染色质区域的全基因组景观。使用这项技术，已经可以对每一种细胞类型进行数以十万计的 DNase I 超敏位点 (DHS) 的系统识别，这反过来又有助于精确划定基因组调控区。然而，迄今为止，对可能影响这些数据的偏差的研究相对较少。

结果

我们报告了在许多已发表的数据集，DNase I 切割位点的序列偏好程度存在显著差异。目前使用的两种主要方案各显示出不同的模式，但对于给定的方案，序列特异性模式似乎非常一致。这些模式与其他类型的 HTS 数据集中的偏差有很大不同，在某些情况下，受限制最大的位置位于测序片段之外，这意味着这种限制必须与消化过程有关，而不是与文库制备或测序过程中发生的事件有关。

结论

DNase I 是一种序列特异性酶，其特异性可能取决于实验条件。现有的识别开放染色质区域的管道并未考虑到这种序列特异性。在解释 DNase I 结果时必须谨慎，尤其是在查看读取的确切位置时。未来的研究可能能够通过补偿序列偏差来提高染色质状态测量的灵敏度和精度。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4876/3724795/93b4ef80d799/pone.0069853.g001.jpg

相似文献

Chromatin accessibility data sets show bias due to sequence specificity of the DNase I enzyme.染色质可及性数据集显示出由于 DNase I 酶的序列特异性而产生的偏差。

PLoS One. 2013 Jul 26;8(7):e69853. doi: 10.1371/journal.pone.0069853. Print 2013.

DNase-seq to Study Chromatin Accessibility in Early Embryos.利用DNase测序技术研究早期胚胎中的染色质可及性

Cold Spring Harb Protoc. 2019 Apr 1;2019(4):pdb.prot098335. doi: 10.1101/pdb.prot098335.

Genome-wide mapping of DNase I hypersensitive sites in plants.植物中DNase I超敏位点的全基因组图谱绘制

Methods Mol Biol. 2015;1284:71-89. doi: 10.1007/978-1-4939-2444-8_4.

Genome-Wide Mapping of DNase I Hypersensitive Sites in Tomato.番茄中DNase I超敏位点的全基因组图谱绘制

Methods Mol Biol. 2018;1830:367-379. doi: 10.1007/978-1-4939-8657-6_22.

Genome-scale mapping of DNase I hypersensitivity.DNA酶I超敏位点的全基因组图谱绘制。

Curr Protoc Mol Biol. 2013 Jul;Chapter 27:Unit 21.27. doi: 10.1002/0471142727.mb2127s103.

DNase I SIM: A Simplified In-Nucleus Method for DNase I Hypersensitive Site Sequencing.DNase I SIM：一种用于DNase I超敏位点测序的简化细胞核内方法。

Methods Mol Biol. 2017;1629:141-154. doi: 10.1007/978-1-4939-7125-1_10.

Mapping nucleosome positions using DNase-seq.利用DNA酶测序法绘制核小体位置图谱。

Genome Res. 2016 Mar;26(3):351-64. doi: 10.1101/gr.195602.115. Epub 2016 Jan 15.

The 'dark matter' in the plant genomes: non-coding and unannotated DNA sequences associated with open chromatin.植物基因组中的“暗物质”：与开放染色质相关的非编码和未注释DNA序列

Curr Opin Plant Biol. 2015 Apr;24:17-23. doi: 10.1016/j.pbi.2015.01.005. Epub 2015 Jan 24.

Isolation of nuclei for use in genome-wide DNase hypersensitivity assays to probe chromatin structure.分离用于全基因组DNA酶超敏感性分析以探测染色质结构的细胞核。

Methods Mol Biol. 2013;977:13-9. doi: 10.1007/978-1-62703-284-1_2.

Genomic Footprinting Analyses from DNase-seq Data to Construct Gene Regulatory Networks.从 DNase-seq 数据进行基因组足迹分析以构建基因调控网络。

Methods Mol Biol. 2021;2328:25-46. doi: 10.1007/978-1-0716-1534-8_3.

引用本文的文献

ChromBPNet: bias factorized, base-resolution deep learning models of chromatin accessibility reveal cis-regulatory sequence syntax, transcription factor footprints and regulatory variants.ChromBPNet：染色质可及性的偏差分解、碱基分辨率深度学习模型揭示顺式调控序列语法、转录因子足迹和调控变异体

bioRxiv. 2025 Jan 8:2024.12.25.630221. doi: 10.1101/2024.12.25.630221.

Emerging Approaches to Profile Accessible Chromatin from Formalin-Fixed Paraffin-Embedded Sections.从福尔马林固定石蜡包埋切片中分析可及染色质的新方法

Epigenomes. 2024 May 12;8(2):20. doi: 10.3390/epigenomes8020020.

Multimodal learning of noncoding variant effects using genome sequence and chromatin structure.使用基因组序列和染色质结构进行非编码变异效应的多模态学习。

Bioinformatics. 2023 Sep 2;39(9). doi: 10.1093/bioinformatics/btad541.

Correction of transposase sequence bias in ATAC-seq data with rule ensemble modeling.使用规则集成模型校正ATAC-seq数据中转座酶序列偏差

NAR Genom Bioinform. 2023 Jun 2;5(2):lqad054. doi: 10.1093/nargab/lqad054. eCollection 2023 Jun.

Intestinal bacteria and colorectal cancer: etiology and treatment.肠道细菌与结直肠癌：病因与治疗。

Gut Microbes. 2023 Jan-Dec;15(1):2185028. doi: 10.1080/19490976.2023.2185028.

Immaturin-Nuclease as a Model System for a Gene-Programmed Sexual Development and Rejuvenescence in Life History.未成熟核酸酶作为生命史中基因编程性发育和年轻化的模型系统。

Microorganisms. 2022 Dec 28;11(1):82. doi: 10.3390/microorganisms11010082.

Chromatin accessibility profiling by ATAC-seq.染色质可及性分析的 ATAC-seq 技术。

Nat Protoc. 2022 Jun;17(6):1518-1552. doi: 10.1038/s41596-022-00692-9. Epub 2022 Apr 27.

Comprehensive understanding of Tn5 insertion preference improves transcription regulatory element identification.对Tn5插入偏好的全面理解有助于改善转录调控元件的识别。

NAR Genom Bioinform. 2021 Oct 27;3(4):lqab094. doi: 10.1093/nargab/lqab094. eCollection 2021 Dec.

Whole-Genome Analysis Reveals That the Nucleoid Protein IHF Predominantly Binds to the Replication Origin Specifically at the Time of Initiation.全基因组分析表明，类核蛋白IHF主要在起始时特异性结合于复制起点。

Front Microbiol. 2021 Aug 12;12:697712. doi: 10.3389/fmicb.2021.697712. eCollection 2021.

Nascent RNA scaffolds contribute to chromosome territory architecture and counter chromatin compaction.初生 RNA 支架有助于染色体域结构并对抗染色质的紧缩。

Mol Cell. 2021 Sep 2;81(17):3509-3525.e5. doi: 10.1016/j.molcel.2021.07.004. Epub 2021 Jul 27.

本文引用的文献

Chromatin accessibility reveals insights into androgen receptor activation and transcriptional specificity.染色质可及性揭示了雄激素受体激活和转录特异性的相关见解。

Genome Biol. 2012 Oct 3;13(10):R88. doi: 10.1186/gb-2012-13-10-r88.

Predicting cell-type-specific gene expression from regions of open chromatin.从开放染色质区域预测细胞类型特异性基因表达。

Genome Res. 2012 Sep;22(9):1711-22. doi: 10.1101/gr.135129.111.

An expansive human regulatory lexicon encoded in transcription factor footprints.转录因子足迹中编码的广泛人类调控词汇。

Nature. 2012 Sep 6;489(7414):83-90. doi: 10.1038/nature11212.

The accessible chromatin landscape of the human genome.人类基因组的可及染色质景观。

Nature. 2012 Sep 6;489(7414):75-82. doi: 10.1038/nature11232.

An integrated encyclopedia of DNA elements in the human genome.人类基因组中 DNA 元件的综合百科全书。

Nature. 2012 Sep 6;489(7414):57-74. doi: 10.1038/nature11247.

The NIH Roadmap Epigenomics Program data resource.NIH 路线图表观基因组学计划数据资源。

Epigenomics. 2012 Jun;4(3):317-24. doi: 10.2217/epi.12.18.

A new strategy to reduce allelic bias in RNA-Seq readmapping.一种减少 RNA-Seq 读段比对中等位基因偏倚的新策略。

Nucleic Acids Res. 2012 Sep;40(16):e127. doi: 10.1093/nar/gks425. Epub 2012 May 14.

Differential DNase I hypersensitivity reveals factor-dependent chromatin dynamics.差异性 DNase I 超敏性揭示了依赖因子的染色质动力学。

Genome Res. 2012 Jun;22(6):1015-25. doi: 10.1101/gr.133280.111. Epub 2012 Apr 16.

Open chromatin defined by DNaseI and FAIRE identifies regulatory elements that shape cell-type identity.由 DNaseI 和 FAIRE 定义的开放染色质可识别出塑造细胞类型特征的调控元件。

Genome Res. 2011 Oct;21(10):1757-67. doi: 10.1101/gr.121541.111. Epub 2011 Jul 12.

Systematic bias in high-throughput sequencing data and its correction by BEADS.高通量测序数据中的系统偏差及其通过 BEADS 的校正。

Nucleic Acids Res. 2011 Aug;39(15):e103. doi: 10.1093/nar/gkr425. Epub 2011 Jun 6.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

染色质可及性数据集显示出由于 DNase I 酶的序列特异性而产生的偏差。

Chromatin accessibility data sets show bias due to sequence specificity of the DNase I enzyme.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献