Suppr超能文献

惠灵顿:一种从 DNase-seq 数据中准确识别数字基因组足迹的新方法。

Wellington: a novel method for the accurate identification of digital genomic footprints from DNase-seq data.

机构信息

Warwick Systems Biology Centre, University of Warwick, Coventry, CV4 7AL, United Kingdom, School of Cancer Sciences, Institute of Biomedical Research, College of Medical and Dental Sciences, University of Birmingham, Birmingham, B15 2TT, United Kingdom, Department of Statistics, University of Warwick, Coventry, CV4 7AL, United Kingdom and School of Immunity and Infection, Institute of Biomedical Research, College of Medical and Dental Sciences, University of Birmingham, Birmingham, B15 2TT, United Kingdom.

出版信息

Nucleic Acids Res. 2013 Nov;41(21):e201. doi: 10.1093/nar/gkt850. Epub 2013 Sep 25.

Abstract

The expression of eukaryotic genes is regulated by cis-regulatory elements such as promoters and enhancers, which bind sequence-specific DNA-binding proteins. One of the great challenges in the gene regulation field is to characterise these elements. This involves the identification of transcription factor (TF) binding sites within regulatory elements that are occupied in a defined regulatory context. Digestion with DNase and the subsequent analysis of regions protected from cleavage (DNase footprinting) has for many years been used to identify specific binding sites occupied by TFs at individual cis-elements with high resolution. This methodology has recently been adapted for high-throughput sequencing (DNase-seq). In this study, we describe an imbalance in the DNA strand-specific alignment information of DNase-seq data surrounding protein-DNA interactions that allows accurate prediction of occupied TF binding sites. Our study introduces a novel algorithm, Wellington, which considers the imbalance in this strand-specific information to efficiently identify DNA footprints. This algorithm significantly enhances specificity by reducing the proportion of false positives and requires significantly fewer predictions than previously reported methods to recapitulate an equal amount of ChIP-seq data. We also provide an open-source software package, pyDNase, which implements the Wellington algorithm to interface with DNase-seq data and expedite analyses.

摘要

真核基因的表达受顺式调控元件(如启动子和增强子)的调控,这些元件与序列特异性 DNA 结合蛋白结合。基因调控领域的一个重大挑战是对这些元件进行特征描述。这涉及到鉴定在特定调控环境中被占据的调节元件内的转录因子(TF)结合位点。用 DNA 酶消化,然后分析被保护免受切割的区域(DNase 足迹法)多年来一直用于以高分辨率鉴定单个顺式元件上 TF 占据的特定结合位点。该方法最近已被用于高通量测序(DNase-seq)。在这项研究中,我们描述了围绕蛋白-DNA 相互作用的 DNase-seq 数据中 DNA 链特异性比对信息的不平衡,这使得准确预测被占据的 TF 结合位点成为可能。我们的研究引入了一种新的算法 Wellington,该算法考虑了这种链特异性信息的不平衡,从而有效地识别 DNA 足迹。与之前报道的方法相比,该算法通过减少假阳性的比例显著提高了特异性,并且需要更少的预测来重现等量的 ChIP-seq 数据。我们还提供了一个开源软件包 pyDNase,它实现了 Wellington 算法,以与 DNase-seq 数据接口并加速分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5be8/3834841/9b0a732cbe52/gkt850f1p.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验