• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于DNA形状的调控评分提高了基于位置权重矩阵对转录因子结合位点的识别。

A DNA shape-based regulatory score improves position-weight matrix-based recognition of transcription factor binding sites.

作者信息

Yang Jichen, Ramsey Stephen A

机构信息

Department of Biomedical Sciences and.

Department of Biomedical Sciences and School of Electrical Engineering and Computer Science, Oregon State University, Corvallis, OR, USA.

出版信息

Bioinformatics. 2015 Nov 1;31(21):3445-50. doi: 10.1093/bioinformatics/btv391. Epub 2015 Jun 30.

DOI:10.1093/bioinformatics/btv391
PMID:26130577
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4838056/
Abstract

MOTIVATION

The position-weight matrix (PWM) is a useful representation of a transcription factor binding site (TFBS) sequence pattern because the PWM can be estimated from a small number of representative TFBS sequences. However, because the PWM probability model assumes independence between individual nucleotide positions, the PWMs for some TFs poorly discriminate binding sites from non-binding-sites that have similar sequence content. Since the local three-dimensional DNA structure ('shape') is a determinant of TF binding specificity and since DNA shape has a significant sequence-dependence, we combined DNA shape-derived features into a TF-generalized regulatory score and tested whether the score could improve PWM-based discrimination of TFBS from non-binding-sites.

RESULTS

We compared a traditional PWM model to a model that combines the PWM with a DNA shape feature-based regulatory potential score, for accuracy in detecting binding sites for 75 vertebrate transcription factors. The PWM+shape model was more accurate than the PWM-only model, for 45% of TFs tested, with no significant loss of accuracy for the remaining TFs.

AVAILABILITY AND IMPLEMENTATION

The shape-based model is available as an open-source R package at that is archived on the GitHub software repository at https://github.com/ramseylab/regshape/.

CONTACT

stephen.ramsey@oregonstate.edu

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

位置权重矩阵(PWM)是转录因子结合位点(TFBS)序列模式的一种有用表示形式,因为PWM可以从少量代表性TFBS序列中估计出来。然而,由于PWM概率模型假设各个核苷酸位置之间相互独立,某些转录因子的PWM难以区分具有相似序列内容的结合位点和非结合位点。由于局部三维DNA结构(“形状”)是TF结合特异性的决定因素,并且由于DNA形状具有显著的序列依赖性,我们将基于DNA形状的特征组合成一个TF通用调控得分,并测试该得分是否可以提高基于PWM的TFBS与非结合位点的区分能力。

结果

我们将传统的PWM模型与一个将PWM与基于DNA形状特征的调控潜力得分相结合的模型进行了比较,以检测75种脊椎动物转录因子结合位点的准确性。对于45%的测试转录因子,PWM+形状模型比仅使用PWM的模型更准确,其余转录因子的准确性没有显著损失。

可用性和实现方式

基于形状的模型作为一个开源R包提供,存档于GitHub软件仓库(https://github.com/ramseylab/regshape/)。

联系方式

stephen.ramsey@oregonstate.edu

补充信息

补充数据可在《生物信息学》在线获取。

相似文献

1
A DNA shape-based regulatory score improves position-weight matrix-based recognition of transcription factor binding sites.一种基于DNA形状的调控评分提高了基于位置权重矩阵对转录因子结合位点的识别。
Bioinformatics. 2015 Nov 1;31(21):3445-50. doi: 10.1093/bioinformatics/btv391. Epub 2015 Jun 30.
2
Transcription Factor Information System (TFIS): A Tool for Detection of Transcription Factor Binding Sites.转录因子信息系统(TFIS):一种用于检测转录因子结合位点的工具。
Interdiscip Sci. 2017 Sep;9(3):378-391. doi: 10.1007/s12539-016-0168-5. Epub 2016 Apr 6.
3
Assessment of clusters of transcription factor binding sites in relationship to human promoter, CpG islands and gene expression.与人类启动子、CpG岛及基因表达相关的转录因子结合位点簇的评估。
BMC Genomics. 2004 Feb 23;5(1):16. doi: 10.1186/1471-2164-5-16.
4
A general pairwise interaction model provides an accurate description of in vivo transcription factor binding sites.一种通用的成对相互作用模型能够准确描述体内转录因子结合位点。
PLoS One. 2014 Jun 13;9(6):e99015. doi: 10.1371/journal.pone.0099015. eCollection 2014.
5
EMQIT: a machine learning approach for energy based PWM matrix quality improvement.EMQIT:一种基于能量的脉宽调制矩阵质量改进的机器学习方法。
Biol Direct. 2017 Aug 1;12(1):17. doi: 10.1186/s13062-017-0189-y.
6
A novel method for improved accuracy of transcription factor binding site prediction.一种提高转录因子结合位点预测准确性的新方法。
Nucleic Acids Res. 2018 Jul 6;46(12):e72. doi: 10.1093/nar/gky237.
7
Logo2PWM: a tool to convert sequence logo to position weight matrix.Logo2PWM:将序列 logo 转换为位置权重矩阵的工具。
BMC Genomics. 2017 Oct 3;18(Suppl 6):709. doi: 10.1186/s12864-017-4023-9.
8
Tree-based position weight matrix approach to model transcription factor binding site profiles.基于树的位置权重矩阵方法来模拟转录因子结合位点图谱。
PLoS One. 2011;6(9):e24210. doi: 10.1371/journal.pone.0024210. Epub 2011 Sep 2.
9
Inferring intra-motif dependencies of DNA binding sites from ChIP-seq data.从ChIP-seq数据推断DNA结合位点的基序内依赖性。
BMC Bioinformatics. 2015 Nov 9;16:375. doi: 10.1186/s12859-015-0797-4.
10
Dinucleotide weight matrices for predicting transcription factor binding sites: generalizing the position weight matrix.二核苷酸权重矩阵用于预测转录因子结合位点:位置权重矩阵的推广。
PLoS One. 2010 Mar 22;5(3):e9722. doi: 10.1371/journal.pone.0009722.

引用本文的文献

1
Harnessing regulatory networks in Actinobacteria for natural product discovery.利用放线菌中的调控网络进行天然产物发现。
J Ind Microbiol Biotechnol. 2024 Jan 9;51. doi: 10.1093/jimb/kuae011.
2
Predicting DNA structure using a deep learning method.使用深度学习方法预测 DNA 结构。
Nat Commun. 2024 Feb 9;15(1):1243. doi: 10.1038/s41467-024-45191-5.
3
PWM2Vec: An Efficient Embedding Approach for Viral Host Specification from Coronavirus Spike Sequences.PWM2Vec:一种基于冠状病毒刺突序列进行病毒宿主特异性分析的高效嵌入方法。
Biology (Basel). 2022 Mar 9;11(3):418. doi: 10.3390/biology11030418.
4
Sequence and chromatin determinants of transcription factor binding and the establishment of cell type-specific binding patterns.转录因子结合的序列和染色质决定因素,以及细胞类型特异性结合模式的建立。
Biochim Biophys Acta Gene Regul Mech. 2020 Jun;1863(6):194443. doi: 10.1016/j.bbagrm.2019.194443. Epub 2019 Oct 19.
5
Landscape of transcriptional deregulation in lung cancer.肺癌中转录失调的全景。
BMC Genomics. 2018 Jun 5;19(1):435. doi: 10.1186/s12864-018-4828-1.
6
A unified approach for quantifying and interpreting DNA shape readout by transcription factors.一种统一的方法,用于量化和解释转录因子对 DNA 形状的读取。
Mol Syst Biol. 2018 Feb 22;14(2):e7902. doi: 10.15252/msb.20177902.
7
Expanding the repertoire of DNA shape features for genome-scale studies of transcription factor binding.拓展用于转录因子结合的全基因组规模研究的DNA形状特征库。
Nucleic Acids Res. 2017 Dec 15;45(22):12877-12887. doi: 10.1093/nar/gkx1145.
8
Predicting Variation of DNA Shape Preferences in Protein-DNA Interaction in Cancer Cells with a New Biophysical Model.利用新的生物物理模型预测癌细胞中蛋白质 - DNA 相互作用中 DNA 形状偏好的变化。
Genes (Basel). 2017 Sep 18;8(9):233. doi: 10.3390/genes8090233.
9
DNA Shape Features Improve Transcription Factor Binding Site Predictions In Vivo.DNA 形状特征可改善体内转录因子结合位点预测。
Cell Syst. 2016 Sep 28;3(3):278-286.e4. doi: 10.1016/j.cels.2016.07.001. Epub 2016 Aug 18.
10
Quantitative modeling of gene expression using DNA shape features of binding sites.利用结合位点的DNA形状特征对基因表达进行定量建模。
Nucleic Acids Res. 2016 Jul 27;44(13):e120. doi: 10.1093/nar/gkw446. Epub 2016 Jun 1.

本文引用的文献

1
TFBSshape: a motif database for DNA shape features of transcription factor binding sites.TFBSshape:一个转录因子结合位点 DNA 形状特征的基序数据库。
Nucleic Acids Res. 2014 Jan;42(Database issue):D148-55. doi: 10.1093/nar/gkt1087. Epub 2013 Nov 7.
2
DNAshape: a method for the high-throughput prediction of DNA structural features on a genomic scale.DNA 构象:一种高通量预测基因组范围内 DNA 结构特征的方法。
Nucleic Acids Res. 2013 Jul;41(Web Server issue):W56-62. doi: 10.1093/nar/gkt437. Epub 2013 May 22.
3
Architecture of the human regulatory network derived from ENCODE data.人类调控网络的结构源自 ENCODE 数据。
Nature. 2012 Sep 6;489(7414):91-100. doi: 10.1038/nature11245.
4
A flexible integrative approach based on random forest improves prediction of transcription factor binding sites.基于随机森林的灵活综合方法提高了转录因子结合位点的预测能力。
Nucleic Acids Res. 2012 Aug;40(14):e106. doi: 10.1093/nar/gks283. Epub 2012 Apr 5.
5
Epigenetic priors for identifying active transcription factor binding sites.用于识别活性转录因子结合位点的表观遗传先验
Bioinformatics. 2012 Jan 1;28(1):56-62. doi: 10.1093/bioinformatics/btr614. Epub 2011 Nov 8.
6
Accurate inference of transcription factor binding from DNA sequence and chromatin accessibility data.从 DNA 序列和染色质可及性数据中准确推断转录因子结合。
Genome Res. 2011 Mar;21(3):447-55. doi: 10.1101/gr.112623.110. Epub 2010 Nov 24.
7
Genome-wide histone acetylation data improve prediction of mammalian transcription factor binding sites.全基因组组蛋白乙酰化数据可提高哺乳动物转录因子结合位点预测的准确性。
Bioinformatics. 2010 Sep 1;26(17):2071-5. doi: 10.1093/bioinformatics/btq405. Epub 2010 Jul 27.
8
Integrating multiple evidence sources to predict transcription factor binding in the human genome.整合多个证据来源以预测人类基因组中的转录因子结合
Genome Res. 2010 Apr;20(4):526-36. doi: 10.1101/gr.096305.109. Epub 2010 Mar 10.
9
Genome-wide prediction of transcription factor binding sites using an integrated model.基于整合模型的转录因子结合位点的全基因组预测。
Genome Biol. 2010 Jan 22;11(1):R7. doi: 10.1186/gb-2010-11-1-r7.
10
Genomic targets of the KRAB and SCAN domain-containing zinc finger protein 263.KRAB 和 SCAN 结构域含有锌指蛋白 263 的基因组靶标。
J Biol Chem. 2010 Jan 8;285(2):1393-403. doi: 10.1074/jbc.M109.063032. Epub 2009 Nov 2.