• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过域级同源性提高转录因子位置权重矩阵的覆盖率。

Increasing coverage of transcription factor position weight matrices through domain-level homology.

机构信息

Institute for Systems Biology, Seattle, Washington, United States of America.

出版信息

PLoS One. 2012;7(8):e42779. doi: 10.1371/journal.pone.0042779. Epub 2012 Aug 27.

DOI:10.1371/journal.pone.0042779
PMID:22952610
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3428306/
Abstract

Transcription factor-DNA interactions, central to cellular regulation and control, are commonly described by position weight matrices (PWMs). These matrices are frequently used to predict transcription factor binding sites in regulatory regions of DNA to complement and guide further experimental investigation. The DNA sequence preferences of transcription factors, encoded in PWMs, are dictated primarily by select residues within the DNA binding domain(s) that interact directly with DNA. Therefore, the DNA binding properties of homologous transcription factors with identical DNA binding domains may be characterized by PWMs derived from different species. Accordingly, we have implemented a fully automated domain-level homology searching method for identical DNA binding sequences.By applying the domain-level homology search to transcription factors with existing PWMs in the JASPAR and TRANSFAC databases, we were able to significantly increase coverage in terms of the total number of PWMs associated with a given species, assign PWMs to transcription factors that did not previously have any associations, and increase the number of represented species with PWMs over an order of magnitude. Additionally, using protein binding microarray (PBM) data, we have validated the domain-level method by demonstrating that transcription factor pairs with matching DNA binding domains exhibit comparable DNA binding specificity predictions to transcription factor pairs with completely identical sequences.The increased coverage achieved herein demonstrates the potential for more thorough species-associated investigation of protein-DNA interactions using existing resources. The PWM scanning results highlight the challenging nature of transcription factors that contain multiple DNA binding domains, as well as the impact of motif discovery on the ability to predict DNA binding properties. The method is additionally suitable for identifying domain-level homology mappings to enable utilization of additional information sources in the study of transcription factors. The domain-level homology search method, resulting PWM mappings, web-based user interface, and web API are publicly available at http://dodoma.systemsbiology.netdodoma.systemsbiology.net.

摘要

转录因子与 DNA 的相互作用是细胞调控的核心,通常用位置权重矩阵 (PWMs) 来描述。这些矩阵常用于预测 DNA 调控区域中转录因子的结合位点,以补充和指导进一步的实验研究。PWMs 中编码的转录因子与 DNA 的结合序列偏好主要由与 DNA 直接相互作用的 DNA 结合域内的选择残基决定。因此,具有相同 DNA 结合域的同源转录因子的 DNA 结合特性可以用来自不同物种的 PWM 来表征。因此,我们实现了一种完全自动化的同源性搜索方法,用于搜索具有相同 DNA 结合序列的同源性转录因子。通过将域级同源性搜索应用于 JASPAR 和 TRANSFAC 数据库中具有现有 PWM 的转录因子,我们能够显著增加与给定物种相关的 PWM 总数的覆盖范围,将 PWM 分配给以前没有任何关联的转录因子,并将具有 PWM 的代表物种数量增加一个数量级。此外,使用蛋白质结合微阵列 (PBM) 数据,我们通过证明具有匹配 DNA 结合域的转录因子对与具有完全相同序列的转录因子对具有可比的 DNA 结合特异性预测,验证了域级方法的有效性。本文实现的更高的覆盖范围表明,使用现有资源更全面地研究物种相关的蛋白质-DNA 相互作用具有潜力。PWM 扫描结果突出了包含多个 DNA 结合域的转录因子的挑战性,以及 motif 发现对预测 DNA 结合特性能力的影响。该方法还适合于识别域级同源映射,以在转录因子研究中利用其他信息源。域级同源性搜索方法、产生的 PWM 映射、基于网络的用户界面和网络 API 可在 http://dodoma.systemsbiology.net 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a469/3428306/0c7059e340d3/pone.0042779.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a469/3428306/c2bb0cc3eb79/pone.0042779.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a469/3428306/d37d13a15e3e/pone.0042779.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a469/3428306/cd71b58f0ae5/pone.0042779.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a469/3428306/6eee721d67c4/pone.0042779.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a469/3428306/051912da9831/pone.0042779.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a469/3428306/913330be7291/pone.0042779.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a469/3428306/0c7059e340d3/pone.0042779.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a469/3428306/c2bb0cc3eb79/pone.0042779.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a469/3428306/d37d13a15e3e/pone.0042779.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a469/3428306/cd71b58f0ae5/pone.0042779.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a469/3428306/6eee721d67c4/pone.0042779.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a469/3428306/051912da9831/pone.0042779.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a469/3428306/913330be7291/pone.0042779.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a469/3428306/0c7059e340d3/pone.0042779.g007.jpg

相似文献

1
Increasing coverage of transcription factor position weight matrices through domain-level homology.通过域级同源性提高转录因子位置权重矩阵的覆盖率。
PLoS One. 2012;7(8):e42779. doi: 10.1371/journal.pone.0042779. Epub 2012 Aug 27.
2
abc4pwm: affinity based clustering for position weight matrices in applications of DNA sequence analysis.abc4pwm:基于亲和度的位置权重矩阵聚类在 DNA 序列分析中的应用。
BMC Bioinformatics. 2022 Mar 3;23(1):83. doi: 10.1186/s12859-022-04615-z.
3
Optimized position weight matrices in prediction of novel putative binding sites for transcription factors in the Drosophila melanogaster genome.优化位置权重矩阵以预测果蝇基因组中转录因子的新型潜在结合位点。
PLoS One. 2013 Aug 6;8(8):e68712. doi: 10.1371/journal.pone.0068712. Print 2013.
4
Reliable scaling of position weight matrices for binding strength comparisons between transcription factors.用于转录因子之间结合强度比较的位置权重矩阵的可靠缩放。
BMC Bioinformatics. 2015 Aug 20;16:265. doi: 10.1186/s12859-015-0666-1.
5
Creating PWMs of transcription factors using 3D structure-based computation of protein-DNA free binding energies.使用基于 3D 结构的蛋白-DNA 自由结合能计算来创建转录因子的 PWMs。
BMC Bioinformatics. 2010 May 3;11:225. doi: 10.1186/1471-2105-11-225.
6
DISPARE: DIScriminative PAttern REfinement for Position Weight Matrices.DISPARE:位置权重矩阵的判别式模式细化。
BMC Bioinformatics. 2009 Nov 26;10:388. doi: 10.1186/1471-2105-10-388.
7
Improvement of TRANSFAC matrices using multiple local alignment of transcription factor binding site sequences.利用转录因子结合位点序列的多重局部比对改进TRANSFAC矩阵。
Genome Inform. 2005;16(1):68-72.
8
A DNA shape-based regulatory score improves position-weight matrix-based recognition of transcription factor binding sites.一种基于DNA形状的调控评分提高了基于位置权重矩阵对转录因子结合位点的识别。
Bioinformatics. 2015 Nov 1;31(21):3445-50. doi: 10.1093/bioinformatics/btv391. Epub 2015 Jun 30.
9
PiDNA: Predicting protein-DNA interactions with structural models.PiDNA:利用结构模型预测蛋白质-DNA 相互作用。
Nucleic Acids Res. 2013 Jul;41(Web Server issue):W523-30. doi: 10.1093/nar/gkt388. Epub 2013 May 22.
10
Dinucleotide weight matrices for predicting transcription factor binding sites: generalizing the position weight matrix.二核苷酸权重矩阵用于预测转录因子结合位点:位置权重矩阵的推广。
PLoS One. 2010 Mar 22;5(3):e9722. doi: 10.1371/journal.pone.0009722.

引用本文的文献

1
GHT-SELEX demonstrates unexpectedly high intrinsic sequence specificity and complex DNA binding of many human transcription factors.GHT-SELEX显示出许多人类转录因子具有出乎意料的高内在序列特异性和复杂的DNA结合能力。
bioRxiv. 2024 Nov 12:2024.11.11.618478. doi: 10.1101/2024.11.11.618478.
2
Evidence for a hierarchical transcriptional circuit in Drosophila male germline involving testis-specific TAF and two gene-specific transcription factors, Mod and Acj6.果蝇雄性生殖细胞中涉及睾丸特异性 TAF 和两个基因特异性转录因子 Mod 和 Acj6 的层次转录回路的证据。
FEBS Lett. 2018 Jan;592(1):46-59. doi: 10.1002/1873-3468.12937. Epub 2017 Dec 27.
3

本文引用的文献

1
A linear model for transcription factor binding affinity prediction in protein binding microarrays.一种用于蛋白质结合微阵列中转录因子结合亲和力预测的线性模型。
PLoS One. 2011;6(5):e20059. doi: 10.1371/journal.pone.0020059. Epub 2011 May 26.
2
The Pfam protein families database.Pfam 蛋白质家族数据库。
Nucleic Acids Res. 2010 Jan;38(Database issue):D211-22. doi: 10.1093/nar/gkp985. Epub 2009 Nov 17.
3
JASPAR 2010: the greatly expanded open-access database of transcription factor binding profiles.JASPAR 2010:转录因子结合谱的大幅扩展的开放获取数据库。
Integrated analysis of motif activity and gene expression changes of transcription factors.
转录因子基序活性和基因表达变化的综合分析。
Genome Res. 2018 Feb;28(2):243-255. doi: 10.1101/gr.227231.117. Epub 2017 Dec 12.
4
Functional Profiling of Human MeCP2 by Automated Data Comparison Analysis and Computerized Expression Pathway Modeling.通过自动数据比较分析和计算机化表达途径建模对人类MeCP2进行功能分析
Healthc Inform Res. 2016 Apr;22(2):120-8. doi: 10.4258/hir.2016.22.2.120. Epub 2016 Apr 30.
5
Redundant ERF-VII Transcription Factors Bind to an Evolutionarily Conserved cis-Motif to Regulate Hypoxia-Responsive Gene Expression in Arabidopsis.冗余的ERF-VII转录因子与一个进化上保守的顺式基序结合,以调控拟南芥中的缺氧响应基因表达。
Plant Cell. 2016 Jan;28(1):160-80. doi: 10.1105/tpc.15.00866. Epub 2015 Dec 14.
6
The bZIP Transcription Factor HAC-1 Is Involved in the Unfolded Protein Response and Is Necessary for Growth on Cellulose in Neurospora crassa.bZIP转录因子HAC-1参与未折叠蛋白反应,是粗糙脉孢菌在纤维素上生长所必需的。
PLoS One. 2015 Jul 1;10(7):e0131415. doi: 10.1371/journal.pone.0131415. eCollection 2015.
7
Determination and inference of eukaryotic transcription factor sequence specificity.真核转录因子序列特异性的测定和推断。
Cell. 2014 Sep 11;158(6):1431-1443. doi: 10.1016/j.cell.2014.08.009.
Nucleic Acids Res. 2010 Jan;38(Database issue):D105-10. doi: 10.1093/nar/gkp950. Epub 2009 Nov 11.
4
PROSITE, a protein domain database for functional characterization and annotation.PROSITE,一个用于功能特征描述和注释的蛋白质域数据库。
Nucleic Acids Res. 2010 Jan;38(Database issue):D161-6. doi: 10.1093/nar/gkp885. Epub 2009 Oct 25.
5
The Universal Protein Resource (UniProt) in 2010.2010 年的通用蛋白质资源(UniProt)。
Nucleic Acids Res. 2010 Jan;38(Database issue):D142-8. doi: 10.1093/nar/gkp846. Epub 2009 Oct 20.
6
A census of human transcription factors: function, expression and evolution.人类转录因子普查:功能、表达与进化
Nat Rev Genet. 2009 Apr;10(4):252-63. doi: 10.1038/nrg2538.
7
Predicting the binding preference of transcription factors to individual DNA k-mers.预测转录因子与单个DNA k聚体的结合偏好性。
Bioinformatics. 2009 Apr 15;25(8):1012-8. doi: 10.1093/bioinformatics/btn645. Epub 2008 Dec 16.
8
SMART 6: recent updates and new developments.SMART 6:近期更新与新进展
Nucleic Acids Res. 2009 Jan;37(Database issue):D229-32. doi: 10.1093/nar/gkn808. Epub 2008 Oct 31.
9
InterPro: the integrative protein signature database.InterPro:综合蛋白质特征数据库。
Nucleic Acids Res. 2009 Jan;37(Database issue):D211-5. doi: 10.1093/nar/gkn785. Epub 2008 Oct 21.
10
UniPROBE: an online database of protein binding microarray data on protein-DNA interactions.UniPROBE:一个关于蛋白质与DNA相互作用的蛋白质结合微阵列数据在线数据库。
Nucleic Acids Res. 2009 Jan;37(Database issue):D77-82. doi: 10.1093/nar/gkn660. Epub 2008 Oct 8.