• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种新的无比对方法用于比较转录因子结合位点基序。

A novel alignment-free method for comparing transcription factor binding site motifs.

机构信息

Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, North Carolina, United States of America.

出版信息

PLoS One. 2010 Jan 20;5(1):e8797. doi: 10.1371/journal.pone.0008797.

DOI:10.1371/journal.pone.0008797
PMID:20098703
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2808352/
Abstract

BACKGROUND

Transcription factor binding site (TFBS) motifs can be accurately represented by position frequency matrices (PFM) or other equivalent forms. We often need to compare TFBS motifs using their PFMs in order to search for similar motifs in a motif database, or cluster motifs according to their binding preference. The majority of current methods for motif comparison involve a similarity metric for column-to-column comparison and a method to find the optimal position alignment between the two compared motifs. In some applications, alignment-free methods might be preferred; however, few such methods with high accuracy have been described.

METHODOLOGY/PRINCIPAL FINDINGS: Here we describe a novel alignment-free method for quantifying the similarity of motifs using their PFMs by converting PFMs into k-mer vectors. The motifs could then be compared by measuring the similarity among their corresponding k-mer vectors.

CONCLUSIONS/SIGNIFICANCE: We demonstrate that our method in general achieves similar performance or outperforms the existing methods for clustering motifs according to their binding preference and identifying similar motifs of transcription factors of the same family.

摘要

背景

转录因子结合位点(TFBS)基序可以通过位置频率矩阵(PFM)或其他等效形式准确地表示。为了在 motif 数据库中搜索相似的基序,或者根据结合偏好对基序进行聚类,我们经常需要使用它们的 PFM 来比较 TFBS 基序。目前大多数用于 motif 比较的方法都涉及列与列之间比较的相似性度量和在两个比较的基序之间找到最佳位置对齐的方法。在某些应用中,可能更喜欢无对齐方法;但是,已经描述的具有高精度的此类方法很少。

方法/主要发现:在这里,我们描述了一种使用其 PFM 将 PFM 转换为 k-mer 向量来量化基序相似性的新型无对齐方法。然后可以通过测量它们对应的 k-mer 向量之间的相似性来比较基序。

结论/意义:我们证明,我们的方法通常可以根据结合偏好对基序进行聚类,并识别相同家族的转录因子的相似基序,从而实现与现有方法类似的性能,或者优于现有方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0c4/2808352/7bae7b67689e/pone.0008797.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0c4/2808352/ddfefd932d8d/pone.0008797.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0c4/2808352/0664cc59ee8b/pone.0008797.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0c4/2808352/7bae7b67689e/pone.0008797.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0c4/2808352/ddfefd932d8d/pone.0008797.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0c4/2808352/0664cc59ee8b/pone.0008797.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d0c4/2808352/7bae7b67689e/pone.0008797.g003.jpg

相似文献

1
A novel alignment-free method for comparing transcription factor binding site motifs.一种新的无比对方法用于比较转录因子结合位点基序。
PLoS One. 2010 Jan 20;5(1):e8797. doi: 10.1371/journal.pone.0008797.
2
SPIC: a novel similarity metric for comparing transcription factor binding site motifs based on information contents.SPIC:一种基于信息内容比较转录因子结合位点基序的新型相似性度量。
BMC Syst Biol. 2013;7 Suppl 2(Suppl 2):S14. doi: 10.1186/1752-0509-7-S2-S14. Epub 2013 Dec 17.
3
Alignment-free clustering of transcription factor binding motifs using a genetic-k-medoids approach.使用遗传k-中心点方法对转录因子结合基序进行无比对聚类。
BMC Bioinformatics. 2015 Jan 28;16:22. doi: 10.1186/s12859-015-0450-2.
4
FISim: a new similarity measure between transcription factor binding sites based on the fuzzy integral.FISim:一种基于模糊积分的转录因子结合位点间新的相似性度量方法。
BMC Bioinformatics. 2009 Jul 20;10:224. doi: 10.1186/1471-2105-10-224.
5
Similarity of position frequency matrices for transcription factor binding sites.转录因子结合位点的位置频率矩阵的相似性。
Bioinformatics. 2005 Feb 1;21(3):307-13. doi: 10.1093/bioinformatics/bth480. Epub 2004 Aug 19.
6
Natural similarity measures between position frequency matrices with an application to clustering.位置频率矩阵之间的自然相似性度量及其在聚类中的应用。
Bioinformatics. 2008 Feb 1;24(3):350-7. doi: 10.1093/bioinformatics/btm610. Epub 2008 Jan 2.
7
MATLIGN: a motif clustering, comparison and matching tool.MATLIGN:一种基序聚类、比较和匹配工具。
BMC Bioinformatics. 2007 Jun 8;8:189. doi: 10.1186/1471-2105-8-189.
8
A novel Bayesian DNA motif comparison method for clustering and retrieval.一种用于聚类和检索的新型贝叶斯DNA基序比较方法。
PLoS Comput Biol. 2008 Feb 29;4(2):e1000010. doi: 10.1371/journal.pcbi.1000010.
9
MoRAine--a web server for fast computational transcription factor binding motif re-annotation.MoRAine——一个用于快速计算转录因子结合基序重新注释的网络服务器。
J Integr Bioinform. 2008 Aug 25;5(2):91. doi: 10.2390/biecoll-jib-2008-91.
10
Exhaustive search for over-represented DNA sequence motifs with CisFinder.使用 CisFinder 进行过度表达的 DNA 序列基序的穷尽搜索。
DNA Res. 2009 Oct;16(5):261-73. doi: 10.1093/dnares/dsp014. Epub 2009 Sep 9.

引用本文的文献

1
Discovering unknown human and mouse transcription factor binding sites and their characteristics from ChIP-seq data.从 ChIP-seq 数据中发现未知的人和小鼠转录因子结合位点及其特征。
Proc Natl Acad Sci U S A. 2021 May 18;118(20). doi: 10.1073/pnas.2026754118.
2
Graph Theory-Based Sequence Descriptors as Remote Homology Predictors.基于图论的序列描述符可作为远程同源性预测器。
Biomolecules. 2019 Dec 23;10(1):26. doi: 10.3390/biom10010026.
3
A proximity-based graph clustering method for the identification and application of transcription factor clusters.

本文引用的文献

1
Genome-wide de novo prediction of cis-regulatory binding sites in prokaryotes.原核生物顺式调控结合位点的全基因组从头预测。
Nucleic Acids Res. 2009 Jun;37(10):e72. doi: 10.1093/nar/gkp248. Epub 2009 Apr 21.
2
Natural similarity measures between position frequency matrices with an application to clustering.位置频率矩阵之间的自然相似性度量及其在聚类中的应用。
Bioinformatics. 2008 Feb 1;24(3):350-7. doi: 10.1093/bioinformatics/btm610. Epub 2008 Jan 2.
3
JASPAR, the open access database of transcription factor-binding profiles: new content and tools in the 2008 update.
一种基于邻近性的图聚类方法,用于转录因子簇的识别与应用。
BMC Bioinformatics. 2017 Nov 29;18(1):530. doi: 10.1186/s12859-017-1935-y.
4
An efficient algorithm for improving structure-based prediction of transcription factor binding sites.一种用于改进基于结构的转录因子结合位点预测的高效算法。
BMC Bioinformatics. 2017 Jul 17;18(1):342. doi: 10.1186/s12859-017-1755-0.
5
RSAT matrix-clustering: dynamic exploration and redundancy reduction of transcription factor binding motif collections.RSAT矩阵聚类:转录因子结合基序集合的动态探索与冗余减少
Nucleic Acids Res. 2017 Jul 27;45(13):e119. doi: 10.1093/nar/gkx314.
6
CLIMP: Clustering Motifs via Maximal Cliques with Parallel Computing Design.CLIMP:通过具有并行计算设计的最大团进行基序聚类
PLoS One. 2016 Aug 3;11(8):e0160435. doi: 10.1371/journal.pone.0160435. eCollection 2016.
7
Structure-based prediction of transcription factor binding specificity using an integrative energy function.基于结构的转录因子结合特异性预测:使用整合能量函数
Bioinformatics. 2016 Jun 15;32(12):i306-i313. doi: 10.1093/bioinformatics/btw264.
8
Cofunctional Subpathways Were Regulated by Transcription Factor with Common Motif, Common Family, or Common Tissue.共功能子途径由具有共同基序、共同家族或共同组织的转录因子调控。
Biomed Res Int. 2015;2015:780357. doi: 10.1155/2015/780357. Epub 2015 Nov 24.
9
Alignment-free clustering of transcription factor binding motifs using a genetic-k-medoids approach.使用遗传k-中心点方法对转录因子结合基序进行无比对聚类。
BMC Bioinformatics. 2015 Jan 28;16:22. doi: 10.1186/s12859-015-0450-2.
10
SPIC: a novel similarity metric for comparing transcription factor binding site motifs based on information contents.SPIC:一种基于信息内容比较转录因子结合位点基序的新型相似性度量。
BMC Syst Biol. 2013;7 Suppl 2(Suppl 2):S14. doi: 10.1186/1752-0509-7-S2-S14. Epub 2013 Dec 17.
JASPAR,转录因子结合谱的开放获取数据库:2008年更新中的新内容和工具。
Nucleic Acids Res. 2008 Jan;36(Database issue):D102-6. doi: 10.1093/nar/gkm955. Epub 2007 Nov 15.
4
STAMP: a web tool for exploring DNA-binding motif similarities.STAMP:一个用于探索DNA结合基序相似性的网络工具。
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W253-8. doi: 10.1093/nar/gkm272. Epub 2007 May 3.
5
DNA familial binding profiles made easy: comparison of various motif alignment and clustering strategies.轻松实现DNA家族结合图谱:多种基序比对和聚类策略的比较
PLoS Comput Biol. 2007 Mar 30;3(3):e61. doi: 10.1371/journal.pcbi.0030061. Epub 2007 Feb 15.
6
Identifying the conserved network of cis-regulatory sites of a eukaryotic genome.识别真核生物基因组顺式调控位点的保守网络。
Proc Natl Acad Sci U S A. 2005 Nov 29;102(48):17400-5. doi: 10.1073/pnas.0505147102. Epub 2005 Nov 21.
7
Sequence features of DNA binding sites reveal structural class of associated transcription factor.DNA结合位点的序列特征揭示了相关转录因子的结构类别。
Bioinformatics. 2006 Jan 15;22(2):157-63. doi: 10.1093/bioinformatics/bti731. Epub 2005 Nov 2.
8
Improved detection of DNA motifs using a self-organized clustering of familial binding profiles.利用家族性结合谱的自组织聚类改进DNA基序检测。
Bioinformatics. 2005 Jun;21 Suppl 1:i283-91. doi: 10.1093/bioinformatics/bti1025.
9
Similarity of position frequency matrices for transcription factor binding sites.转录因子结合位点的位置频率矩阵的相似性。
Bioinformatics. 2005 Feb 1;21(3):307-13. doi: 10.1093/bioinformatics/bth480. Epub 2004 Aug 19.
10
Applied bioinformatics for the identification of regulatory elements.应用生物信息学进行调控元件的识别。
Nat Rev Genet. 2004 Apr;5(4):276-87. doi: 10.1038/nrg1315.