• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用SCOPE鉴定共调控基因中的潜在调控基序。

Using SCOPE to identify potential regulatory motifs in coregulated genes.

作者信息

Martyanov Viktor, Gross Robert H

机构信息

Department of Biology, Dartmouth College.

出版信息

J Vis Exp. 2011 May 31(51):2703. doi: 10.3791/2703.

DOI:10.3791/2703
PMID:21673638
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3197115/
Abstract

SCOPE is an ensemble motif finder that uses three component algorithms in parallel to identify potential regulatory motifs by over-representation and motif position preference. Each component algorithm is optimized to find a different kind of motif. By taking the best of these three approaches, SCOPE performs better than any single algorithm, even in the presence of noisy data. In this article, we utilize a web version of SCOPE to examine genes that are involved in telomere maintenance. SCOPE has been incorporated into at least two other motif finding programs and has been used in other studies. The three algorithms that comprise SCOPE are BEAM, which finds non-degenerate motifs (ACCGGT), PRISM, which finds degenerate motifs (ASCGWT), and SPACER, which finds longer bipartite motifs (ACCnnnnnnnnGGT). These three algorithms have been optimized to find their corresponding type of motif. Together, they allow SCOPE to perform extremely well. Once a gene set has been analyzed and candidate motifs identified, SCOPE can look for other genes that contain the motif which, when added to the original set, will improve the motif score. This can occur through over-representation or motif position preference. Working with partial gene sets that have biologically verified transcription factor binding sites, SCOPE was able to identify most of the rest of the genes also regulated by the given transcription factor. Output from SCOPE shows candidate motifs, their significance, and other information both as a table and as a graphical motif map. FAQs and video tutorials are available at the SCOPE web site which also includes a "Sample Search" button that allows the user to perform a trial run. Scope has a very friendly user interface that enables novice users to access the algorithm's full power without having to become an expert in the bioinformatics of motif finding. As input, SCOPE can take a list of genes, or FASTA sequences. These can be entered in browser text fields, or read from a file. The output from SCOPE contains a list of all identified motifs with their scores, number of occurrences, fraction of genes containing the motif, and the algorithm used to identify the motif. For each motif, result details include a consensus representation of the motif, a sequence logo, a position weight matrix, and a list of instances for every motif occurrence (with exact positions and "strand" indicated). Results are returned in a browser window and also optionally by email. Previous papers describe the SCOPE algorithms in detail.

摘要

SCOPE是一种集成基序查找工具,它并行使用三种组件算法,通过过度代表性和基序位置偏好来识别潜在的调控基序。每个组件算法都经过优化,以查找不同类型的基序。通过综合这三种方法的最佳结果,SCOPE的表现优于任何单一算法,即使在存在噪声数据的情况下也是如此。在本文中,我们利用SCOPE的网络版本来研究参与端粒维持的基因。SCOPE已被纳入至少其他两个基序查找程序中,并已用于其他研究。构成SCOPE的三种算法分别是:BEAM,用于查找非简并基序(ACCGGT);PRISM,用于查找简并基序(ASCGWT);SPACER,用于查找更长的二分基序(ACCnnnnnnnnGGT)。这三种算法已针对查找其相应类型的基序进行了优化。它们共同使SCOPE表现得极为出色。一旦分析了一个基因集并识别出候选基序,SCOPE就可以寻找其他包含该基序的基因,将这些基因添加到原始基因集中会提高基序得分。这可以通过过度代表性或基序位置偏好来实现。使用具有经过生物学验证的转录因子结合位点的部分基因集,SCOPE能够识别出也受给定转录因子调控的其余大多数基因。SCOPE的输出以表格和图形化基序图的形式显示候选基序、它们的显著性以及其他信息。SCOPE网站提供常见问题解答和视频教程,该网站还包括一个“示例搜索”按钮,允许用户进行试运行。SCOPE具有非常友好的用户界面,使新手用户无需成为基序查找生物信息学方面的专家就能充分利用该算法的功能。作为输入,SCOPE可以接受基因列表或FASTA序列。这些可以在浏览器文本字段中输入,也可以从文件中读取。SCOPE的输出包含所有已识别基序的列表,以及它们的得分、出现次数、包含该基序的基因比例,以及用于识别该基序的算法。对于每个基序,结果详细信息包括基序的共有表示、序列图谱、位置权重矩阵,以及每个基序出现实例的列表(标明确切位置和“链”)。结果在浏览器窗口中返回,也可以选择通过电子邮件返回。之前的论文详细描述了SCOPE算法。

相似文献

1
Using SCOPE to identify potential regulatory motifs in coregulated genes.使用SCOPE鉴定共调控基因中的潜在调控基序。
J Vis Exp. 2011 May 31(51):2703. doi: 10.3791/2703.
2
SCOPE: a web server for practical de novo motif discovery.SCOPE:用于实际从头基序发现的网络服务器。
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W259-64. doi: 10.1093/nar/gkm310. Epub 2007 May 7.
3
A novel ensemble learning method for de novo computational identification of DNA binding sites.一种用于从头计算识别DNA结合位点的新型集成学习方法。
BMC Bioinformatics. 2007 Jul 12;8:249. doi: 10.1186/1471-2105-8-249.
4
FR3D: finding local and composite recurrent structural motifs in RNA 3D structures.FR3D:在RNA三维结构中寻找局部和复合重复结构基序
J Math Biol. 2008 Jan;56(1-2):215-52. doi: 10.1007/s00285-007-0110-x. Epub 2007 Aug 11.
5
EMD: an ensemble algorithm for discovering regulatory motifs in DNA sequences.EMD:一种用于在DNA序列中发现调控基序的集成算法。
BMC Bioinformatics. 2006 Jul 13;7:342. doi: 10.1186/1471-2105-7-342.
6
A cluster refinement algorithm for motif discovery.一种用于发现模体的簇精炼算法。
IEEE/ACM Trans Comput Biol Bioinform. 2010 Oct-Dec;7(4):654-68. doi: 10.1109/TCBB.2009.25.
7
iMotifs: an integrated sequence motif visualization and analysis environment.iMotifs:一个集成的序列基序可视化和分析环境。
Bioinformatics. 2010 Mar 15;26(6):843-4. doi: 10.1093/bioinformatics/btq026. Epub 2010 Jan 26.
8
SamSelect: a sample sequence selection algorithm for quorum planted motif search on large DNA datasets.SamSelect:一种用于在大型 DNA 数据集上进行约定种植基序搜索的样本序列选择算法。
BMC Bioinformatics. 2018 Jun 18;19(1):228. doi: 10.1186/s12859-018-2242-y.
9
PhyloGibbs: a Gibbs sampling motif finder that incorporates phylogeny.PhyloGibbs:一种整合了系统发育的吉布斯采样基序查找器。
PLoS Comput Biol. 2005 Dec;1(7):e67. doi: 10.1371/journal.pcbi.0010067. Epub 2005 Dec 9.
10
A fast weak motif-finding algorithm based on community detection in graphs.基于图中社区检测的快速弱模式发现算法。
BMC Bioinformatics. 2013 Jul 17;14:227. doi: 10.1186/1471-2105-14-227.

引用本文的文献

1
Transcriptional rewiring over evolutionary timescales changes quantitative and qualitative properties of gene expression.在进化时间尺度上的转录重排改变了基因表达的定量和定性特征。
Elife. 2016 Sep 10;5:e18981. doi: 10.7554/eLife.18981.
2
Microarray analysis of Arabidopsis under gold exposure to identify putative genes involved in the synthesis of gold nanoparticles (AuNPs).对暴露于金环境下的拟南芥进行微阵列分析,以鉴定参与金纳米颗粒(AuNPs)合成的潜在基因。
Genom Data. 2014 Dec 23;3:100-2. doi: 10.1016/j.gdata.2014.12.001. eCollection 2015 Mar.

本文引用的文献

1
RegAnalyst: a web interface for the analysis of regulatory motifs, networks and pathways.RegAnalyst:用于分析调控基序、网络和通路的网络界面。
Nucleic Acids Res. 2009 Jul;37(Web Server issue):W193-201. doi: 10.1093/nar/gkp388. Epub 2009 May 21.
2
Three subclasses of a Drosophila insulator show distinct and cell type-specific genomic distributions.果蝇绝缘子的三个亚类表现出不同的、细胞类型特异性的基因组分布。
Genes Dev. 2009 Jun 1;23(11):1338-50. doi: 10.1101/gad.1798209. Epub 2009 May 14.
3
Identification of the Candida albicans Cap1p regulon.
白色念珠菌Cap1p调控子的鉴定。
Eukaryot Cell. 2009 Jun;8(6):806-20. doi: 10.1128/EC.00002-09. Epub 2009 Apr 24.
4
MOTIFATOR: detection and characterization of regulatory motifs using prokaryote transcriptome data.MOTIFATOR:利用原核生物转录组数据检测和表征调控基序
Bioinformatics. 2009 Feb 15;25(4):550-1. doi: 10.1093/bioinformatics/btp013. Epub 2009 Jan 25.
5
DISCLOSE : DISsection of CLusters Obtained by SEries of transcriptome data using functional annotations and putative transcription factor binding sites.公开:使用功能注释和假定转录因子结合位点对通过一系列转录组数据获得的簇进行剖析。
BMC Bioinformatics. 2008 Dec 16;9:535. doi: 10.1186/1471-2105-9-535.
6
Genomewide location analysis of Candida albicans Upc2p, a regulator of sterol metabolism and azole drug resistance.白色念珠菌Upc2p的全基因组定位分析,Upc2p是一种固醇代谢和唑类药物抗性的调节因子。
Eukaryot Cell. 2008 May;7(5):836-47. doi: 10.1128/EC.00070-08. Epub 2008 Apr 4.
7
A novel ensemble learning method for de novo computational identification of DNA binding sites.一种用于从头计算识别DNA结合位点的新型集成学习方法。
BMC Bioinformatics. 2007 Jul 12;8:249. doi: 10.1186/1471-2105-8-249.
8
SCOPE: a web server for practical de novo motif discovery.SCOPE:用于实际从头基序发现的网络服务器。
Nucleic Acids Res. 2007 Jul;35(Web Server issue):W259-64. doi: 10.1093/nar/gkm310. Epub 2007 May 7.
9
SPACER: identification of cis-regulatory elements with non-contiguous critical residues.间隔序列:具有非连续关键残基的顺式调控元件的鉴定
Bioinformatics. 2007 Apr 15;23(8):1029-31. doi: 10.1093/bioinformatics/btm041.
10
BEAM: a beam search algorithm for the identification of cis-regulatory elements in groups of genes.BEAM:一种用于识别基因组中顺式调控元件的束搜索算法。
J Comput Biol. 2006 Apr;13(3):686-701. doi: 10.1089/cmb.2006.13.686.