• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过共复合物网络比对,丰富不显著 BLAST 命中的同源物。

Enrichment of homologs in insignificant BLAST hits by co-complex network alignment.

机构信息

Theoretical Biology and Bioinformatics group, Department of Biology, Faculty of Science, Utrecht University, Padualaan 8, Utrecht, 3584CH, Utrecht, the Netherlands.

出版信息

BMC Bioinformatics. 2010 Feb 12;11:86. doi: 10.1186/1471-2105-11-86.

DOI:10.1186/1471-2105-11-86
PMID:20152020
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2836305/
Abstract

BACKGROUND

Homology is a crucial concept in comparative genomics. The algorithm probably most widely used for homology detection in comparative genomics, is BLAST. Usually a stringent score cutoff is applied to distinguish putative homologs from possible false positive hits. As a consequence, some BLAST hits are discarded that are in fact homologous.

RESULTS

Analogous to the use of the genomics context in genome alignments, we test whether conserved functional context can be used to select candidate homologs from insignificant BLAST hits. We make a co-complex network alignment between complex subunits in yeast and human and find that proteins with an insignificant BLAST hit that are part of homologous complexes, are likely to be homologous themselves. Further analysis of the distant homologs we recovered using the co-complex network alignment, shows that a large majority of these distant homologs are in fact ancient paralogs.

CONCLUSIONS

Our results show that, even though evolution takes place at the sequence and genome level, co-complex networks can be used as circumstantial evidence to improve confidence in the homology of distantly related sequences.

摘要

背景

同源性是比较基因组学中的一个关键概念。在比较基因组学中,用于同源性检测的算法可能是最广泛使用的 BLAST。通常应用严格的得分截止值来区分可能的同源物和可能的假阳性命中。因此,一些实际上是同源的 BLAST 命中被丢弃。

结果

类似于在基因组比对中使用基因组学上下文,我们测试是否可以利用保守的功能上下文从无意义的 BLAST 命中中选择候选同源物。我们在酵母和人类的复杂亚基之间进行共复合物网络比对,并发现具有无意义 BLAST 命中但属于同源复合物的蛋白质本身很可能是同源的。对我们使用共复合物网络比对恢复的远程同源物的进一步分析表明,这些远程同源物中的绝大多数实际上是古老的旁系同源物。

结论

我们的结果表明,尽管进化发生在序列和基因组水平上,但共复合物网络可以作为间接证据,提高对远距离相关序列同源性的信心。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18d0/2836305/c1e6644c6510/1471-2105-11-86-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18d0/2836305/d8b314b5969b/1471-2105-11-86-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18d0/2836305/d1bbcd1d938b/1471-2105-11-86-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18d0/2836305/c2342a52c5e8/1471-2105-11-86-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18d0/2836305/c1e6644c6510/1471-2105-11-86-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18d0/2836305/d8b314b5969b/1471-2105-11-86-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18d0/2836305/d1bbcd1d938b/1471-2105-11-86-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18d0/2836305/c2342a52c5e8/1471-2105-11-86-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/18d0/2836305/c1e6644c6510/1471-2105-11-86-4.jpg

相似文献

1
Enrichment of homologs in insignificant BLAST hits by co-complex network alignment.通过共复合物网络比对,丰富不显著 BLAST 命中的同源物。
BMC Bioinformatics. 2010 Feb 12;11:86. doi: 10.1186/1471-2105-11-86.
2
Identification of homologs in insignificant blast hits by exploiting extrinsic gene properties.通过利用外在基因特性鉴定无显著比对结果的同源物。
BMC Bioinformatics. 2007 Sep 21;8:356. doi: 10.1186/1471-2105-8-356.
3
Large-scale comparison of protein sequence alignment algorithms with structure alignments.蛋白质序列比对算法与结构比对的大规模比较。
Proteins. 2000 Jul 1;40(1):6-22. doi: 10.1002/(sici)1097-0134(20000701)40:1<6::aid-prot30>3.0.co;2-7.
4
Detection of homologous proteins by an intermediate sequence search.通过中间序列搜索检测同源蛋白。
Protein Sci. 2004 Jan;13(1):54-62. doi: 10.1110/ps.03335004.
5
Iterative sequence/secondary structure search for protein homologs: comparison with amino acid sequence alignments and application to fold recognition in genome databases.用于蛋白质同源物的迭代序列/二级结构搜索:与氨基酸序列比对的比较及在基因组数据库中折叠识别的应用
Bioinformatics. 2000 Nov;16(11):988-1002. doi: 10.1093/bioinformatics/16.11.988.
6
FastBLAST: homology relationships for millions of proteins.FastBLAST:数百万种蛋白质的同源关系。
PLoS One. 2008;3(10):e3589. doi: 10.1371/journal.pone.0003589. Epub 2008 Oct 31.
7
Quality of alignment comparison by COMPASS improves with inclusion of diverse confident homologs.通过COMPASS进行比对的质量会随着纳入多样的可靠同源物而提高。
Bioinformatics. 2004 Apr 12;20(6):818-28. doi: 10.1093/bioinformatics/btg485. Epub 2004 Jan 29.
8
Efficient recognition of protein fold at low sequence identity by conservative application of Psi-BLAST: validation.通过保守应用Psi-BLAST在低序列同一性下高效识别蛋白质折叠:验证
J Mol Recognit. 2005 Mar-Apr;18(2):139-49. doi: 10.1002/jmr.721.
9
Ballast: blast post-processing based on locally conserved segments.镇流器:基于局部保守片段的爆炸后处理
Bioinformatics. 2000 Sep;16(9):750-9. doi: 10.1093/bioinformatics/16.9.750.
10
A performance enhanced PSI-BLAST based on hybrid alignment.基于混合比对的性能增强 PSI-BLAST。
Bioinformatics. 2011 Jan 1;27(1):31-7. doi: 10.1093/bioinformatics/btq621. Epub 2010 Nov 24.

引用本文的文献

1
Protein functional annotation of simultaneously improved stability, accuracy and false discovery rate achieved by a sequence-based deep learning.基于序列的深度学习同时提高稳定性、准确性和假阳性率的蛋白质功能注释。
Brief Bioinform. 2020 Jul 15;21(4):1437-1447. doi: 10.1093/bib/bbz081.
2
Extensive gene remodeling in the viral world: new evidence for nongradual evolution in the mobilome network.病毒世界中的广泛基因重塑:移动基因组网络中非渐进式进化的新证据。
Genome Biol Evol. 2014 Aug 7;6(9):2195-205. doi: 10.1093/gbe/evu168.
3
Phylometrics: a pipeline for inferring phylogenetic trees from a sequence relationship network perspective.

本文引用的文献

1
Cohesive versus flexible evolution of functional modules in eukaryotes.真核生物中功能模块的凝聚性进化与灵活性进化
PLoS Comput Biol. 2009 Jan;5(1):e1000276. doi: 10.1371/journal.pcbi.1000276. Epub 2009 Jan 30.
2
Complex fate of paralogs.旁系同源基因的复杂命运
BMC Evol Biol. 2008 Dec 18;8:337. doi: 10.1186/1471-2148-8-337.
3
Reactome knowledgebase of human biological pathways and processes.人类生物途径和过程的Reactome知识库。
系统发育学:从序列关系网络视角推断系统发育树的管道。
BMC Bioinformatics. 2010 Oct 7;11 Suppl 6(Suppl 6):S18. doi: 10.1186/1471-2105-11-S6-S18.
Nucleic Acids Res. 2009 Jan;37(Database issue):D619-22. doi: 10.1093/nar/gkn863. Epub 2008 Nov 3.
4
Evolution of genome architecture.基因组结构的演变
Int J Biochem Cell Biol. 2009 Feb;41(2):298-306. doi: 10.1016/j.biocel.2008.09.015. Epub 2008 Sep 26.
5
Global alignment of multiple protein interaction networks with application to functional orthology detection.多个蛋白质相互作用网络的全局比对及其在功能直系同源检测中的应用。
Proc Natl Acad Sci U S A. 2008 Sep 2;105(35):12763-8. doi: 10.1073/pnas.0806627105. Epub 2008 Aug 25.
6
Evolution of an RNP assembly system: a minimal SMN complex facilitates formation of UsnRNPs in Drosophila melanogaster.核糖核蛋白组装系统的进化:一个最小的生存运动神经元复合体促进黑腹果蝇中UsnRNP的形成。
Proc Natl Acad Sci U S A. 2008 Jul 22;105(29):10045-50. doi: 10.1073/pnas.0802287105. Epub 2008 Jul 10.
7
Prediction of enzyme function by combining sequence similarity and protein interactions.通过结合序列相似性和蛋白质相互作用预测酶的功能。
BMC Bioinformatics. 2008 May 27;9:249. doi: 10.1186/1471-2105-9-249.
8
MIPS: analysis and annotation of genome information in 2007.MIPS:2007年基因组信息的分析与注释
Nucleic Acids Res. 2008 Jan;36(Database issue):D196-201. doi: 10.1093/nar/gkm980. Epub 2007 Dec 23.
9
CORUM: the comprehensive resource of mammalian protein complexes.CORUM:哺乳动物蛋白质复合物的综合资源库。
Nucleic Acids Res. 2008 Jan;36(Database issue):D646-50. doi: 10.1093/nar/gkm936. Epub 2007 Oct 26.
10
Identification of homologs in insignificant blast hits by exploiting extrinsic gene properties.通过利用外在基因特性鉴定无显著比对结果的同源物。
BMC Bioinformatics. 2007 Sep 21;8:356. doi: 10.1186/1471-2105-8-356.