• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

密码子位点的进化速率可用于序列比对和推断蛋白质结构域功能。

Evolutionary rates at codon sites may be used to align sequences and infer protein domain function.

机构信息

Evolutionary Medicine Unit, University of the Witwatersrand and National Health Laboratory Service, Johannesburg, South Africa.

出版信息

BMC Bioinformatics. 2010 Mar 24;11:151. doi: 10.1186/1471-2105-11-151.

DOI:10.1186/1471-2105-11-151
PMID:20334658
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2851608/
Abstract

BACKGROUND

Sequence alignments form part of many investigations in molecular biology, including the determination of phylogenetic relationships, the prediction of protein structure and function, and the measurement of evolutionary rates. However, to obtain meaningful results, a significant degree of sequence similarity is required to ensure that the alignments are accurate and the inferences correct. Limitations arise when sequence similarity is low, which is particularly problematic when working with fast-evolving genes, evolutionary distant taxa, genomes with nucleotide biases, and cases of convergent evolution.

RESULTS

A novel approach was conceptualized to address the "low sequence similarity" alignment problem. We developed an alignment algorithm termed FIRE (Functional Inference using the Rates of Evolution), which aligns sequences using the evolutionary rate at codon sites, as measured by the dN/dS ratio, rather than nucleotide or amino acid residues. FIRE was used to test the hypotheses that evolutionary rates can be used to align sequences and that the alignments may be used to infer protein domain function. Using a range of test data, we found that aligning domains based on evolutionary rates was possible even when sequence similarity was very low (for example, antibody variable regions). Furthermore, the alignment has the potential to infer protein domain function, indicating that domains with similar functions are subject to similar evolutionary constraints. These data suggest that an evolutionary rate-based approach to sequence analysis (particularly when combined with structural data) may be used to study cases of convergent evolution or when sequences have very low similarity. However, when aligning homologous gene sets with sequence similarity, FIRE did not perform as well as the best traditional alignment algorithms indicating that the conventional approach of aligning residues as opposed to evolutionary rates remains the method of choice in these cases.

CONCLUSIONS

FIRE provides proof of concept that it is possible to align sequences and infer domain function by using evolutionary rates rather than residue similarity. This represents a new approach to sequence analysis with a wide range of potential applications in molecular biology.

摘要

背景

序列比对是分子生物学中许多研究的一部分,包括确定系统发育关系、预测蛋白质结构和功能以及测量进化率。然而,为了获得有意义的结果,需要有相当程度的序列相似性,以确保比对准确,推断正确。当序列相似性较低时,就会出现限制,特别是在处理快速进化的基因、进化距离较远的分类群、核苷酸偏倚的基因组以及趋同进化的情况下。

结果

我们提出了一种新的方法来解决“低序列相似性”比对问题。我们开发了一种称为 FIRE(使用进化速率进行功能推断)的比对算法,该算法使用由 dN/dS 比衡量的密码子位点的进化率来比对序列,而不是使用核苷酸或氨基酸残基。FIRE 用于测试以下假设:进化率可用于对齐序列,并且比对可以用于推断蛋白质结构域的功能。使用一系列测试数据,我们发现即使在序列相似性非常低的情况下(例如抗体可变区),基于进化率对齐结构域也是可能的。此外,该比对有可能推断蛋白质结构域的功能,表明具有相似功能的结构域受到相似的进化限制。这些数据表明,基于进化率的序列分析方法(特别是与结构数据结合使用时)可用于研究趋同进化的情况或当序列具有非常低的相似性时。然而,当用序列相似性对齐同源基因集时,FIRE 的性能不如最佳传统比对算法好,这表明在这些情况下,将残基而非进化率进行比对仍然是首选方法。

结论

FIRE 证明了通过使用进化率而不是残基相似性来对齐序列并推断结构域功能是可行的。这代表了一种新的序列分析方法,在分子生物学中有广泛的潜在应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/499e/2851608/8486fcc34071/1471-2105-11-151-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/499e/2851608/21b8d52024f0/1471-2105-11-151-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/499e/2851608/8486fcc34071/1471-2105-11-151-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/499e/2851608/21b8d52024f0/1471-2105-11-151-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/499e/2851608/8486fcc34071/1471-2105-11-151-2.jpg

相似文献

1
Evolutionary rates at codon sites may be used to align sequences and infer protein domain function.密码子位点的进化速率可用于序列比对和推断蛋白质结构域功能。
BMC Bioinformatics. 2010 Mar 24;11:151. doi: 10.1186/1471-2105-11-151.
2
Robust sequence alignment using evolutionary rates coupled with an amino acid substitution matrix.使用进化速率结合氨基酸替换矩阵进行稳健的序列比对。
BMC Bioinformatics. 2015 Aug 14;16:255. doi: 10.1186/s12859-015-0688-8.
3
EvoDB: a database of evolutionary rate profiles, associated protein domains and phylogenetic trees for PFAM-A.EvoDB:PFAM-A的进化速率概况、相关蛋白质结构域和系统发育树的数据库。
Database (Oxford). 2015 Jul 2;2015:bav065. doi: 10.1093/database/bav065. Print 2015.
4
Convergent evolution of domain architectures (is rare).结构域架构的趋同进化(很罕见)。
Bioinformatics. 2005 Apr 15;21(8):1464-71. doi: 10.1093/bioinformatics/bti204. Epub 2004 Dec 7.
5
GLProbs: Aligning Multiple Sequences Adaptively.GL问题:自适应多序列比对
IEEE/ACM Trans Comput Biol Bioinform. 2015 Jan-Feb;12(1):67-78. doi: 10.1109/TCBB.2014.2316820.
6
Improving the alignment quality of consistency based aligners with an evaluation function using synonymous protein words.利用同义蛋白质词的评估函数提高一致性比对器的比对质量。
PLoS One. 2011;6(12):e27872. doi: 10.1371/journal.pone.0027872. Epub 2011 Dec 2.
7
Compression-based classification of biological sequences and structures via the Universal Similarity Metric: experimental assessment.通过通用相似性度量对生物序列和结构进行基于压缩的分类:实验评估
BMC Bioinformatics. 2007 Jul 13;8:252. doi: 10.1186/1471-2105-8-252.
8
High quality protein sequence alignment by combining structural profile prediction and profile alignment using SABER-TOOTH.使用 SABER-TOOTH 结合结构轮廓预测和轮廓比对进行高质量蛋白质序列比对。
BMC Bioinformatics. 2010 May 14;11:251. doi: 10.1186/1471-2105-11-251.
9
CLUSS: clustering of protein sequences based on a new similarity measure.CLUSS:基于一种新的相似性度量对蛋白质序列进行聚类。
BMC Bioinformatics. 2007 Aug 4;8:286. doi: 10.1186/1471-2105-8-286.
10
Using CLUSTAL for multiple sequence alignments.使用CLUSTAL进行多序列比对。
Methods Enzymol. 1996;266:383-402. doi: 10.1016/s0076-6879(96)66024-8.

引用本文的文献

1
Reduction of Tooth Replacement Disproportionately Affects the Evolution of Enamel Matrix Proteins.牙齿替换的减少对釉质基质蛋白的进化产生了不成比例的影响。
J Mol Evol. 2025 Aug 7. doi: 10.1007/s00239-025-10258-4.
2
Genome-Wide Identification of in and In Vivo Transgenic Studies Confirm the Critical Roles of in the Conversion of Shikonin to Acetylshikonin.全基因组范围内对紫草素体内外转基因研究的鉴定证实了紫草素向乙酰紫草素转化过程中(此处原文缺失关键信息)的关键作用。
Life (Basel). 2022 Nov 3;12(11):1775. doi: 10.3390/life12111775.
3
Robust sequence alignment using evolutionary rates coupled with an amino acid substitution matrix.

本文引用的文献

1
Developmental genetics and homology: a hierarchical approach.发育遗传学与同源性:一种层次化方法。
Trends Ecol Evol. 1997 Oct;12(10):405-8. doi: 10.1016/s0169-5347(97)01125-7.
2
Utility of computational methods to identify the apoptosis machinery in unicellular eukaryotes.计算方法在识别单细胞真核生物中细胞凋亡机制方面的效用。
Bioinform Biol Insights. 2008 Mar 12;2:101-17. doi: 10.4137/bbi.s430.
3
Evolutionary patterning: a novel approach to the identification of potential drug target sites in Plasmodium falciparum.
使用进化速率结合氨基酸替换矩阵进行稳健的序列比对。
BMC Bioinformatics. 2015 Aug 14;16:255. doi: 10.1186/s12859-015-0688-8.
4
EvoDB: a database of evolutionary rate profiles, associated protein domains and phylogenetic trees for PFAM-A.EvoDB:PFAM-A的进化速率概况、相关蛋白质结构域和系统发育树的数据库。
Database (Oxford). 2015 Jul 2;2015:bav065. doi: 10.1093/database/bav065. Print 2015.
5
On Programmed Cell Death in Plasmodium falciparum: Status Quo.疟原虫细胞程序性死亡:现状。
J Trop Med. 2012;2012:646534. doi: 10.1155/2012/646534. Epub 2012 Jan 12.
PLoS One. 2008;3(11):e3685. doi: 10.1371/journal.pone.0003685. Epub 2008 Nov 10.
4
Models of coding sequence evolution.编码序列进化模型。
Brief Bioinform. 2009 Jan;10(1):97-109. doi: 10.1093/bib/bbn049. Epub 2008 Oct 29.
5
The quest for orthologs: finding the corresponding gene across genomes.寻找直系同源基因:在不同基因组中找到对应的基因。
Trends Genet. 2008 Nov;24(11):539-51. doi: 10.1016/j.tig.2008.08.009. Epub 2008 Sep 24.
6
Searching protein structure databases with DaliLite v.3.使用DaliLite v.3搜索蛋白质结构数据库。
Bioinformatics. 2008 Dec 1;24(23):2780-1. doi: 10.1093/bioinformatics/btn507. Epub 2008 Sep 25.
7
Multiple sequence alignment using ClustalW and ClustalX.使用ClustalW和ClustalX进行多序列比对。
Curr Protoc Bioinformatics. 2002 Aug;Chapter 2:Unit 2.3. doi: 10.1002/0471250953.bi0203s00.
8
Early diversification and complex evolutionary history of the p53 tumor suppressor gene family.p53肿瘤抑制基因家族的早期多样化及复杂进化史。
Dev Genes Evol. 2007 Dec;217(11-12):801-6. doi: 10.1007/s00427-007-0185-9. Epub 2007 Oct 9.
9
Clustal W and Clustal X version 2.0.Clustal W和Clustal X 2.0版本
Bioinformatics. 2007 Nov 1;23(21):2947-8. doi: 10.1093/bioinformatics/btm404. Epub 2007 Sep 10.
10
PAML 4: phylogenetic analysis by maximum likelihood.PAML 4:基于最大似然法的系统发育分析。
Mol Biol Evol. 2007 Aug;24(8):1586-91. doi: 10.1093/molbev/msm088. Epub 2007 May 4.