• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

TwinCons:用于揭示深度序列相似性和差异的保守性评分。

TwinCons: Conservation score for uncovering deep sequence similarity and divergence.

机构信息

NASA Center for the Origin of Life, Georgia Institute of Technology, Atlanta, Georgia, United States of America.

School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia, United States of America.

出版信息

PLoS Comput Biol. 2021 Oct 29;17(10):e1009541. doi: 10.1371/journal.pcbi.1009541. eCollection 2021 Oct.

DOI:10.1371/journal.pcbi.1009541
PMID:34714829
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8580257/
Abstract

We have developed the program TwinCons, to detect noisy signals of deep ancestry of proteins or nucleic acids. As input, the program uses a composite alignment containing pre-defined groups, and mathematically determines a 'cost' of transforming one group to the other at each position of the alignment. The output distinguishes conserved, variable and signature positions. A signature is conserved within groups but differs between groups. The method automatically detects continuous characteristic stretches (segments) within alignments. TwinCons provides a convenient representation of conserved, variable and signature positions as a single score, enabling the structural mapping and visualization of these characteristics. Structure is more conserved than sequence. TwinCons highlights alternative sequences of conserved structures. Using TwinCons, we detected highly similar segments between proteins from the translation and transcription systems. TwinCons detects conserved residues within regions of high functional importance for the ribosomal RNA (rRNA) and demonstrates that signatures are not confined to specific regions but are distributed across the rRNA structure. The ability to evaluate both nucleic acid and protein alignments allows TwinCons to be used in combined sequence and structural analysis of signatures and conservation in rRNA and in ribosomal proteins (rProteins). TwinCons detects a strong sequence conservation signal between bacterial and archaeal rProteins related by circular permutation. This conserved sequence is structurally colocalized with conserved rRNA, indicated by TwinCons scores of rRNA alignments of bacterial and archaeal groups. This combined analysis revealed deep co-evolution of rRNA and rProtein buried within the deepest branching points in the tree of life.

摘要

我们开发了 TwinCons 程序,用于检测蛋白质或核酸的深层祖先的嘈杂信号。作为输入,该程序使用包含预定义组的组合比对,并在比对的每个位置通过数学方法确定将一个组转换为另一个组的“成本”。输出区分保守、可变和特征位置。特征在组内保守但在组间不同。该方法自动检测比对中连续的特征(段)。TwinCons 将保守、可变和特征位置的表示作为单个分数,从而能够对这些特征进行结构映射和可视化。结构比序列更保守。TwinCons 突出显示保守结构的替代序列。使用 TwinCons,我们在来自翻译和转录系统的蛋白质之间检测到高度相似的片段。TwinCons 在核糖体 RNA(rRNA) 高功能重要区域内检测到保守残基,并表明特征不仅限于特定区域,而是分布在 rRNA 结构中。评估核酸和蛋白质比对的能力使 TwinCons 能够用于 rRNA 和核糖体蛋白(rProtein) 中特征和保守性的组合序列和结构分析。TwinCons 在通过循环排列相关的细菌和古细菌 rProtein 之间检测到强烈的序列保守信号。这种保守序列在结构上与保守的 rRNA 共定位,这表明细菌和古细菌组的 rRNA 比对的 TwinCons 分数。这种组合分析揭示了 rRNA 和 rProtein 在生命之树最深的分支点内的深度共同进化。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a85/8580257/e08ea4b2add0/pcbi.1009541.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a85/8580257/4d01d50e86f5/pcbi.1009541.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a85/8580257/06550fa1056f/pcbi.1009541.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a85/8580257/85491f338eea/pcbi.1009541.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a85/8580257/8f0a674ce597/pcbi.1009541.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a85/8580257/8439c70284ef/pcbi.1009541.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a85/8580257/e08ea4b2add0/pcbi.1009541.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a85/8580257/4d01d50e86f5/pcbi.1009541.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a85/8580257/06550fa1056f/pcbi.1009541.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a85/8580257/85491f338eea/pcbi.1009541.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a85/8580257/8f0a674ce597/pcbi.1009541.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a85/8580257/8439c70284ef/pcbi.1009541.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2a85/8580257/e08ea4b2add0/pcbi.1009541.g006.jpg

相似文献

1
TwinCons: Conservation score for uncovering deep sequence similarity and divergence.TwinCons:用于揭示深度序列相似性和差异的保守性评分。
PLoS Comput Biol. 2021 Oct 29;17(10):e1009541. doi: 10.1371/journal.pcbi.1009541. eCollection 2021 Oct.
2
Gapped alignment of protein sequence motifs through Monte Carlo optimization of a hidden Markov model.通过隐马尔可夫模型的蒙特卡罗优化实现蛋白质序列基序的间隙比对。
BMC Bioinformatics. 2004 Oct 25;5:157. doi: 10.1186/1471-2105-5-157.
3
ProfileGrids as a new visual representation of large multiple sequence alignments: a case study of the RecA protein family.轮廓网格作为大型多序列比对的一种新的可视化表示形式:RecA蛋白家族的案例研究
BMC Bioinformatics. 2008 Dec 22;9:554. doi: 10.1186/1471-2105-9-554.
4
Comparison of archaeal and bacterial genomes: computer analysis of protein sequences predicts novel functions and suggests a chimeric origin for the archaea.古菌与细菌基因组的比较:蛋白质序列的计算机分析预测新功能并暗示古菌的嵌合起源。
Mol Microbiol. 1997 Aug;25(4):619-37. doi: 10.1046/j.1365-2958.1997.4821861.x.
5
Genome alignment, evolution of prokaryotic genome organization, and prediction of gene function using genomic context.基因组比对、原核生物基因组组织的进化以及利用基因组背景预测基因功能。
Genome Res. 2001 Mar;11(3):356-72. doi: 10.1101/gr.gr-1619r.
6
AL2CO: calculation of positional conservation in a protein sequence alignment.AL2CO:蛋白质序列比对中位置保守性的计算
Bioinformatics. 2001 Aug;17(8):700-12. doi: 10.1093/bioinformatics/17.8.700.
7
UPF201 archaeal specific family members reveal structural similarity to RNA-binding proteins but low likelihood for RNA-binding function.UPF201古菌特异性家族成员显示出与RNA结合蛋白的结构相似性,但具有RNA结合功能的可能性较低。
PLoS One. 2008;3(12):e3903. doi: 10.1371/journal.pone.0003903. Epub 2008 Dec 11.
8
Universal and domain-specific sequences in 23S-28S ribosomal RNA identified by computational phylogenetics.通过计算系统发育学鉴定的23S - 28S核糖体RNA中的通用序列和特定结构域序列。
RNA. 2015 Oct;21(10):1719-30. doi: 10.1261/rna.051144.115. Epub 2015 Aug 17.
9
The performance of several multiple-sequence alignment programs in relation to secondary-structure features for an rRNA sequence.几个多序列比对程序针对一个rRNA序列的二级结构特征的性能。
Mol Biol Evol. 2000 Apr;17(4):530-9. doi: 10.1093/oxfordjournals.molbev.a026333.
10
Fast discovery and visualization of conserved regions in DNA sequences using quasi-alignment.使用准比对快速发现和可视化 DNA 序列中的保守区域。
BMC Bioinformatics. 2013;14 Suppl 11(Suppl 11):S2. doi: 10.1186/1471-2105-14-S11-S2. Epub 2013 Sep 13.

引用本文的文献

1
Endogenous geminivirus-like elements in the genus Rhododendron provide insights into the evolutionary origins of the begomovirus nuclear shuttle protein.杜鹃花属中的内源性类双生病毒元件为双生病毒核穿梭蛋白的进化起源提供了见解。
Plant Mol Biol. 2025 Jul 3;115(4):81. doi: 10.1007/s11103-025-01609-2.
2
Evolutionary Dynamics of RuBisCO: Emergence of the Small Subunit and its Impact Through Time.核酮糖-1,5-二磷酸羧化酶/加氧酶的进化动力学:小亚基的出现及其随时间的影响
Mol Biol Evol. 2025 Jan 6;42(1). doi: 10.1093/molbev/msae268.
3
Self-assembling viral histones are evolutionary intermediates between archaeal and eukaryotic nucleosomes.

本文引用的文献

1
ProteoVision: web server for advanced visualization of ribosomal proteins.ProteoVision:用于核糖体蛋白高级可视化的网络服务器。
Nucleic Acids Res. 2021 Jul 2;49(W1):W578-W588. doi: 10.1093/nar/gkab351.
2
Evolution of ribosomal protein network architectures.核糖体蛋白网络结构的进化。
Sci Rep. 2021 Jan 12;11(1):625. doi: 10.1038/s41598-020-80194-4.
3
Supersized Ribosomal RNA Expansion Segments in Asgard Archaea.Asgard 古菌中的超大核糖体 RNA 扩展片段。
自组装病毒组蛋白是古菌和真核核小体之间的进化中间体。
Nat Microbiol. 2024 Jul;9(7):1713-1724. doi: 10.1038/s41564-024-01707-9. Epub 2024 May 28.
4
Emergence of an Orphan Nitrogenase Protein Following Atmospheric Oxygenation.大气氧合作用后,一种氮酶蛋白的出现。
Mol Biol Evol. 2024 Apr 2;41(4). doi: 10.1093/molbev/msae067.
5
An internally controlled system to study microtubule network diversification links tubulin evolution to the use of distinct microtubule regulators.一个用于研究微管网络多样化的内部控制系统将微管蛋白进化与不同微管调节因子的使用联系起来。
bioRxiv. 2024 Jan 9:2024.01.08.573270. doi: 10.1101/2024.01.08.573270.
6
Common evolutionary origins of the bacterial glycyl tRNA synthetase and alanyl tRNA synthetase.细菌甘氨酰tRNA合成酶和丙氨酰tRNA合成酶的共同进化起源。
Protein Sci. 2023 Nov 27;33(3):e4844. doi: 10.1002/pro.4844.
7
Phage-encoded ribosomal protein S21 expression is linked to late-stage phage replication.噬菌体编码的核糖体蛋白S21的表达与噬菌体后期复制相关。
ISME Commun. 2022 Mar 30;2(1):31. doi: 10.1038/s43705-022-00111-w.
8
Shuffled ATG8 interacting motifs form an ancestral bridge between UFMylation and autophagy.乱序的 ATG8 相互作用基序在 UFMylation 和自噬之间形成了一个古老的桥梁。
EMBO J. 2023 May 15;42(10):e112053. doi: 10.15252/embj.2022112053. Epub 2023 Feb 10.
Genome Biol Evol. 2020 Oct 1;12(10):1694-1710. doi: 10.1093/gbe/evaa170.
4
The replication machinery of LUCA: common origin of DNA replication and transcription.原核生物与真核生物共同祖先的复制机制:DNA 复制和转录的共同起源。
BMC Biol. 2020 Jun 9;18(1):61. doi: 10.1186/s12915-020-00800-9.
5
Root of the Tree: The Significance, Evolution, and Origins of the Ribosome.树的根基:核糖体的意义、进化和起源。
Chem Rev. 2020 Jun 10;120(11):4848-4878. doi: 10.1021/acs.chemrev.9b00742. Epub 2020 May 6.
6
A New Analysis of Archaea-Bacteria Domain Separation: Variable Phylogenetic Distance and the Tempo of Early Evolution.古菌-细菌领域分离的新分析:进化早期的可变系统发育距离和时标。
Mol Biol Evol. 2020 Aug 1;37(8):2332-2340. doi: 10.1093/molbev/msaa089.
7
Zebra2: advanced and easy-to-use web-server for bioinformatic analysis of subfamily-specific and conserved positions in diverse protein superfamilies.Zebra2:一个高级且易于使用的网络服务器,用于分析不同蛋白质超家族中亚家族特异性和保守位置的生物信息学。
Nucleic Acids Res. 2020 Jul 2;48(W1):W65-W71. doi: 10.1093/nar/gkaa276.
8
Consistent Metagenome-Derived Metrics Verify and Delineate Bacterial Species Boundaries.源自宏基因组的一致指标验证并划定细菌物种边界。
mSystems. 2020 Jan 14;5(1):e00731-19. doi: 10.1128/mSystems.00731-19.
9
The SCOP database in 2020: expanded classification of representative family and superfamily domains of known protein structures.2020 年的 SCOP 数据库:已知蛋白质结构的代表性家族和超家族域的扩展分类。
Nucleic Acids Res. 2020 Jan 8;48(D1):D376-D382. doi: 10.1093/nar/gkz1064.
10
HH-suite3 for fast remote homology detection and deep protein annotation.HH-suite3 用于快速远程同源检测和深度蛋白质注释。
BMC Bioinformatics. 2019 Sep 14;20(1):473. doi: 10.1186/s12859-019-3019-7.