Suppr超能文献

ProTG4:一个用于从可翻译的G-四链体(4)映射肽的计算机模拟文库中近似通用蛋白质序列的网络服务器。

ProTG4: A Web Server to Approximate the Sequence of a Generic Protein From an in Silico Library of Translatable G-Quadruplex (4)-Mapped Peptides.

作者信息

Kundu Siddhartha

机构信息

Department of Biochemistry, All India Institute of Medical Sciences, New Delhi, India.

出版信息

Bioinform Biol Insights. 2021 Sep 28;15:11779322211045878. doi: 10.1177/11779322211045878. eCollection 2021.

Abstract

An RNA G-quadruplex in the protein coding segment of mRNA is translatable and may potentially impact protein translation. This can be consequent to staggered ribosomal synthesis and/or result in an increased frequency of missense translational events. A mathematical model of the peptides that encompass the substituted amino acids, ie, the -mapped peptidome, has been previously studied. However, the significance and relevance to disease biology of this model remains to be established. ProTG4 computes a confidence-of-sequence-identity -score, which is the average weighted length of every matched -mapped peptide in a generic protein sequence. The weighted length is the product of the length of the peptide and the probability of its non-random occurrence in a library of randomly generated sequences of equivalent lengths. This is then averaged over the entire length of the protein sequence. ProTG4 is simple to operate, has clear instructions, and is accompanied by a set of ready-to-use examples. The rationale of the study, algorithms deployed, and the computational pipeline deployed are also part of the web page. Analyses by ProTG4 of taxonomically diverse protein sequences suggest that there is significant homology to -mapped peptides. These findings, especially in potentially infectious and infesting agents, offer plausible explanations into the aetiology and pathogenesis of certain proteopathies. ProTG4 can also provide a quantitative measure to identify and annotate the canonical form of a generic protein sequence from its known isoforms. The article presents several case studies and discusses the relevance of ProTG4-assisted peptide analysis in gaining insights into various mechanisms of disease biology (mistranslation, alternate splicing, amino acid substitutions).

摘要

信使核糖核酸(mRNA)蛋白质编码区中的RNA G-四链体是可翻译的,并且可能潜在地影响蛋白质翻译。这可能是交错核糖体合成的结果,和/或导致错义翻译事件的频率增加。先前已经研究了包含取代氨基酸的肽段的数学模型,即映射肽组。然而,该模型对疾病生物学的意义和相关性仍有待确定。ProTG4计算序列同一性置信度得分,它是通用蛋白质序列中每个匹配的映射肽段的平均加权长度。加权长度是肽段长度与其在等效长度的随机生成序列库中非随机出现概率的乘积。然后在蛋白质序列的整个长度上求平均值。ProTG4操作简单,说明清晰,并附有一组现成的示例。研究的基本原理、所采用的算法以及计算流程也是网页的一部分。ProTG4对分类学上不同的蛋白质序列进行分析表明,与映射肽段存在显著同源性。这些发现,尤其是在潜在的感染和侵袭因子方面,为某些蛋白质病的病因和发病机制提供了合理的解释。ProTG4还可以提供一种定量方法,从已知的异构体中识别和注释通用蛋白质序列的标准形式。本文介绍了几个案例研究,并讨论了ProTG4辅助肽段分析在深入了解疾病生物学的各种机制(错译、可变剪接、氨基酸取代)方面的相关性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/917a/8482721/0c3fe7f2f3f3/10.1177_11779322211045878-fig1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验