• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种通过傅里叶变换识别蛋白质-蛋白质相互作用的协同进化分析。

A coevolution analysis for identifying protein-protein interactions by Fourier transform.

作者信息

Yin Changchuan, Yau Stephen S-T

机构信息

Department of Mathematics, Statistics and Computer Science, The University of Illinois at Chicago, Chicago, IL 60607-7045, United States of America.

Department of Mathematical Sciences, Tsinghua University, Beijing 100084, China.

出版信息

PLoS One. 2017 Apr 21;12(4):e0174862. doi: 10.1371/journal.pone.0174862. eCollection 2017.

DOI:10.1371/journal.pone.0174862
PMID:28430779
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5400233/
Abstract

Protein-protein interactions (PPIs) play key roles in life processes, such as signal transduction, transcription regulations, and immune response, etc. Identification of PPIs enables better understanding of the functional networks within a cell. Common experimental methods for identifying PPIs are time consuming and expensive. However, recent developments in computational approaches for inferring PPIs from protein sequences based on coevolution theory avoid these problems. In the coevolution theory model, interacted proteins may show coevolutionary mutations and have similar phylogenetic trees. The existing coevolution methods depend on multiple sequence alignments (MSA); however, the MSA-based coevolution methods often produce high false positive interactions. In this paper, we present a computational method using an alignment-free approach to accurately detect PPIs and reduce false positives. In the method, protein sequences are numerically represented by biochemical properties of amino acids, which reflect the structural and functional differences of proteins. Fourier transform is applied to the numerical representation of protein sequences to capture the dissimilarities of protein sequences in biophysical context. The method is assessed for predicting PPIs in Ebola virus. The results indicate strong coevolution between the protein pairs (NP-VP24, NP-VP30, NP-VP40, VP24-VP30, VP24-VP40, and VP30-VP40). The method is also validated for PPIs in influenza and E.coli genomes. Since our method can reduce false positive and increase the specificity of PPI prediction, it offers an effective tool to understand mechanisms of disease pathogens and find potential targets for drug design. The Python programs in this study are available to public at URL (https://github.com/cyinbox/PPI).

摘要

蛋白质-蛋白质相互作用(PPIs)在生命过程中发挥着关键作用,如信号转导、转录调控和免疫反应等。蛋白质-蛋白质相互作用的鉴定有助于更好地理解细胞内的功能网络。鉴定蛋白质-蛋白质相互作用的常见实验方法既耗时又昂贵。然而,最近基于协同进化理论从蛋白质序列推断蛋白质-蛋白质相互作用的计算方法的发展避免了这些问题。在协同进化理论模型中,相互作用的蛋白质可能会显示协同进化突变并具有相似的系统发育树。现有的协同进化方法依赖于多序列比对(MSA);然而,基于多序列比对的协同进化方法往往会产生较高的假阳性相互作用。在本文中,我们提出了一种使用无比对方法来准确检测蛋白质-蛋白质相互作用并减少假阳性的计算方法。在该方法中,蛋白质序列由氨基酸的生化特性进行数值表示,这反映了蛋白质的结构和功能差异。将傅里叶变换应用于蛋白质序列的数值表示,以捕捉生物物理背景下蛋白质序列的差异。该方法在埃博拉病毒中用于预测蛋白质-蛋白质相互作用进行了评估。结果表明蛋白质对(NP-VP24、NP-VP30、NP-VP40、VP24-VP30、VP24-VP40和VP30-VP40)之间存在强烈的协同进化。该方法也在流感和大肠杆菌基因组中的蛋白质-蛋白质相互作用方面得到了验证。由于我们的方法可以减少假阳性并提高蛋白质-蛋白质相互作用预测的特异性,它为理解疾病病原体的机制和寻找药物设计的潜在靶点提供了一个有效的工具。本研究中的Python程序可在URL(https://github.com/cyinbox/PPI)上向公众提供。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2538/5400233/f7d3c2ff59c2/pone.0174862.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2538/5400233/e51a8e9eb771/pone.0174862.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2538/5400233/154a30148377/pone.0174862.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2538/5400233/5cd3a4941387/pone.0174862.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2538/5400233/d8f47f11d6f0/pone.0174862.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2538/5400233/f7d3c2ff59c2/pone.0174862.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2538/5400233/e51a8e9eb771/pone.0174862.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2538/5400233/154a30148377/pone.0174862.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2538/5400233/5cd3a4941387/pone.0174862.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2538/5400233/d8f47f11d6f0/pone.0174862.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2538/5400233/f7d3c2ff59c2/pone.0174862.g005.jpg

相似文献

1
A coevolution analysis for identifying protein-protein interactions by Fourier transform.一种通过傅里叶变换识别蛋白质-蛋白质相互作用的协同进化分析。
PLoS One. 2017 Apr 21;12(4):e0174862. doi: 10.1371/journal.pone.0174862. eCollection 2017.
2
Herbal Lead as Ideal Bioactive Compounds Against Probable Drug Targets of Ebola Virus in Comparison with Known Chemical Analogue: A Computational Drug Discovery Perspective.与已知化学类似物相比,草药先导化合物作为针对埃博拉病毒潜在药物靶点的理想生物活性化合物:计算药物发现视角
Interdiscip Sci. 2017 Jun;9(2):254-277. doi: 10.1007/s12539-016-0149-8. Epub 2016 Feb 8.
3
Predicting Protein-Protein Interactions via Random Ferns with Evolutionary Matrix Representation.基于进化矩阵表示的随机蕨类预测蛋白质-蛋白质相互作用。
Comput Math Methods Med. 2022 Feb 22;2022:7191684. doi: 10.1155/2022/7191684. eCollection 2022.
4
Sequence analysis of the GP, NP, VP40 and VP24 genes of Ebola virus isolated from deceased, surviving and asymptomatically infected individuals during the 1996 outbreak in Gabon: comparative studies and phylogenetic characterization.1996年加蓬埃博拉病毒疫情期间,从死亡、存活及无症状感染者体内分离出的埃博拉病毒的糖蛋白(GP)、核蛋白(NP)、病毒蛋白40(VP40)和病毒蛋白24(VP24)基因的序列分析:比较研究和系统发育特征分析
J Gen Virol. 2002 Jan;83(Pt 1):67-73. doi: 10.1099/0022-1317-83-1-67.
5
Contribution of ebola virus glycoprotein, nucleoprotein, and VP24 to budding of VP40 virus-like particles.埃博拉病毒糖蛋白、核蛋白和VP24对VP40病毒样颗粒出芽的作用。
J Virol. 2004 Jul;78(14):7344-51. doi: 10.1128/JVI.78.14.7344-7351.2004.
6
Complete nucleotide sequences of Marburg virus genes 5 and 6 encoding VP30 and VP24 proteins.编码VP30和VP24蛋白的马尔堡病毒基因5和基因6的完整核苷酸序列。
Biochem Mol Biol Int. 1995 Mar;35(3):605-13.
7
Conserved differences in protein sequence determine the human pathogenicity of Ebolaviruses.蛋白质序列中的保守差异决定了埃博拉病毒对人类的致病性。
Sci Rep. 2016 Mar 24;6:23743. doi: 10.1038/srep23743.
8
Ebolavirus classification based on natural vectors.基于天然载体的埃博拉病毒分类。
DNA Cell Biol. 2015 Jun;34(6):418-28. doi: 10.1089/dna.2014.2678. Epub 2015 Mar 24.
9
A new method to cluster genomes based on cumulative Fourier power spectrum.一种基于累积傅里叶功率谱的基因组聚类新方法。
Gene. 2018 Oct 5;673:239-250. doi: 10.1016/j.gene.2018.06.042. Epub 2018 Jun 20.
10
Recent advances in predicting and modeling protein-protein interactions.预测和建模蛋白质-蛋白质相互作用的最新进展。
Trends Biochem Sci. 2023 Jun;48(6):527-538. doi: 10.1016/j.tibs.2023.03.003. Epub 2023 Apr 14.

引用本文的文献

1
CBIL-VHPLI: a model for predicting viral-host protein-lncRNA interactions based on machine learning and transfer learning.CBIL-VHPLI:一种基于机器学习和迁移学习的预测病毒-宿主蛋白-lncRNA 相互作用的模型。
Sci Rep. 2024 Jul 30;14(1):17549. doi: 10.1038/s41598-024-68750-8.
2
Engineering and Expression Strategies for Optimization of L-Asparaginase Development and Production.工程化与表达策略优化 L-天冬酰胺酶的开发与生产。
Int J Mol Sci. 2023 Oct 16;24(20):15220. doi: 10.3390/ijms242015220.
3
FFP: joint Fast Fourier transform and fractal dimension in amino acid property-aware phylogenetic analysis.

本文引用的文献

1
Extracting Coevolutionary Features from Protein Sequences for Predicting Protein-Protein Interactions.从蛋白质序列中提取协同进化特征以预测蛋白质-蛋白质相互作用
IEEE/ACM Trans Comput Biol Bioinform. 2017 Jan-Feb;14(1):155-166. doi: 10.1109/TCBB.2016.2520923. Epub 2016 Jan 22.
2
An improved model for whole genome phylogenetic analysis by Fourier transform.一种通过傅里叶变换进行全基因组系统发育分析的改进模型。
J Theor Biol. 2015 Oct 7;382:99-110. doi: 10.1016/j.jtbi.2015.06.033. Epub 2015 Jul 4.
3
A new method to cluster DNA sequences using Fourier power spectrum.
FFP:氨基酸特性感知系统发育分析中的联合快速傅里叶变换和分形维数。
BMC Bioinformatics. 2022 Aug 19;23(1):347. doi: 10.1186/s12859-022-04889-3.
4
Protein-protein interaction and non-interaction predictions using gene sequence natural vector.利用基因序列自然向量进行蛋白质-蛋白质相互作用和非相互作用预测。
Commun Biol. 2022 Jul 2;5(1):652. doi: 10.1038/s42003-022-03617-0.
5
Mannosylated hemagglutinin peptides bind cyanovirin-N independent of disulfide-bonds in complementary binding sites.甘露糖基化血凝素肽结合氰病毒-N,不依赖于互补结合位点中的二硫键。
RSC Adv. 2020 Mar 17;10(19):11079-11087. doi: 10.1039/d0ra01128b. eCollection 2020 Mar 16.
6
Spatially Enriched Paralog Rearrangements Argue Functionally Diverse Ribosomes Arise during Cold Acclimation in Arabidopsis.空间富集的基因旁系同源重组事件表明在拟南芥的冷驯化过程中产生了功能多样化的核糖体。
Int J Mol Sci. 2021 Jun 7;22(11):6160. doi: 10.3390/ijms22116160.
7
Coevolutionary and Phylogenetic Analysis of Mimiviral Replication Machinery Suggest the Cellular Origin of Mimiviruses.拟病毒复制机制的共进化和系统发育分析提示拟病毒的细胞起源。
Mol Biol Evol. 2021 May 4;38(5):2014-2029. doi: 10.1093/molbev/msab003.
8
A Novel Protein Mapping Method for Predicting the Protein Interactions in COVID-19 Disease by Deep Learning.一种通过深度学习预测 COVID-19 疾病中蛋白质相互作用的新型蛋白质图谱绘制方法。
Interdiscip Sci. 2021 Mar;13(1):44-60. doi: 10.1007/s12539-020-00405-4. Epub 2021 Jan 12.
9
SWSPM: A Novel Alignment-Free DNA Comparison Method Based on Signal Processing Approaches.SWSPM:一种基于信号处理方法的新型无比对DNA比较方法。
Evol Bioinform Online. 2019 May 30;15:1176934319849071. doi: 10.1177/1176934319849071. eCollection 2019.
10
Encodings and models for antimicrobial peptide classification for multi-resistant pathogens.用于多重耐药病原体抗菌肽分类的编码与模型
BioData Min. 2019 Mar 4;12:7. doi: 10.1186/s13040-019-0196-x. eCollection 2019.
一种使用傅里叶功率谱对DNA序列进行聚类的新方法。
J Theor Biol. 2015 May 7;372:135-45. doi: 10.1016/j.jtbi.2015.02.026. Epub 2015 Mar 5.
4
The contribution of PA-X to the virulence of pandemic 2009 H1N1 and highly pathogenic H5N1 avian influenza viruses.PA-X对2009年大流行性H1N1流感病毒和高致病性H5N1禽流感病毒毒力的作用。
Sci Rep. 2015 Feb 5;5:8262. doi: 10.1038/srep08262.
5
Protein-protein Interaction Networks of E. coli and S. cerevisiae are similar.大肠杆菌和酿酒酵母的蛋白质-蛋白质相互作用网络相似。
Sci Rep. 2014 Nov 28;4:7187. doi: 10.1038/srep07187.
6
A measure of DNA sequence similarity by Fourier Transform with applications on hierarchical clustering.一种通过傅里叶变换衡量DNA序列相似性及其在层次聚类中的应用
J Theor Biol. 2014 Oct 21;359:18-28. doi: 10.1016/j.jtbi.2014.05.043. Epub 2014 Jun 6.
7
Quantitative genome-wide genetic interaction screens reveal global epistatic relationships of protein complexes in Escherichia coli.全基因组范围内的定量遗传相互作用筛选揭示了大肠杆菌中蛋白质复合物的全局上位关系。
PLoS Genet. 2014 Feb 20;10(2):e1004120. doi: 10.1371/journal.pgen.1004120. eCollection 2014 Feb.
8
The binary protein-protein interaction landscape of Escherichia coli.大肠杆菌的二元蛋白质-蛋白质相互作用图谱
Nat Biotechnol. 2014 Mar;32(3):285-290. doi: 10.1038/nbt.2831. Epub 2014 Feb 23.
9
Computational Prediction of Protein-Protein Interaction Networks: Algo-rithms and Resources.计算蛋白质-蛋白质相互作用网络的预测:算法和资源。
Curr Genomics. 2013 Sep;14(6):397-414. doi: 10.2174/1389202911314060004.
10
The interactomes of influenza virus NS1 and NS2 proteins identify new host factors and provide insights for ADAR1 playing a supportive role in virus replication.流感病毒 NS1 和 NS2 蛋白的相互作用组鉴定了新的宿主因子,并为 ADAR1 在病毒复制中发挥支持作用提供了新的见解。
PLoS Pathog. 2013;9(7):e1003440. doi: 10.1371/journal.ppat.1003440. Epub 2013 Jul 4.