• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

系统发育dCor:作为系统发育谱分析新指标的距离相关性

Phylo_dCor: distance correlation as a novel metric for phylogenetic profiling.

作者信息

Sferra Gabriella, Fratini Federica, Ponzi Marta, Pizzi Elisabetta

机构信息

Dipartimento di Malattie Infettive, Parassitarie e Immunomediate, Istituto Superiore di Sanità, Viale Regina Elena 299, 00161, Rome, Italy.

出版信息

BMC Bioinformatics. 2017 Sep 5;18(1):396. doi: 10.1186/s12859-017-1815-5.

DOI:10.1186/s12859-017-1815-5
PMID:28870256
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5584357/
Abstract

BACKGROUND

Elaboration of powerful methods to predict functional and/or physical protein-protein interactions from genome sequence is one of the main tasks in the post-genomic era. Phylogenetic profiling allows the prediction of protein-protein interactions at a whole genome level in both Prokaryotes and Eukaryotes. For this reason it is considered one of the most promising methods.

RESULTS

Here, we propose an improvement of phylogenetic profiling that enables handling of large genomic datasets and infer global protein-protein interactions. This method uses the distance correlation as a new measure of phylogenetic profile similarity. We constructed robust reference sets and developed Phylo-dCor, a parallelized version of the algorithm for calculating the distance correlation that makes it applicable to large genomic data. Using Saccharomyces cerevisiae and Escherichia coli genome datasets, we showed that Phylo-dCor outperforms phylogenetic profiling methods previously described based on the mutual information and Pearson's correlation as measures of profile similarity.

CONCLUSIONS

In this work, we constructed and assessed robust reference sets and propose the distance correlation as a measure for comparing phylogenetic profiles. To make it applicable to large genomic data, we developed Phylo-dCor, a parallelized version of the algorithm for calculating the distance correlation. Two R scripts that can be run on a wide range of machines are available upon request.

摘要

背景

从基因组序列预测功能和/或物理蛋白质-蛋白质相互作用的强大方法的开发是后基因组时代的主要任务之一。系统发育谱分析允许在原核生物和真核生物的全基因组水平上预测蛋白质-蛋白质相互作用。因此,它被认为是最有前途的方法之一。

结果

在此,我们提出了一种系统发育谱分析的改进方法,该方法能够处理大型基因组数据集并推断全局蛋白质-蛋白质相互作用。该方法使用距离相关性作为系统发育谱相似性的新度量。我们构建了稳健的参考集,并开发了Phylo-dCor,这是一种用于计算距离相关性的算法的并行版本,使其适用于大型基因组数据。使用酿酒酵母和大肠杆菌基因组数据集,我们表明Phylo-dCor优于先前基于互信息和皮尔逊相关性作为谱相似性度量所描述的系统发育谱分析方法。

结论

在这项工作中,我们构建并评估了稳健的参考集,并提出距离相关性作为比较系统发育谱的一种度量。为了使其适用于大型基因组数据,我们开发了Phylo-dCor,这是一种用于计算距离相关性的算法的并行版本。如有需要,可提供两个可在多种机器上运行的R脚本。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ed5/5584357/86a7d02c71e7/12859_2017_1815_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ed5/5584357/7270b738d55e/12859_2017_1815_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ed5/5584357/86a7d02c71e7/12859_2017_1815_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ed5/5584357/7270b738d55e/12859_2017_1815_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9ed5/5584357/86a7d02c71e7/12859_2017_1815_Fig2_HTML.jpg

相似文献

1
Phylo_dCor: distance correlation as a novel metric for phylogenetic profiling.系统发育dCor:作为系统发育谱分析新指标的距离相关性
BMC Bioinformatics. 2017 Sep 5;18(1):396. doi: 10.1186/s12859-017-1815-5.
2
Comparative assessment of performance and genome dependence among phylogenetic profiling methods.系统发育谱分析方法之间性能和基因组依赖性的比较评估。
BMC Bioinformatics. 2006 Sep 27;7:420. doi: 10.1186/1471-2105-7-420.
3
Protein-protein Interaction Networks of E. coli and S. cerevisiae are similar.大肠杆菌和酿酒酵母的蛋白质-蛋白质相互作用网络相似。
Sci Rep. 2014 Nov 28;4:7187. doi: 10.1038/srep07187.
4
Interrogating noise in protein sequences from the perspective of protein-protein interactions prediction.从蛋白质-蛋白质相互作用预测的角度探讨蛋白质序列中的噪声。
J Theor Biol. 2012 Dec 21;315:64-70. doi: 10.1016/j.jtbi.2012.09.007. Epub 2012 Sep 18.
5
Discovering functional linkages and uncharacterized cellular pathways using phylogenetic profile comparisons: a comprehensive assessment.利用系统发育谱比较发现功能联系和未表征的细胞途径:一项综合评估。
BMC Bioinformatics. 2007 May 23;8:173. doi: 10.1186/1471-2105-8-173.
6
Advancing the prediction accuracy of protein-protein interactions by utilizing evolutionary information from position-specific scoring matrix and ensemble classifier.利用来自位置特异性得分矩阵的进化信息和集成分类器提高蛋白质-蛋白质相互作用的预测准确性。
J Theor Biol. 2017 Apr 7;418:105-110. doi: 10.1016/j.jtbi.2017.01.003. Epub 2017 Jan 11.
7
Evaluation of GO-based functional similarity measures using S. cerevisiae protein interaction and expression profile data.利用酿酒酵母蛋白质相互作用和表达谱数据评估基于GO的功能相似性度量
BMC Bioinformatics. 2008 Nov 6;9:472. doi: 10.1186/1471-2105-9-472.
8
Gene interaction networks based on kernel correlation metrics.基于核相关度量的基因相互作用网络。
Int J Comput Biol Drug Des. 2013;6(1-2):72-92. doi: 10.1504/IJCBDD.2013.052203. Epub 2013 Feb 21.
9
Overrepresentation of interactions between homologous proteins in interactomes.相互作用组中同源蛋白质间相互作用的过度呈现。
FEBS Lett. 2007 Jan 9;581(1):52-6. doi: 10.1016/j.febslet.2006.11.076. Epub 2006 Dec 8.
10
Prediction of metabolic fluxes from gene expression data with Huber penalty convex optimization function.使用Huber罚函数凸优化函数从基因表达数据预测代谢通量
Mol Biosyst. 2017 May 2;13(5):901-909. doi: 10.1039/c6mb00811a.

引用本文的文献

1
CladeOScope: functional interactions through the prism of clade-wise co-evolution.进化枝观测镜:通过进化枝特异性共同进化视角探究功能相互作用
NAR Genom Bioinform. 2021 Apr 20;3(2):lqab024. doi: 10.1093/nargab/lqab024. eCollection 2021 Jun.

本文引用的文献

1
Phylogenetic Profiling for Probing the Modular Architecture of the Human Genome.用于探究人类基因组模块化结构的系统发育谱分析
Cell Syst. 2015 Aug 26;1(2):106-15. doi: 10.1016/j.cels.2015.08.006.
2
Genome-wide protein-protein interactions and protein function exploration in cyanobacteria.蓝藻全基因组蛋白质-蛋白质相互作用及蛋白质功能探索
Sci Rep. 2015 Oct 22;5:15519. doi: 10.1038/srep15519.
3
Expansion of biological pathways based on evolutionary inference.基于进化推断的生物途径扩展。
Cell. 2014 Jul 3;158(1):213-25. doi: 10.1016/j.cell.2014.05.034.
4
STRING v9.1: protein-protein interaction networks, with increased coverage and integration.STRING v9.1:蛋白质-蛋白质相互作用网络,具有更高的覆盖度和集成度。
Nucleic Acids Res. 2013 Jan;41(Database issue):D808-15. doi: 10.1093/nar/gks1094. Epub 2012 Nov 29.
5
Using phylogenetic profiles to predict functional relationships.利用系统发育谱预测功能关系。
Methods Mol Biol. 2012;804:167-77. doi: 10.1007/978-1-61779-361-5_9.
6
eggNOG v3.0: orthologous groups covering 1133 organisms at 41 different taxonomic ranges.eggNOG v3.0:涵盖了 41 个不同分类范围的 1133 个生物体的直系同源物组。
Nucleic Acids Res. 2012 Jan;40(Database issue):D284-9. doi: 10.1093/nar/gkr1060. Epub 2011 Nov 16.
7
Practical and theoretical advances in predicting the function of a protein by its phylogenetic distribution.通过蛋白质的系统发育分布预测其功能的实践与理论进展。
J R Soc Interface. 2008 Feb 6;5(19):151-70. doi: 10.1098/rsif.2007.1047.
8
Phylogenetic profiles for the prediction of protein-protein interactions: how to select reference organisms?用于预测蛋白质-蛋白质相互作用的系统发育谱:如何选择参考生物体?
Biochem Biophys Res Commun. 2007 Feb 23;353(4):985-91. doi: 10.1016/j.bbrc.2006.12.146. Epub 2006 Dec 27.
9
Comparative assessment of performance and genome dependence among phylogenetic profiling methods.系统发育谱分析方法之间性能和基因组依赖性的比较评估。
BMC Bioinformatics. 2006 Sep 27;7:420. doi: 10.1186/1471-2105-7-420.
10
Functional annotation from predicted protein interaction networks.来自预测蛋白质相互作用网络的功能注释。
Bioinformatics. 2005 Aug 1;21(15):3217-26. doi: 10.1093/bioinformatics/bti514. Epub 2005 May 26.