• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用具有种系发生树意识的方法比较真核生物系统发生谱分析方法。

Comparison of eukaryotic phylogenetic profiling approaches using species tree aware methods.

机构信息

Laboratoire de Biologie et Génomique Intégrative, Département de Biologie et Génomique Structurales, Institut de Génétique et de Biologie Moléculaire et Cellulaire, CNRS/INSERM/UDS, Illkirch, France.

出版信息

BMC Bioinformatics. 2009 Nov 24;10:383. doi: 10.1186/1471-2105-10-383.

DOI:10.1186/1471-2105-10-383
PMID:19930674
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2787529/
Abstract

BACKGROUND

Phylogenetic profiling encompasses an important set of methodologies for in silico high throughput inference of functional relationships between genes. The simplest profiles represent the distribution of gene presence-absence in a set of species as a sequence of 0's and 1's, and it is assumed that functionally related genes will have more similar profiles. The methodology has been successfully used in numerous studies of prokaryotic genomes, although its application in eukaryotes appears problematic, with reported low accuracy due to the complex genomic organization within this domain of life. Recently some groups have proposed an alternative approach based on the correlation of homologous gene group sizes, taking into account all potentially informative genetic events leading to a change in group size, regardless of whether they result in a de novo group gain or total gene group loss.

RESULTS

We have compared the performance of classical presence-absence and group size based approaches using a large, diverse set of eukaryotic species. In contrast to most previous comparisons in Eukarya, we take into account the species phylogeny. We also compare the approaches using two different group categories, based on orthology and on domain-sharing. Our results confirm a limited overall performance of phylogenetic profiling in eukaryotes. Although group size based approaches initially showed an increase in performance for the domain-sharing based groups, this seems to be an overestimation due to a simplistic negative control dataset and the choice of null hypothesis rejection criteria.

CONCLUSION

Presence-absence profiling represents a more accurate classifier of related versus non-related profile pairs, when the profiles under consideration have enough information content. Group size based approaches provide a complementary means of detecting domain or family level co-evolution between groups that may be elusive to presence-absence profiling. Moreover positive correlation between co-evolution scores and functional links imply that these methods could be used to estimate functional distances between gene groups and to cluster them based on their functional relatedness. This study should have important implications for the future development and application of phylogenetic profiling methods, not only in eukaryotic, but also in prokaryotic datasets.

摘要

背景

系统发育分析涵盖了一组重要的方法,用于通过计算机对基因之间的功能关系进行高通量推断。最简单的分析方法是将一组物种中的基因存在或缺失情况表示为 0 和 1 的序列,并且假设功能相关的基因将具有更相似的分析方法。该方法已成功应用于许多原核基因组的研究中,尽管由于生命领域内的复杂基因组组织,其在真核生物中的应用似乎存在问题,报告的准确性较低。最近,一些小组提出了一种基于同源基因组大小相关性的替代方法,考虑了导致基因组大小变化的所有潜在信息遗传事件,无论它们是否导致新的基因组获得或基因组完全丢失。

结果

我们使用大量不同的真核生物物种比较了经典的存在缺失和基因组大小分析方法的性能。与真核生物中大多数先前的比较不同,我们考虑了物种的系统发育关系。我们还使用基于同源性和基于结构域共享的两种不同的基因组类别来比较这些方法。我们的结果证实了系统发育分析在真核生物中的整体性能有限。尽管基于基因组大小的方法最初在基于结构域共享的基因组中显示出性能的提高,但由于简单的负控制数据集和选择的无效假设拒绝标准,这似乎是一种高估。

结论

当考虑的分析方法具有足够的信息量时,基于存在缺失的分析方法是一种更准确的相关与非相关分析方法。基于基因组大小的方法提供了一种互补的方法,用于检测基因组之间的域或家族水平的共同进化,这可能是基于存在缺失的分析方法难以检测到的。此外,共同进化评分与功能链接之间的正相关关系意味着,这些方法可用于估计基因组之间的功能距离,并根据其功能相关性对它们进行聚类。这项研究对于未来系统发育分析方法的发展和应用具有重要意义,不仅在真核生物中,而且在原核生物数据集中也是如此。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/022f/2787529/2fa8903c3ffb/1471-2105-10-383-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/022f/2787529/c8782257f8ce/1471-2105-10-383-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/022f/2787529/2e6f1ce6e94e/1471-2105-10-383-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/022f/2787529/9196a400172d/1471-2105-10-383-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/022f/2787529/496e65a144f3/1471-2105-10-383-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/022f/2787529/b71aa4b48b0f/1471-2105-10-383-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/022f/2787529/393e7684dfbe/1471-2105-10-383-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/022f/2787529/4f1dfd663439/1471-2105-10-383-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/022f/2787529/2fa8903c3ffb/1471-2105-10-383-8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/022f/2787529/c8782257f8ce/1471-2105-10-383-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/022f/2787529/2e6f1ce6e94e/1471-2105-10-383-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/022f/2787529/9196a400172d/1471-2105-10-383-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/022f/2787529/496e65a144f3/1471-2105-10-383-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/022f/2787529/b71aa4b48b0f/1471-2105-10-383-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/022f/2787529/393e7684dfbe/1471-2105-10-383-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/022f/2787529/4f1dfd663439/1471-2105-10-383-7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/022f/2787529/2fa8903c3ffb/1471-2105-10-383-8.jpg

相似文献

1
Comparison of eukaryotic phylogenetic profiling approaches using species tree aware methods.使用具有种系发生树意识的方法比较真核生物系统发生谱分析方法。
BMC Bioinformatics. 2009 Nov 24;10:383. doi: 10.1186/1471-2105-10-383.
2
Comparative assessment of performance and genome dependence among phylogenetic profiling methods.系统发育谱分析方法之间性能和基因组依赖性的比较评估。
BMC Bioinformatics. 2006 Sep 27;7:420. doi: 10.1186/1471-2105-7-420.
3
Phylogenetic profiling in eukaryotes: The effect of species, orthologous group, and interactome selection on protein interaction prediction.真核生物的系统发生分析:物种、直系同源群和相互作用组选择对蛋白质相互作用预测的影响。
PLoS One. 2022 Apr 14;17(4):e0251833. doi: 10.1371/journal.pone.0251833. eCollection 2022.
4
Scalable phylogenetic profiling using MinHash uncovers likely eukaryotic sexual reproduction genes.使用 MinHash 进行可扩展的系统发育分析揭示了可能的真核生物有性生殖基因。
PLoS Comput Biol. 2020 Jul 22;16(7):e1007553. doi: 10.1371/journal.pcbi.1007553. eCollection 2020 Jul.
5
Predicting protein function with hierarchical phylogenetic profiles: the Gene3D Phylo-Tuner method applied to eukaryotic genomes.利用分层系统发育谱预测蛋白质功能:应用于真核生物基因组的Gene3D系统发育调整器方法
PLoS Comput Biol. 2007 Nov;3(11):e237. doi: 10.1371/journal.pcbi.0030237. Epub 2007 Oct 18.
6
Towards validating the hypothesis of phylogenetic profiling.迈向验证系统发育谱分析假说。
BMC Bioinformatics. 2007 Nov 1;8 Suppl 7(Suppl 7):S25. doi: 10.1186/1471-2105-8-S7-S25.
7
Benchmarking orthology methods using phylogenetic patterns defined at the base of Eukaryotes.使用在真核生物基部定义的系统发育模式对同源物方法进行基准测试。
Brief Bioinform. 2021 May 20;22(3). doi: 10.1093/bib/bbaa206.
8
Maximum likelihood phylogenetic reconstruction from high-resolution whole-genome data and a tree of 68 eukaryotes.基于高分辨率全基因组数据的最大似然系统发育重建及68种真核生物的系统发育树
Pac Symp Biocomput. 2013:285-96.
9
Co-Inheritance Analysis within the Domains of Life Substantially Improves Network Inference by Phylogenetic Profiling.生命域内的共遗传分析通过系统发育谱分析显著改善网络推断。
PLoS One. 2015 Sep 22;10(9):e0139006. doi: 10.1371/journal.pone.0139006. eCollection 2015.
10
The gene tree delusion.基因树错觉
Mol Phylogenet Evol. 2016 Jan;94(Pt A):1-33. doi: 10.1016/j.ympev.2015.07.018. Epub 2015 Jul 31.

引用本文的文献

1
Scalable phylogenetic profiling using MinHash uncovers likely eukaryotic sexual reproduction genes.使用 MinHash 进行可扩展的系统发育分析揭示了可能的真核生物有性生殖基因。
PLoS Comput Biol. 2020 Jul 22;16(7):e1007553. doi: 10.1371/journal.pcbi.1007553. eCollection 2020 Jul.
2
Novel phylogenetic methods are needed for understanding gene function in the era of mega-scale genome sequencing.新型进化方法是理解大规模基因组测序时代基因功能所必需的。
Nucleic Acids Res. 2020 Mar 18;48(5):2209-2219. doi: 10.1093/nar/gkz1241.
3
PhyloPro2.0: a database for the dynamic exploration of phylogenetically conserved proteins and their domain architectures across the Eukarya.

本文引用的文献

1
Co-evolutionary networks of genes and cellular processes across fungal species.跨真菌物种的基因与细胞过程的共同进化网络。
Genome Biol. 2009;10(5):R48. doi: 10.1186/gb-2009-10-5-r48. Epub 2009 May 5.
2
Testing the accuracy of eukaryotic phylogenetic profiles for prediction of biological function.测试真核生物系统发育分布特征预测生物功能的准确性。
Evol Bioinform Online. 2008 Jun 18;4:217-23. doi: 10.4137/ebo.s863.
3
Phylogenetic and functional assessment of orthologs inference projects and methods.直系同源物推断项目和方法的系统发育与功能评估。
PhyloPro2.0:一个用于动态探索真核生物中系统发育保守蛋白及其结构域架构的数据库。
Database (Oxford). 2016 Mar 15;2016. doi: 10.1093/database/baw013. Print 2016.
4
PhyloGene server for identification and visualization of co-evolving proteins using normalized phylogenetic profiles.PhyloGene服务器:利用标准化系统发育谱鉴定和可视化共同进化的蛋白质。
Nucleic Acids Res. 2015 Jul 1;43(W1):W154-9. doi: 10.1093/nar/gkv452. Epub 2015 May 9.
5
Phylogenetic profiling: how much input data is enough?系统发育谱分析:多少输入数据才足够?
PLoS One. 2015 Feb 13;10(2):e0114701. doi: 10.1371/journal.pone.0114701. eCollection 2015.
6
Human disease locus discovery and mapping to molecular pathways through phylogenetic profiling.通过系统发生谱分析发现人类疾病基因座并映射到分子途径。
Mol Syst Biol. 2013 Oct 1;9:692. doi: 10.1038/msb.2013.50.
7
Combining phylogenetic profiling-based and machine learning-based techniques to predict functional related proteins.结合基于系统发育轮廓和基于机器学习的技术来预测功能相关蛋白。
PLoS One. 2013 Sep 19;8(9):e75940. doi: 10.1371/journal.pone.0075940. eCollection 2013.
8
Genetic co-occurrence network across sequenced microbes.测序微生物间的遗传共发生网络。
PLoS Comput Biol. 2011 Dec;7(12):e1002340. doi: 10.1371/journal.pcbi.1002340. Epub 2011 Dec 29.
9
Phylogenetically informed logic relationships improve detection of biological network organization.系统发育信息逻辑关系提高了生物网络组织的检测能力。
BMC Bioinformatics. 2011 Dec 15;12:476. doi: 10.1186/1471-2105-12-476.
10
Inference of gain and loss events from phyletic patterns using stochastic mapping and maximum parsimony--a simulation study.基于随机映射和最大简约法的系统发育模式推断增益和损失事件——模拟研究。
Genome Biol Evol. 2011;3:1265-75. doi: 10.1093/gbe/evr101. Epub 2011 Oct 4.
PLoS Comput Biol. 2009 Jan;5(1):e1000262. doi: 10.1371/journal.pcbi.1000262. Epub 2009 Jan 16.
4
InterPro: the integrative protein signature database.InterPro:综合蛋白质特征数据库。
Nucleic Acids Res. 2009 Jan;37(Database issue):D211-5. doi: 10.1093/nar/gkn785. Epub 2008 Oct 21.
5
Whole-genome analysis reveals molecular innovations and evolutionary transitions in chromalveolate species.全基因组分析揭示了色藻界物种的分子创新和进化转变。
Proc Natl Acad Sci U S A. 2008 Mar 4;105(9):3427-32. doi: 10.1073/pnas.0712248105. Epub 2008 Feb 25.
6
Coevolution of gene families in prokaryotes.原核生物中基因家族的共同进化。
Genome Res. 2008 Mar;18(3):462-8. doi: 10.1101/gr.6815508. Epub 2008 Jan 29.
7
Predicting protein function with hierarchical phylogenetic profiles: the Gene3D Phylo-Tuner method applied to eukaryotic genomes.利用分层系统发育谱预测蛋白质功能:应用于真核生物基因组的Gene3D系统发育调整器方法
PLoS Comput Biol. 2007 Nov;3(11):e237. doi: 10.1371/journal.pcbi.0030237. Epub 2007 Oct 18.
8
Gene annotation and pathway mapping in KEGG.基因注释与KEGG中的通路映射。
Methods Mol Biol. 2007;396:71-91. doi: 10.1007/978-1-59745-515-2_6.
9
Inference of gene function based on gene fusion events: the rosetta-stone method.基于基因融合事件推断基因功能:罗塞塔石碑法。
Methods Mol Biol. 2007;396:31-41. doi: 10.1007/978-1-59745-515-2_3.
10
The Gene Ontology project in 2008.2008年的基因本体论项目。
Nucleic Acids Res. 2008 Jan;36(Database issue):D440-4. doi: 10.1093/nar/gkm883. Epub 2007 Nov 4.