• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

规范化比尔勒-霍姆斯-沃格特曼树空间中的核。

Normalizing Kernels in the Billera-Holmes-Vogtmann Treespace.

出版信息

IEEE/ACM Trans Comput Biol Bioinform. 2017 Nov-Dec;14(6):1359-1365. doi: 10.1109/TCBB.2016.2565475. Epub 2016 May 10.

DOI:10.1109/TCBB.2016.2565475
PMID:28113725
Abstract

As costs of genome sequencing have dropped precipitously, development of efficient bioinformatic methods to analyze genome structure and evolution have become ever more urgent. For example, most published phylogenomic studies involve either massive concatenation of sequences, or informal comparisons of phylogenies inferred on a small subset of orthologous genes, neither of which provides a comprehensive overview of evolution or systematic identification of genes with unusual and interesting evolution (e.g., horizontal gene transfers, gene duplication, and subsequent neofunctionalization). We are interested in identifying such "outlying" gene trees from the set of gene trees and estimating the distribution of trees over the "tree space". This paper describes an improvement to the kdetrees algorithm, an adaptation of classical kernel density estimation to the metric space of phylogenetic trees (Billera-Holmes-Vogtman treespace), whereby the kernel normalizing constants, are estimated through the use of the novel holonomic gradient methods. As in the original kdetrees paper, we have applied kdetrees to a set of Apicomplexa genes. The analysis identified several unreliable sequence alignments that had escaped previous detection, as well as a gene independently reported as a possible case of horizontal gene transfer. The updated version of the kdetrees software package is available both from CRAN (the official R package system), as well as from the official development repository on Github. ( github.com/grady/kdetrees).

摘要

随着基因组测序成本的急剧下降,开发有效的生物信息学方法来分析基因组结构和进化变得更加紧迫。例如,大多数已发表的系统发育基因组学研究要么涉及大量序列的串联,要么是对一小部分直系同源基因推断的系统发育进行非正式比较,这两者都不能提供进化的全面概述或系统地识别具有异常和有趣进化的基因(例如,水平基因转移、基因复制和随后的新功能化)。我们有兴趣从基因树集中识别出这些“异常”的基因树,并估计树在“树空间”中的分布。本文描述了对 kdetrees 算法的改进,即将经典核密度估计方法应用于系统发育树的度量空间(Billera-Holmes-Vogtman 树空间),其中核归一化常数通过使用新颖的全积分梯度方法进行估计。与原始 kdetrees 论文一样,我们将 kdetrees 应用于一组 Apicomplexa 基因。分析确定了几个以前未检测到的不可靠序列比对,以及一个被独立报道为可能发生水平基因转移的基因。kdetrees 软件包的更新版本可从 CRAN(官方 R 包系统)以及 Github 上的官方开发存储库获得。(github.com/grady/kdetrees)。

相似文献

1
Normalizing Kernels in the Billera-Holmes-Vogtmann Treespace.规范化比尔勒-霍姆斯-沃格特曼树空间中的核。
IEEE/ACM Trans Comput Biol Bioinform. 2017 Nov-Dec;14(6):1359-1365. doi: 10.1109/TCBB.2016.2565475. Epub 2016 May 10.
2
kdetrees: Non-parametric estimation of phylogenetic tree distributions.KD树:系统发育树分布的非参数估计
Bioinformatics. 2014 Aug 15;30(16):2280-7. doi: 10.1093/bioinformatics/btu258. Epub 2014 Apr 24.
3
Tropical Density Estimation of Phylogenetic Trees.系统发育树的热带密度估计
IEEE/ACM Trans Comput Biol Bioinform. 2024 Nov-Dec;21(6):1855-1863. doi: 10.1109/TCBB.2024.3420815. Epub 2024 Dec 10.
4
GATC: a genetic algorithm for gene tree construction under the Duplication-Transfer-Loss model of evolution.GATC:一种在进化的复制-转移-丢失模型下构建基因树的遗传算法。
BMC Genomics. 2018 May 9;19(Suppl 2):102. doi: 10.1186/s12864-018-4455-x.
5
Estimating optimal species trees from incomplete gene trees under deep coalescence.在深度溯祖情况下从不完整基因树估计最优物种树。
J Comput Biol. 2012 Jun;19(6):591-605. doi: 10.1089/cmb.2012.0037.
6
Geodesics to characterize the phylogenetic landscape.测地线刻画系统发育景观。
PLoS One. 2023 Jun 23;18(6):e0287350. doi: 10.1371/journal.pone.0287350. eCollection 2023.
7
SATe-II: very fast and accurate simultaneous estimation of multiple sequence alignments and phylogenetic trees.SATe-II:一种非常快速且准确的同时估计多个序列比对和系统发育树的方法。
Syst Biol. 2012 Jan;61(1):90-106. doi: 10.1093/sysbio/syr095. Epub 2011 Dec 1.
8
Mean and Variance of Phylogenetic Trees.系统发育树的均值和方差。
Syst Biol. 2020 Jan 1;69(1):139-154. doi: 10.1093/sysbio/syz041.
9
SimPhy: Phylogenomic Simulation of Gene, Locus, and Species Trees.SimPhy:基因树、基因座树和物种树的系统发育基因组学模拟
Syst Biol. 2016 Mar;65(2):334-44. doi: 10.1093/sysbio/syv082. Epub 2015 Nov 1.
10
A fast algorithm for computing geodesic distances in tree space.一种用于计算树空间测地距离的快速算法。
IEEE/ACM Trans Comput Biol Bioinform. 2011 Jan-Mar;8(1):2-13. doi: 10.1109/TCBB.2010.3.

引用本文的文献

1
GET_PHYLOMARKERS, a Software Package to Select Optimal Orthologous Clusters for Phylogenomics and Inferring Pan-Genome Phylogenies, Used for a Critical Geno-Taxonomic Revision of the Genus .GET_PHYLOMARKERS,一个用于为系统发育基因组学选择最佳直系同源簇并推断泛基因组系统发育的软件包,用于该属的关键基因分类修订。
Front Microbiol. 2018 May 1;9:771. doi: 10.3389/fmicb.2018.00771. eCollection 2018.
2
Principal component analysis and the locus of the Fréchet mean in the space of phylogenetic trees.主成分分析与系统发育树空间中弗雷歇均值的轨迹
Biometrika. 2017 Dec;104(4):901-922. doi: 10.1093/biomet/asx047. Epub 2017 Sep 27.