• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于线性整数规划的系统发育聚类(PhyCLIP)。

Phylogenetic Clustering by Linear Integer Programming (PhyCLIP).

机构信息

Bioinformatics Institute, Agency for Science, Technology and Research (A*STAR), Singapore.

NUS Graduate School for Integrative Sciences and Engineering, National University of Singapore (NUS), Singapore.

出版信息

Mol Biol Evol. 2019 Jul 1;36(7):1580-1595. doi: 10.1093/molbev/msz053.

DOI:10.1093/molbev/msz053
PMID:30854550
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6573476/
Abstract

Subspecies nomenclature systems of pathogens are increasingly based on sequence data. The use of phylogenetics to identify and differentiate between clusters of genetically similar pathogens is particularly prevalent in virology from the nomenclature of human papillomaviruses to highly pathogenic avian influenza (HPAI) H5Nx viruses. These nomenclature systems rely on absolute genetic distance thresholds to define the maximum genetic divergence tolerated between viruses designated as closely related. However, the phylogenetic clustering methods used in these nomenclature systems are limited by the arbitrariness of setting intra and intercluster diversity thresholds. The lack of a consensus ground truth to define well-delineated, meaningful phylogenetic subpopulations amplifies the difficulties in identifying an informative distance threshold. Consequently, phylogenetic clustering often becomes an exploratory, ad hoc exercise. Phylogenetic Clustering by Linear Integer Programming (PhyCLIP) was developed to provide a statistically principled phylogenetic clustering framework that negates the need for an arbitrarily defined distance threshold. Using the pairwise patristic distance distributions of an input phylogeny, PhyCLIP parameterizes the intra and intercluster divergence limits as statistical bounds in an integer linear programming model which is subsequently optimized to cluster as many sequences as possible. When applied to the hemagglutinin phylogeny of HPAI H5Nx viruses, PhyCLIP was not only able to recapitulate the current WHO/OIE/FAO H5 nomenclature system but also further delineated informative higher resolution clusters that capture geographically distinct subpopulations of viruses. PhyCLIP is pathogen-agnostic and can be generalized to a wide variety of research questions concerning the identification of biologically informative clusters in pathogen phylogenies. PhyCLIP is freely available at http://github.com/alvinxhan/PhyCLIP, last accessed March 15, 2019.

摘要

病原体亚种命名系统越来越多地基于序列数据。从人类乳头瘤病毒的命名到高致病性禽流感(HPAI)H5Nx 病毒,系统发生学被广泛用于识别和区分遗传上相似的病原体聚类。这些命名系统依赖于绝对遗传距离阈值来定义被指定为密切相关的病毒之间可容忍的最大遗传差异。然而,这些命名系统中使用的系统发育聚类方法受到设定内群和外群多样性阈值的任意性限制。缺乏共识的真实情况来定义定义明确、有意义的系统发育亚群,这加剧了确定信息丰富的距离阈值的困难。因此,系统发生聚类通常成为一种探索性的、临时的练习。线性整数规划的系统发生聚类(PhyCLIP)的开发提供了一个具有统计原理的系统发生聚类框架,该框架否定了需要任意定义距离阈值的必要性。使用输入系统发育树的成对亲缘关系距离分布,PhyCLIP 将内群和外群的分歧限制参数化为整数线性规划模型中的统计边界,随后对该模型进行优化,以尽可能多地聚类序列。当应用于 HPAI H5Nx 病毒的血凝素系统发育时,PhyCLIP 不仅能够重现当前的世卫组织/国际兽疫局/粮农组织 H5 命名系统,还进一步划分了具有信息性的更高分辨率聚类,这些聚类捕获了病毒在地理上不同的亚群。PhyCLIP 与病原体无关,可以推广到病原体系统发育中识别具有生物学意义的聚类的各种研究问题。PhyCLIP 可在 http://github.com/alvinxhan/PhyCLIP 上免费获得,最后访问时间为 2019 年 3 月 15 日。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df21/6573476/507ebcac36e9/msz053f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df21/6573476/483686ed1481/msz053f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df21/6573476/78cb5304be7d/msz053f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df21/6573476/eb9edd5915b9/msz053f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df21/6573476/507ebcac36e9/msz053f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df21/6573476/483686ed1481/msz053f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df21/6573476/78cb5304be7d/msz053f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df21/6573476/eb9edd5915b9/msz053f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df21/6573476/507ebcac36e9/msz053f4.jpg

相似文献

1
Phylogenetic Clustering by Linear Integer Programming (PhyCLIP).基于线性整数规划的系统发育聚类(PhyCLIP)。
Mol Biol Evol. 2019 Jul 1;36(7):1580-1595. doi: 10.1093/molbev/msz053.
2
Inferring putative transmission clusters with Phydelity.使用Phydelity推断假定的传播集群。
Virus Evol. 2019 Oct 9;5(2):vez039. doi: 10.1093/ve/vez039. eCollection 2019 Jul.
3
Nomenclature updates resulting from the evolution of avian influenza A(H5) virus clades 2.1.3.2a, 2.2.1, and 2.3.4 during 2013-2014.2013 - 2014年期间甲型禽流感病毒2.1.3.2a、2.2.1和2.3.4进化分支导致的命名更新。
Influenza Other Respir Viruses. 2015 Sep;9(5):271-6. doi: 10.1111/irv.12324.
4
Phylodynamics of avian influenza clade 2.2.1 H5N1 viruses in Egypt.埃及2.2.1进化枝H5N1禽流感病毒的系统动力学
Virol J. 2016 Mar 22;13:49. doi: 10.1186/s12985-016-0477-7.
5
Biosafety Recommendations for Work with Influenza Viruses Containing a Hemagglutinin from the A/goose/Guangdong/1/96 Lineage.含有 A/goose/Guangdong/1/96 谱系血凝素的流感病毒工作的生物安全建议。
MMWR Recomm Rep. 2013 Jun 28;62(RR-06):1-7.
6
Genetic analysis of avian influenza A viruses isolated from domestic waterfowl in live-bird markets of Hanoi, Vietnam, preceding fatal H5N1 human infections in 2004.对2004年越南河内活禽市场致命H5N1人类感染事件之前从家鸭中分离出的甲型禽流感病毒进行基因分析。
Arch Virol. 2009;154(8):1249-61. doi: 10.1007/s00705-009-0429-2. Epub 2009 Jul 4.
7
Linear programming model to construct phylogenetic network for 16S rRNA sequences of photosynthetic organisms and influenza viruses.
Interdiscip Sci. 2014 Jun;6(2):100-7. doi: 10.1007/s12539-012-0043-y. Epub 2014 Jun 17.
8
Phylogenetic study-based hemagglutinin (HA) gene of highly pathogenic avian influenza virus (H5N1) detected from backyard chickens in Iran, 2015.2015年基于系统发育研究的从伊朗后院鸡中检测到的高致病性禽流感病毒(H5N1)血凝素(HA)基因
Virus Genes. 2017 Feb;53(1):117-120. doi: 10.1007/s11262-016-1394-y. Epub 2016 Sep 27.
9
Genetic diversity and phylogenetic analysis of highly pathogenic avian influenza (HPAI) H5N1 viruses circulating in Bangladesh from 2007-2011.2007-2011 年孟加拉国流行的高致病性禽流感(HPAI)H5N1 病毒的遗传多样性和系统进化分析。
Transbound Emerg Dis. 2013 Dec;60(6):481-91. doi: 10.1111/tbed.12173. Epub 2013 Oct 11.
10
Efficacy of a Recombinant Turkey Herpesvirus H5 Vaccine Against Challenge With H5N1 Clades 1.1.2 and 2.3.2.1 Highly Pathogenic Avian Influenza Viruses in Domestic Ducks (Anas platyrhynchos domesticus).一种重组火鸡疱疹病毒H5疫苗对家鸭(绿头鸭)抵抗H5N1进化分支1.1.2和2.3.2.1高致病性禽流感病毒攻击的效力
Avian Dis. 2016 Mar;60(1):22-32. doi: 10.1637/11282-091615-Reg.1.

引用本文的文献

1
Multi-proteins similarity-based sampling to select representative genomes from large databases.基于多蛋白质相似性的抽样方法,用于从大型数据库中选择代表性基因组。
BMC Bioinformatics. 2025 May 6;26(1):121. doi: 10.1186/s12859-025-06095-3.
2
Local patterns of spread of influenza A H3N2 virus in coastal Kenya over a 1-year period revealed through virus sequence data.在肯尼亚沿海地区,通过病毒序列数据揭示了 A 型 H3N2 流感病毒在 1 年内的局部传播模式。
Sci Rep. 2024 Oct 8;14(1):23426. doi: 10.1038/s41598-024-74218-6.
3
Phylogeography and reassortment patterns of human influenza A viruses in sub-Saharan Africa.

本文引用的文献

1
Genetic Cluster Analysis for HIV Prevention.艾滋病预防的遗传聚类分析。
Curr HIV/AIDS Rep. 2018 Apr;15(2):182-189. doi: 10.1007/s11904-018-0384-1.
2
The influence of phylodynamic model specifications on parameter estimates of the Zika virus epidemic.系统发育动力学模型规范对寨卡病毒流行参数估计的影响。
Virus Evol. 2018 Jan 29;4(1):vex044. doi: 10.1093/ve/vex044. eCollection 2018 Jan.
3
A model-based clustering method to detect infectious disease transmission outbreaks from sequence variation.一种基于模型的聚类方法,用于从序列变异中检测传染病传播暴发。
撒哈拉以南非洲地区人类甲型流感病毒的系统地理学和重配模式。
Sci Rep. 2024 Aug 16;14(1):18987. doi: 10.1038/s41598-024-70023-3.
4
Proposal for a Global Classification and Nomenclature System for A/H9 Influenza Viruses.关于 A/H9 流感病毒的全球分类和命名系统建议。
Emerg Infect Dis. 2024 Aug;30(8):1-13. doi: 10.3201/eid3008.231176.
5
The Genetic Diversity of Nipah Virus Across Spatial Scales.不同空间尺度下尼帕病毒的遗传多样性
J Infect Dis. 2024 Dec 16;230(6):e1235-e1244. doi: 10.1093/infdis/jiae221.
6
Transmission restriction and genomic evolution co-shape the genetic diversity patterns of influenza A virus.传播限制和基因组进化共同塑造了甲型流感病毒的遗传多样性模式。
Virol Sin. 2024 Aug;39(4):525-536. doi: 10.1016/j.virs.2024.02.005. Epub 2024 Feb 27.
7
Bases-dependent Rapid Phylogenetic Clustering (Bd-RPC) enables precise and efficient phylogenetic estimation in viruses.基于碱基的快速系统发育聚类(Bd-RPC)能够在病毒中进行精确且高效的系统发育估计。
Virus Evol. 2024 Jan 27;10(1):veae005. doi: 10.1093/ve/veae005. eCollection 2024.
8
AutoPhy: Automated phylogenetic identification of novel protein subfamilies.AutoPhy:新型蛋白质亚家族的自动系统发育鉴定。
PLoS One. 2024 Jan 11;19(1):e0291801. doi: 10.1371/journal.pone.0291801. eCollection 2024.
9
Assessing the Potential Role of Cats () as Generators of Relevant SARS-CoV-2 Lineages during the Pandemic.评估猫在疫情期间作为相关严重急性呼吸综合征冠状病毒2(SARS-CoV-2)谱系产生者的潜在作用。
Pathogens. 2023 Nov 16;12(11):1361. doi: 10.3390/pathogens12111361.
10
The genetic diversity of Nipah virus across spatial scales.尼帕病毒在不同空间尺度上的遗传多样性。
medRxiv. 2023 Oct 4:2023.07.14.23292668. doi: 10.1101/2023.07.14.23292668.
PLoS Comput Biol. 2017 Nov 13;13(11):e1005868. doi: 10.1371/journal.pcbi.1005868. eCollection 2017 Nov.
4
Towards a genomics-informed, real-time, global pathogen surveillance system.迈向一个基于基因组学的、实时的全球病原体监测系统。
Nat Rev Genet. 2018 Jan;19(1):9-20. doi: 10.1038/nrg.2017.88. Epub 2017 Nov 13.
5
Influenza immunization of pregnant women in resource-constrained countries: an update for funding and implementation decisions.资源有限国家中孕妇的流感免疫:为资助和实施决策提供的最新情况
Curr Opin Infect Dis. 2017 Oct;30(5):455-462. doi: 10.1097/QCO.0000000000000392.
6
Defining HIV-1 transmission clusters based on sequence data.基于序列数据定义HIV-1传播簇。
AIDS. 2017 Jun 1;31(9):1211-1222. doi: 10.1097/QAD.0000000000001470.
7
Impacts and shortcomings of genetic clustering methods for infectious disease outbreaks.传染病暴发的基因聚类方法的影响与不足
Virus Evol. 2016 Oct 20;2(2):vew031. doi: 10.1093/ve/vew031. eCollection 2016 Jul.
8
Identifying Transmission Clusters with Cluster Picker and HIV-TRACE.使用聚类选择器和HIV-TRACE识别传播集群。
AIDS Res Hum Retroviruses. 2017 Mar;33(3):211-218. doi: 10.1089/AID.2016.0205. Epub 2016 Dec 13.
9
Exploring the temporal structure of heterochronous sequences using TempEst (formerly Path-O-Gen).使用TempEst(原Path-O-Gen)探索异时序列的时间结构。
Virus Evol. 2016 Apr 9;2(1):vew007. doi: 10.1093/ve/vew007. eCollection 2016 Jan.
10
Role for migratory wild birds in the global spread of avian influenza H5N8.候鸟在H5N8型禽流感全球传播中的作用。
Science. 2016 Oct 14;354(6309):213-217. doi: 10.1126/science.aaf8852.