• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从组件到群落:将网络科学引入分子流行病学聚类分析

From components to communities: bringing network science to clustering for molecular epidemiology.

作者信息

Liu Molly, Chato Connor, Poon Art F Y

机构信息

Department of Pathology and Laboratory Medicine, Western University, Dental Sciences Building, Rm. 4044, London, ON N6A 5C1, Canada.

Department of Microbiology and Immunology, Western University, 1151 Richmond Street, London, ON N6A 3K7, Canada.

出版信息

Virus Evol. 2023 Apr 25;9(1):vead026. doi: 10.1093/ve/vead026. eCollection 2023.

DOI:10.1093/ve/vead026
PMID:37187604
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10175948/
Abstract

Defining clusters of epidemiologically related infections is a common problem in the surveillance of infectious disease. A popular method for generating clusters is pairwise distance clustering, which assigns pairs of sequences to the same cluster if their genetic distance falls below some threshold. The result is often represented as a network or graph of nodes. A connected component is a set of interconnected nodes in a graph that are not connected to any other node. The prevailing approach to pairwise clustering is to map clusters to the connected components of the graph on a one-to-one basis. We propose that this definition of clusters is unnecessarily rigid. For instance, the connected components can collapse into one cluster by the addition of a single sequence that bridges nodes in the respective components. Moreover, the distance thresholds typically used for viruses like HIV-1 tend to exclude a large proportion of new sequences, making it difficult to train models for predicting cluster growth. These issues may be resolved by revisiting how we define clusters from genetic distances. Community detection is a promising class of clustering methods from the field of network science. A community is a set of nodes that are more densely inter-connected relative to the number of their connections to external nodes. Thus, a connected component may be partitioned into two or more communities. Here we describe community detection methods in the context of genetic clustering for epidemiology, demonstrate how a popular method (Markov clustering) enables us to resolve variation in transmission rates within a giant connected component of HIV-1 sequences, and identify current challenges and directions for further work.

摘要

在传染病监测中,定义与流行病学相关的感染集群是一个常见问题。一种常用的生成集群的方法是成对距离聚类,即如果两个序列的遗传距离低于某个阈值,就将它们分配到同一个集群中。结果通常表示为节点的网络或图。连通分量是图中一组相互连接的节点,它们不与任何其他节点相连。成对聚类的主流方法是将集群一对一地映射到图的连通分量上。我们认为这种集群定义过于严格。例如,通过添加一个连接各个分量中节点的单个序列,连通分量可以合并为一个集群。此外,像HIV-1这样的病毒通常使用的距离阈值往往会排除很大一部分新序列,从而难以训练预测集群增长的模型。通过重新审视我们如何从遗传距离定义集群,这些问题可能会得到解决。社区检测是网络科学领域中一类很有前景的聚类方法。社区是一组节点,相对于它们与外部节点的连接数量,它们之间的连接更为密集。因此,一个连通分量可能会被划分为两个或更多个社区。在这里,我们在流行病学遗传聚类的背景下描述社区检测方法,展示一种流行方法(马尔可夫聚类)如何使我们能够解决HIV-1序列巨大连通分量内传播率的变化问题,并确定当前的挑战和进一步工作的方向。

相似文献

1
From components to communities: bringing network science to clustering for molecular epidemiology.从组件到群落:将网络科学引入分子流行病学聚类分析
Virus Evol. 2023 Apr 25;9(1):vead026. doi: 10.1093/ve/vead026. eCollection 2023.
2
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
3
Public health in genetic spaces: a statistical framework to optimize cluster-based outbreak detection.基因领域的公共卫生:一种优化基于聚类的疫情检测的统计框架。
Virus Evol. 2020 Mar 13;6(1):veaa011. doi: 10.1093/ve/veaa011. eCollection 2020 Jan.
4
Identifying Transmission Clusters with Cluster Picker and HIV-TRACE.使用聚类选择器和HIV-TRACE识别传播集群。
AIDS Res Hum Retroviruses. 2017 Mar;33(3):211-218. doi: 10.1089/AID.2016.0205. Epub 2016 Dec 13.
5
Key-Node-Separated Graph Clustering and Layouts for Human Relationship Graph Visualization.用于人际关系图可视化的键节点分隔图聚类与布局
IEEE Comput Graph Appl. 2015 Nov-Dec;35(6):30-40. doi: 10.1109/MCG.2015.115. Epub 2015 Sep 23.
6
Hierarchical link clustering algorithm in networks.网络中的层次链接聚类算法
Phys Rev E Stat Nonlin Soft Matter Phys. 2015 Jun;91(6):062814. doi: 10.1103/PhysRevE.91.062814. Epub 2015 Jun 24.
7
Clustering biological sequences with dynamic sequence similarity threshold.使用动态序列相似度阈值对生物序列进行聚类。
BMC Bioinformatics. 2022 Mar 30;23(1):108. doi: 10.1186/s12859-022-04643-9.
8
A Multi-Hop Clustering Mechanism for Scalable IoT Networks.一种用于可扩展物联网网络的多跳聚类机制。
Sensors (Basel). 2018 Mar 23;18(4):961. doi: 10.3390/s18040961.
9
Connectivity differences in brain networks.脑网络的连通性差异。
Neuroimage. 2012 Apr 2;60(2):1055-62. doi: 10.1016/j.neuroimage.2012.01.068. Epub 2012 Jan 16.
10
A graph-based clustering method applied to protein sequences.
Bioinformation. 2011;6(10):372-4. doi: 10.6026/97320630006372. Epub 2011 Aug 2.

本文引用的文献

1
Optimized phylogenetic clustering of HIV-1 sequence data for public health applications.用于公共卫生应用的 HIV-1 序列数据的优化系统发育聚类。
PLoS Comput Biol. 2022 Nov 30;18(11):e1010745. doi: 10.1371/journal.pcbi.1010745. eCollection 2022 Nov.
2
Phylogenetic Cluster Analysis Identifies Virological and Behavioral Drivers of Human Immunodeficiency Virus Transmission in Men Who Have Sex With Men.系统发育聚类分析鉴定男男性行为人群中人类免疫缺陷病毒传播的病毒学和行为学驱动因素。
Clin Infect Dis. 2021 Jun 15;72(12):2175-2183. doi: 10.1093/cid/ciaa411.
3
Public health in genetic spaces: a statistical framework to optimize cluster-based outbreak detection.
基因领域的公共卫生:一种优化基于聚类的疫情检测的统计框架。
Virus Evol. 2020 Mar 13;6(1):veaa011. doi: 10.1093/ve/veaa011. eCollection 2020 Jan.
4
Identification of Hidden Population Structure in Time-Scaled Phylogenies.时间尺度系统发育中隐藏种群结构的鉴定。
Syst Biol. 2020 Sep 1;69(5):884-896. doi: 10.1093/sysbio/syaa009.
5
Inferring putative transmission clusters with Phydelity.使用Phydelity推断假定的传播集群。
Virus Evol. 2019 Oct 9;5(2):vez039. doi: 10.1093/ve/vez039. eCollection 2019 Jul.
6
HIV transmission networks among transgender women in Los Angeles County, CA, USA: a phylogenetic analysis of surveillance data.美国加利福尼亚州洛杉矶县跨性别女性中的 HIV 传播网络:监测数据的系统发育分析。
Lancet HIV. 2019 Mar;6(3):e164-e172. doi: 10.1016/S2352-3018(18)30359-X. Epub 2019 Feb 11.
7
Prediction of HIV Transmission Cluster Growth With Statewide Surveillance Data.利用全州监测数据预测 HIV 传播簇的增长。
J Acquir Immune Defic Syndr. 2019 Feb 1;80(2):152-159. doi: 10.1097/QAI.0000000000001905.
8
Identifying Clusters of Recent and Rapid HIV Transmission Through Analysis of Molecular Surveillance Data.通过分子监测数据分析识别近期和快速 HIV 传播簇。
J Acquir Immune Defic Syndr. 2018 Dec 15;79(5):543-550. doi: 10.1097/QAI.0000000000001856.
9
DM-PhyClus: a Bayesian phylogenetic algorithm for infectious disease transmission cluster inference.DM-PhyClus:一种用于传染病传播聚类推断的贝叶斯系统发育算法。
BMC Bioinformatics. 2018 Sep 14;19(1):324. doi: 10.1186/s12859-018-2347-3.
10
Ethical considerations in global HIV phylogenetic research.全球 HIV 系统进化研究中的伦理考量。
Lancet HIV. 2018 Nov;5(11):e656-e666. doi: 10.1016/S2352-3018(18)30134-6. Epub 2018 Aug 30.