• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在不同流行病学情景和采样下,利用 SARS-CoV-2 基因组数据进行信息聚类。

The utility of SARS-CoV-2 genomic data for informative clustering under different epidemiological scenarios and sampling.

机构信息

Department of Mathematics, Simon Fraser University, Burnaby, Canada.

Department of Mathematics, Simon Fraser University, Burnaby, Canada.

出版信息

Infect Genet Evol. 2023 Sep;113:105484. doi: 10.1016/j.meegid.2023.105484. Epub 2023 Jul 31.

DOI:10.1016/j.meegid.2023.105484
PMID:37531976
Abstract

OBJECTIVES

Clustering pathogen sequence data is a common practice in epidemiology to gain insights into the genetic diversity and evolutionary relationships among pathogens. We can find groups of cases with a shared transmission history and common origin, as well as identifying transmission hotspots. Motivated by the experience of clustering SARS-CoV-2 cases using whole genome sequence data during the COVID-19 pandemic to aid with public health investigation, we investigated how differences in epidemiology and sampling can influence the composition of clusters that are identified.

METHODS

We performed genomic clustering on simulated SARS-CoV-2 outbreaks produced with different transmission rates and levels of genomic diversity, along with varying the proportion of cases sampled.

RESULTS

In single outbreaks with a low transmission rate, decreasing the sampling fraction resulted in multiple, separate clusters being identified where intermediate cases in transmission chains are missed. Outbreaks simulated with a high transmission rate were more robust to changes in the sampling fraction and largely resulted in a single cluster that included all sampled outbreak cases. When considering multiple outbreaks in a sampled jurisdiction seeded by different introductions, low genomic diversity between introduced cases caused outbreaks to be merged into large clusters. If the transmission and sampling fraction, and diversity between introductions was low, a combination of the spurious break-up of outbreaks and the linking of closely related cases in different outbreaks resulted in clusters that may appear informative, but these did not reflect the true underlying population structure. Conversely, genomic clusters matched the true population structure when there was relatively high diversity between introductions and a high transmission rate.

CONCLUSION

Differences in epidemiology and sampling can impact our ability to identify genomic clusters that describe the underlying population structure. These findings can help to guide recommendations for the use of pathogen clustering in public health investigations.

摘要

目的

在流行病学中,对病原体序列数据进行聚类是一种常见的做法,可深入了解病原体的遗传多样性和进化关系。我们可以找到具有共同传播史和共同起源的病例组,并确定传播热点。受 COVID-19 大流行期间使用全基因组序列数据对 SARS-CoV-2 病例进行聚类以辅助公共卫生调查的经验启发,我们研究了流行病学和采样差异如何影响所识别的聚类的组成。

方法

我们对不同传播率和基因组多样性水平以及不同病例采样比例产生的模拟 SARS-CoV-2 暴发进行了基因组聚类。

结果

在低传播率的单一暴发中,减少采样比例会导致识别出多个单独的聚类,而在传播链中的中间病例则被遗漏。高传播率模拟的暴发对采样比例的变化具有更强的鲁棒性,并且主要导致包含所有采样暴发病例的单个聚类。在采样司法管辖区中考虑多个由不同引入引起的暴发时,如果引入病例之间的基因组多样性低,则暴发会合并为大聚类。如果传播率、采样比例和引入之间的多样性低,则暴发的虚假分裂和不同暴发中密切相关病例的链接会导致聚类看起来很有信息量,但这些聚类并不反映真实的潜在人群结构。相反,当引入之间存在相对较高的多样性和较高的传播率时,基因组聚类与真实的人群结构相匹配。

结论

流行病学和采样的差异会影响我们识别描述潜在人群结构的基因组聚类的能力。这些发现有助于指导在公共卫生调查中使用病原体聚类的建议。

相似文献

1
The utility of SARS-CoV-2 genomic data for informative clustering under different epidemiological scenarios and sampling.在不同流行病学情景和采样下,利用 SARS-CoV-2 基因组数据进行信息聚类。
Infect Genet Evol. 2023 Sep;113:105484. doi: 10.1016/j.meegid.2023.105484. Epub 2023 Jul 31.
2
Within-host diversity improves phylogenetic and transmission reconstruction of SARS-CoV-2 outbreaks.宿主内多样性提高了 SARS-CoV-2 爆发的系统发育和传播重建。
Elife. 2023 Sep 21;12:e84384. doi: 10.7554/eLife.84384.
3
Guiding the design of SARS-CoV-2 genomic surveillance by estimating the resolution of outbreak detection.通过估计暴发检测的分辨率来指导 SARS-CoV-2 基因组监测的设计。
Front Public Health. 2022 Oct 5;10:1004201. doi: 10.3389/fpubh.2022.1004201. eCollection 2022.
4
Utility of SARS-CoV-2 Genomic Sequencing for Understanding Transmission and School Outbreaks.SARS-CoV-2 基因组测序在了解传播和学校疫情中的应用。
Pediatr Infect Dis J. 2023 Apr 1;42(4):324-331. doi: 10.1097/INF.0000000000003834. Epub 2023 Jan 26.
5
Cov2clusters: genomic clustering of SARS-CoV-2 sequences.Cov2clusters:SARS-CoV-2 序列的基因组聚类。
BMC Genomics. 2022 Oct 19;23(1):710. doi: 10.1186/s12864-022-08936-4.
6
Combined epidemiological and genomic analysis of nosocomial SARS-CoV-2 infection early in the pandemic and the role of unidentified cases in transmission.结合大流行早期医院获得性 SARS-CoV-2 感染的流行病学和基因组分析,以及不明病例在传播中的作用。
Clin Microbiol Infect. 2022 Jan;28(1):93-100. doi: 10.1016/j.cmi.2021.07.040. Epub 2021 Aug 13.
7
A Bayesian inference method to estimate transmission trees with multiple introductions; applied to SARS-CoV-2 in Dutch mink farms.一种贝叶斯推断方法,用于估计具有多次传入的传播树;应用于荷兰水貂养殖场中的 SARS-CoV-2。
PLoS Comput Biol. 2023 Nov 27;19(11):e1010928. doi: 10.1371/journal.pcbi.1010928. eCollection 2023 Nov.
8
Genomics-informed responses in the elimination of COVID-19 in Victoria, Australia: an observational, genomic epidemiological study.澳大利亚维多利亚州利用基因组学信息消除 COVID-19 的反应:一项观察性、基因组流行病学研究。
Lancet Public Health. 2021 Aug;6(8):e547-e556. doi: 10.1016/S2468-2667(21)00133-X. Epub 2021 Jul 10.
9
Combining epidemiological data and whole genome sequencing to understand SARS-CoV-2 transmission dynamics in a large tertiary care hospital during the first COVID-19 wave in The Netherlands focusing on healthcare workers.结合流行病学数据和全基因组测序,了解荷兰首次 COVID-19 浪潮期间大型三级保健医院中 SARS-CoV-2 的传播动态,重点关注医护人员。
Antimicrob Resist Infect Control. 2023 May 10;12(1):46. doi: 10.1186/s13756-023-01247-7.
10
Detection of SARS-CoV-2 infection clusters: The useful combination of spatiotemporal clustering and genomic analyses.检测 SARS-CoV-2 感染集群:时空聚类和基因组分析的有益结合。
Front Public Health. 2022 Dec 1;10:1016169. doi: 10.3389/fpubh.2022.1016169. eCollection 2022.

引用本文的文献

1
Characterizing spatial epidemiology in a heterogeneous transmission landscape using the spatial transmission count statistic.使用空间传播计数统计量来描述异质传播环境中的空间流行病学。
Commun Med (Lond). 2025 May 9;5(1):165. doi: 10.1038/s43856-025-00888-6.