Lamichhane Binit, Wise Michael J, Chua Eng Guan, Marshall Barry J, Tay Chin Yen
Helicobacter pylori Research Laboratory, Marshall Centre for Infectious Disease Research and Training, School of Biomedical Sciences, University of Western Australia, Perth, WA, Australia.
Department of Computer Science and Software Engineering, University of Western Australia, Perth, WA, Australia.
Evol Appl. 2019 Sep 18;13(2):278-289. doi: 10.1111/eva.12864. eCollection 2020 Feb.
We present a novel method for taxon selection, the aim being to minimize problems arising from highly recombinant species such as Helicobacter pylori. H. pylori has accompanied modern-human migration out of Africa and is marked by a phylogeographic strain distribution, which has been exploited to add an extra layer of information about human migrations to that obtained from human sources. However, the H. pylori genome has high sequence heterogeneity combined with a very high rate of recombination, causing major allelic diversification across strains. On the other hand, recombination events that have become preserved in sub-populations are a useful source of phylogenetic information. This creates a potential problem both in selecting representative strains for particular genetic or phylogeographic clusters and, more generally, in reducing the impact of extensive low-level recombination on analyses. To address this issue, we perform multiple population structure-based analyses on core genomes to select exemplar strains, called 'quintessents', which exhibit limited recombination. In essence, quintessent strains are representative of their specific phylogenetic clades and can be used to refine the current MLST concatenation-based population structure classification system. The use of quintessents reduces the noise due to local recombination events, while preserving recombination events that have become fixed in sub-populations. We illustrate the method with an analysis of core genome concatenations from 185 H. pylori strains, which reveals a recent speciation event resulting from the recombination of strains from phylogeographic clade hpSahul, carried by Aboriginal Australians, and hpEurope, carried by some of the people who arrived in Australia over the past 200 years. The signal is much clearer when based on quintessent strains, but absent from the analysis based on MLST concatenations.
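The abstract does not state the exact criterion used to pick quintessents, so the following is only a rough illustrative sketch, not the authors' procedure. It assumes that a population-structure analysis has already assigned each strain a set of ancestry proportions, and that a strain counts as an exemplar when its dominant proportion meets a cutoff. The function name `select_quintessents`, the `threshold` value of 0.95, the strain labels, and all numbers are hypothetical.

```python
from typing import Dict, List


def select_quintessents(
    ancestry: Dict[str, Dict[str, float]],
    threshold: float = 0.95,  # hypothetical cutoff, not taken from the paper
) -> Dict[str, List[str]]:
    """Group strains by dominant population, keeping only strains whose
    dominant ancestry proportion is at least `threshold`."""
    selected: Dict[str, List[str]] = {}
    for strain, proportions in ancestry.items():
        # Find the population contributing the largest share to this strain.
        population, share = max(proportions.items(), key=lambda kv: kv[1])
        if share >= threshold:
            selected.setdefault(population, []).append(strain)
    return selected


# Made-up toy proportions for three hypothetical strains.
toy = {
    "strainA": {"hpSahul": 0.98, "hpEurope": 0.02},
    "strainB": {"hpSahul": 0.55, "hpEurope": 0.45},
    "strainC": {"hpSahul": 0.01, "hpEurope": 0.99},
}

result = select_quintessents(toy)
print(result)
```

In this toy input, the heavily admixed `strainB` is excluded, while `strainA` and `strainC` are retained as stand-ins for their respective populations. A real analysis would combine several population-structure results, as the abstract indicates, rather than rely on a single cutoff.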