Highly contiguous genomes of human clinical isolates of Giardia duodenalis reveal assemblage- and sub-assemblage-specific presence-absence variation in protein-coding genes.

Affiliations

Department of Infectious Diseases, Unit 16 Mycotic and Parasitic Agents and Mycobacteria, Robert Koch-Institute, Berlin, Germany.

MWSchmid GmbH, Hauptstrasse 34, 8750 Glarus, Switzerland.

Publication information

Microb Genom. 2023 Mar;9(3). doi: 10.1099/mgen.0.000963.

Abstract

Giardia duodenalis (syn. Giardia intestinalis, Giardia lamblia) is a widespread gastrointestinal protozoan parasite with debated taxonomic status. Currently, eight distinct genetic sub-groups, termed assemblages A-H, are defined based on a few genetic markers. Assemblages A and B may represent distinct species and are both of human public health relevance. Genomic studies are scarce and the few reference genomes available, in particular for assemblage B, are insufficient for adequate comparative genomics. Here, by combining long- and short-read sequences generated by PacBio and Illumina sequencing technologies, we provide nine annotated reference genome sequences from new clinical isolates (four assemblage A and five assemblage B parasite isolates). The isolates chosen represent the currently accepted classification of sub-assemblages AI, AII, BIII and BIV. Synteny over the whole genome was generally high, but we report chromosome-level translocations as a feature that distinguishes assemblage A from assemblage B parasites. Orthologue gene group analysis was used to define gene content differences between assemblages A and B and to contribute a gene-set-based operational definition of the respective taxonomic units. Giardia duodenalis is tetraploid, and higher allelic sequence heterogeneity (ASH) has so far been observed for assemblage B than for assemblage A. Notably, here we report an extremely low ASH (0.002%) for one of the assemblage B isolates, a value even lower than that of the reference assemblage A isolate WB-C6. This challenges the view of low ASH being a notable feature that distinguishes assemblage A from assemblage B parasites, and the low ASH allowed assembly of the most contiguous assemblage B genome currently available for reference. In conclusion, the description of nine highly contiguous genome assemblies of new assemblage A and B isolates adds to our understanding of the genomics and species population structure of this widespread zoonotic parasite.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9a47/10132058/684abe1136ac/mgen-9-963-g001.jpg
