Nichols Robert G, Davenport Emily R
Department of Biology, The Pennsylvania State University, University Park, Pennsylvania, USA.
One Health Microbiome Center, Huck Life Sciences Institute, University Park, Pennsylvania, USA.
mSystems. 2025 Jan 21;10(1):e0148024. doi: 10.1128/msystems.01480-24. Epub 2024 Dec 16.
Phylogenetic marker gene sequencing is often used as a quick and cost-effective way of evaluating microbial composition within a community. While 16S rRNA gene sequencing (16S) is commonly used for bacteria and archaea, other marker genes are preferable in certain situations, such as when 16S sequences cannot distinguish between taxa within a group. Another situation is when researchers want to study cospeciation of host taxa that diverged much more recently than the slowly evolving 16S rRNA gene. For example, the bacterial gyrase subunit B () gene has been used to investigate cospeciation between the microbiome and various hominid species. However, to date, only primers that generate short-read Illumina MiSeq-length amplicons exist to investigate of the Bacteroidaceae, Bifidobacteriaceae, and Lachnospiraceae families. Here, we update this methodology by creating primers for the Bacteroidaceae, Bifidobacteriaceae, and Lachnospiraceae families for long-read PacBio sequencing and characterize them against established short-read primer sets. We demonstrate both bioinformatically and analytically that these longer amplicons offer more sequence space for greater taxonomic resolution, lower off-target amplification rates, and lower error rates with PacBio CCS sequencing versus established short-read sequencing. The availability of these long-read primers will prove to be integral to the continued analysis of cospeciation between bacterial members of the gut microbiome and recently diverging host species.
Previous studies have shown that the marker gene gyrase subunit B () can be used to study codiversification between the gut microbiome and hominids. However, only primers for short-read sequencing have been developed which have limited resolution for subspecies assignment. In the present study, we create new primer sets for long-read sequencing approaches and compare them to the existing short-read primers. We show that using longer reads leads to better taxonomic resolution, lower off-target amplification, and lower error rates, which are vital for accurate estimates of codiversification.
系统发育标记基因测序通常被用作一种快速且经济高效的方法来评估群落中的微生物组成。虽然16S rRNA基因测序(16S)常用于细菌和古菌,但在某些情况下其他标记基因更可取,比如当16S序列无法区分一组内的分类群时。另一种情况是当研究人员想要研究宿主分类群的共 speciation,而这些宿主分类群的分化时间比缓慢进化的16S rRNA基因更近。例如,细菌gyrase亚基B()基因已被用于研究微生物组与各种原始人类物种之间的共 speciation。然而,迄今为止,仅存在用于生成短读长Illumina MiSeq长度扩增子的引物来研究拟杆菌科、双歧杆菌科和毛螺菌科的。在此,我们通过为拟杆菌科、双歧杆菌科和毛螺菌科创建用于长读长PacBio测序的引物来更新此方法,并将它们与已建立的短读长引物组进行表征比较。我们通过生物信息学和分析证明,与已建立的短读长测序相比,这些更长的扩增子为更高的分类分辨率、更低的非靶向扩增率和更低的PacBio CCS测序错误率提供了更多的序列空间。这些长读长引物的可用性将被证明对于持续分析肠道微生物组的细菌成员与最近分化的宿主物种之间的共 speciation不可或缺。
先前的研究表明,标记基因gyrase亚基B()可用于研究肠道微生物组与原始人类之间的共同多样化。然而,仅开发了用于短读长测序的引物,其在亚种分配方面的分辨率有限。在本研究中,我们为长读长测序方法创建了新 的引物组,并将它们与现有的短读长引物进行比较。我们表明,使用更长的读长可带来更好 的分类分辨率、更低的非靶向扩增和更低的错误率,这对于准确估计共同多样化至关重要。