Murphy Robert, Strube Mikael Lenz
Section for Ecology and Evolution, Department of Biology, University of Copenhagen, Copenhagen 2100, Denmark.
Department of Biotechnology and Biomedicine, Technical University of Denmark, Kongens Lyngby 2800, Denmark.
Bioinform Adv. 2023 Aug 21;3(1):vbad111. doi: 10.1093/bioadv/vbad111. eCollection 2023.
As previously described, amplicon analysis of the bacterial 16S gene has several limitations owing to fundamental characteristics of both the 16S gene and technological restrictions. Previously, RibDif was introduced to help quantify these limitations by detailed analysis of a given genera and the 16S gene profile of its members, notably multiplicity and divergence of 16S alleles within genomes as well as shared alleles between species. Apart from using amplicon analysis for only the 16S gene, amplicons derived from genus-specific genes or even functional genes are increasingly being utilized. Moreover, long-read technologies are progressively being used to sequence longer amplicons, and since these inherently contain more information, they may likely alleviate the issues proposed in RibDif.
Taking these phenomena into account, we here propose RibDif2. RibDif2 retains the 16S-optimized functionality of the original RibDif but can now run any set of primers on any part of the genome in any set of organisms, be it prokaryote, eukaryote, or archaea. We demonstrate this new functionality by showing full species resolution of using complete rRNA-operon amplicons, as well as selection of optimally discriminatory primers for and . Moreover, we show a potential bias toward terrestrial bacteria relative to marine ones for primers amplifying biosynthetic gene clusters and lastly suggest optimal primers to differentiate the members of the insect genus . We believe that RibDif2 will facilitate the work of all scientists using amplicon sequencing, especially in the era of long-read sequencing.
Ribdif2 is freely available at https://github.com/Rob-murphys/ribdif.
如前所述,由于16S基因的基本特征和技术限制,细菌16S基因的扩增子分析存在若干局限性。此前,RibDif被引入,通过对特定属及其成员的16S基因谱进行详细分析,以量化这些局限性,特别是基因组内16S等位基因的多样性和分歧以及物种间共享的等位基因。除了仅对16S基因进行扩增子分析外,源自属特异性基因甚至功能基因的扩增子也越来越多地被使用。此外,长读长技术正逐渐用于对更长的扩增子进行测序,由于这些扩增子本身包含更多信息,它们可能会缓解RibDif中提出的问题。
考虑到这些现象,我们在此提出RibDif2。RibDif2保留了原始RibDif的16S优化功能,但现在可以在任何生物体的基因组的任何部分上运行任何引物集,无论是原核生物、真核生物还是古细菌。我们通过展示使用完整rRNA操纵子扩增子的全物种分辨率,以及为[具体物种1]和[具体物种2]选择最佳区分性引物,来证明这种新功能。此外,我们展示了扩增生物合成基因簇的引物相对于海洋细菌对陆地细菌的潜在偏向性,最后提出了区分昆虫属成员的最佳引物。我们相信RibDif2将促进所有使用扩增子测序的科学家的工作,特别是在长读长测序时代。