Institut de Recherche en Biologie Végétale, Département de Sciences Biologiques, Université de Montréal, Montréal, QC H1X 2B2, Canada.
Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montréal, QC H3A 1A2, Canada.
Genomics Proteomics Bioinformatics. 2020 Oct;18(5):601-612. doi: 10.1016/j.gpb.2018.07.010. Epub 2020 Dec 18.
Arbuscular mycorrhizal fungi (AMF) are plant root symbionts that play key roles in plant growth and soil fertility. They are obligate biotrophic fungi that form coenocytic multinucleated hyphae and spores. Numerous studies have shown that diverse microorganisms live on the surface of and inside their mycelia, resulting in a metagenome when whole-genome sequencing (WGS) data are obtained from sequencing AMF cultivated in vivo. The metagenome contains not only the AMF sequences, but also those from associated microorganisms. In this study, we introduce a novel bioinformatics program, Spore-associated Symbiotic Microbes (SeSaMe), designed for taxonomic classification of short sequences obtained by next-generation DNA sequencing. A genus-specific usage bias database was created based on amino acid usage and codon usage of a three consecutive codon DNA 9-mer encoding an amino acid trimer in a protein secondary structure. The program distinguishes between coding sequence (CDS) and non-CDS, and classifies a query sequence into a genus group out of 54 genera used as reference. The mean percentages of correct predictions of the CDS and the non-CDS test sets at the genus level were 71% and 50% for bacteria, 68% and 73% for fungi (excluding AMF), and 49% and 72% for AMF (Rhizophagus irregularis), respectively. SeSaMe provides not only a means for estimating taxonomic diversity and abundance but also the gene reservoir of the reference taxonomic groups associated with AMF. Therefore, it enables users to study the symbiotic roles of associated microorganisms. It can also be applicable to other microorganisms as well as soil metagenomes. SeSaMe is freely available at www.fungalsesame.org.
丛枝菌根真菌(AMF)是与植物根系共生的真菌,在植物生长和土壤肥力方面发挥着关键作用。它们是专性生物营养型真菌,形成合胞多核菌丝和孢子。大量研究表明,多种微生物生活在其菌丝的表面和内部,导致当从活体培养的 AMF 中获得全基因组测序(WGS)数据时,会产生一个宏基因组。该宏基因组不仅包含 AMF 序列,还包含与其相关的微生物序列。在本研究中,我们引入了一种新的生物信息学程序 Spore-associated Symbiotic Microbes(SeSaMe),旨在对下一代 DNA 测序获得的短序列进行分类学分类。基于氨基酸使用和编码三氨基酸的三联体的密码子使用,为每个氨基酸创建了一个特定于属的使用偏差数据库。该程序可区分编码序列(CDS)和非 CDS,并将查询序列分为 54 个属参考组中的一个属组。在细菌中,CDS 和非 CDS 测试集的正确预测百分比的平均值分别为 71%和 50%;在真菌(不包括 AMF)中分别为 68%和 73%;在 AMF(不规则隔孢囊霉)中分别为 49%和 72%。SeSaMe 不仅提供了一种估计分类多样性和丰度的方法,还提供了与 AMF 相关的参考分类群的基因库。因此,它使用户能够研究相关微生物的共生作用。它也可适用于其他微生物和土壤宏基因组。SeSaMe 可在 www.fungalsesame.org 上免费获得。