Ribotyping Staphylococcus epidermidis Using Probabilistic Sequence Analysis and Levenshtein Distance Algorithm.

Author information

Huang Ryan Yuki, Zhang Chengye, Lim Han Liang

Affiliations

Department of Computer Science, Program in Liberal Medical Education (PLME), Brown University, Providence, RI, 02906, USA.

Information Science and Technology, Universiti Kebangsaan, 43600, Lingkungan Ilmu, Selangor, Malaysia.

Publication information

Curr Microbiol. 2025 Jan 10;82(2):78. doi: 10.1007/s00284-024-04057-1.

Abstract

Staphylococcus epidermidis (S. epidermidis) lives at different sites on the human body and in natural environments. To ribotype S. epidermidis sub-species, 2507 PCR-amplified reads of S. epidermidis 16S rRNA genes from a public dataset were used for probabilistic sequence analysis. A sequence probability logo (sequence pLogo) was constructed as a reference sequence for the S. epidermidis 16S rRNA gene. Using the Levenshtein distance algorithm, two 20-base-pair (bp) motifs common to the 2507 PCR-amplified reads were identified. The top 38 S. epidermidis isolates, which carried 16S rRNA nucleotide domains that differed in sequence but had high similarity scores to the two 20-bp motifs, came from 11 human, 8 animal, 9 plant and 10 environmental samples, indicating that the two motifs are broadly present in diverse S. epidermidis isolates. Thirty-one PCR-amplified reads of 16S rRNA genes that were not in the dataset were used to verify the feasibility of using the two 20-bp motifs for ribotyping S. epidermidis sub-species. S. epidermidis isolates S1 and S3, but not S2, from the human scalp carried a 20-bp sequence domain highly similar to a 20-bp motif in the sequence pLogo. The phylogenetic tree showed that S. epidermidis S1, S2 and S3 did not descend from a single common ancestor. The two newly identified 20-bp motifs thus provide reference nucleotide residues for ribotyping S. epidermidis.
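The abstract does not say how similarity scores were derived from the Levenshtein distance. As a rough illustration only, the sketch below slides a motif-length window along a read and reports the window with the smallest edit distance. The function names (`levenshtein`, `best_window`), the similarity formula `1 - distance / motif_length`, and the example sequences are all assumptions made for illustration; they are not the authors' pipeline, and the sequences are not the published 20-bp motifs.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance: minimum insertions, deletions and substitutions to turn a into b."""
    if len(a) < len(b):
        a, b = b, a  # keep the shorter string in the inner loop to save memory
    previous = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(
                previous[j] + 1,         # delete ca
                current[j - 1] + 1,      # insert cb
                previous[j - 1] + cost,  # substitute (or match)
            ))
        previous = current
    return previous[-1]


def best_window(read: str, motif: str):
    """Return (start, window, similarity) for the motif-length window closest to motif.

    Similarity is 1 - distance / len(motif), an assumed normalisation.
    Returns None when the read is shorter than the motif.
    """
    k = len(motif)
    best = None
    for start in range(len(read) - k + 1):
        window = read[start:start + k]
        similarity = 1 - levenshtein(window, motif) / k
        if best is None or similarity > best[2]:
            best = (start, window, similarity)
    return best


# Hypothetical 20-bp motif and read, invented for this example.
example_motif = "ACGTACGTACGTACGTACGT"
example_read = "TTTT" + "ACGTACGAACGTACGTACGT" + "GGGG"  # one substitution in the window
print(best_window(example_read, example_motif))
```

A fixed-length window only captures substitutions cleanly; reads with insertions or deletions relative to a motif would need variable-length windows or a semi-global alignment, which this sketch leaves out.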

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ed9/11723854/1f662404b4ad/284_2024_4057_Fig1_HTML.jpg
