Suppr超能文献

使用16S核糖体DNA进行有效探针自动设计的综合比对序列构建(CASCADE-P)

Comprehensive aligned sequence construction for automated design of effective probes (CASCADE-P) using 16S rDNA.

作者信息

DeSantis T Z, Dubosarskiy I, Murray S R, Andersen G L

机构信息

Center for Environmental Biotechnology, Lawrence Berkeley National Laboratory, 1 Cyclotron Road, Mail Stop 70A-3317, Berkeley, CA 94720, USA.

出版信息

Bioinformatics. 2003 Aug 12;19(12):1461-8. doi: 10.1093/bioinformatics/btg200.

Abstract

MOTIVATION

Prokaryotic organisms have been identified utilizing the sequence variation of the 16S rRNA gene. Variations steer the design of DNA probes for the detection of taxonomic groups or specific organisms. The long-term goal of our project is to create probe arrays capable of identifying 16S rDNA sequences in unknown samples. This necessitated the authentication, categorization and alignment of the >75 000 publicly available '16S' sequences. Preferably, the entire process should be computationally administrated so the aligned collection could periodically absorb 16S rDNA sequences from the public records. A complete multiple sequence alignment would provide a foundation for computational probe selection and facilitates microbial taxonomy and phylogeny.

RESULTS

Here we report the alignment and similarity clustering of 62 662 16S rDNA sequences and an approach for designing effective probes for each cluster. A novel alignment compression algorithm, NAST (Nearest Alignment Space Termination), was designed to produce the uniform multiple sequence alignment referred to as the prokMSA. From the prokMSA, 9020 Operational Taxonomic Units (OTUs) were found based on transitive sequence similarities. An automated approach to probe design was straightforward using the prokMSA clustered into OTUs. As a test case, multiple probes were computationally picked for each of the 27 OTUs that were identified within the Staphylococcus Group. The probes were incorporated into a customized microarray and were able to correctly categorize Staphylococcus aureus and Bacillus anthracis into their correct OTUs. Although a successful probe picking strategy is outlined, the main focus of creating the prokMSA was to provide a comprehensive, categorized, updateable 16S rDNA collection useful as a foundation for any probe selection algorithm.

摘要

动机

已利用16S rRNA基因的序列变异鉴定出原核生物。这些变异指导用于检测分类群或特定生物的DNA探针的设计。我们项目的长期目标是创建能够识别未知样品中16S rDNA序列的探针阵列。这就需要对超过75000条公开可用的“16S”序列进行鉴定、分类和比对。最好整个过程能通过计算进行管理,以便比对后的序列集合能够定期吸收来自公共记录的16S rDNA序列。完整的多序列比对将为计算探针选择提供基础,并有助于微生物分类学和系统发育研究。

结果

在此我们报告了62662条16S rDNA序列的比对和相似性聚类,以及为每个聚类设计有效探针的方法。设计了一种新颖的比对压缩算法——NAST(最近比对空间终止),以产生称为原核生物多序列比对(prokMSA)的统一多序列比对。从prokMSA中,基于传递性序列相似性发现了9020个操作分类单元(OTU)。使用聚类成OTU的prokMSA,一种自动探针设计方法很简单。作为一个测试案例,为葡萄球菌属中鉴定出的27个OTU中的每一个都通过计算挑选了多个探针。这些探针被整合到定制的微阵列中,能够将金黄色葡萄球菌和炭疽芽孢杆菌正确分类到它们各自的OTU中。虽然概述了一种成功的探针挑选策略,但创建prokMSA的主要重点是提供一个全面、分类、可更新的16S rDNA集合,作为任何探针选择算法的基础。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验