Suppr超能文献

伤寒沙门氏菌基因组的样本序列与完整的大肠杆菌K-12基因组序列的比较。

Comparison of sample sequences of the Salmonella typhi genome to the sequence of the complete Escherichia coli K-12 genome.

作者信息

McClelland M, Wilson R K

机构信息

Sidney Kimmel Cancer Center, San Diego, California 92121, USA.

出版信息

Infect Immun. 1998 Sep;66(9):4305-12. doi: 10.1128/IAI.66.9.4305-4312.1998.

Abstract

Raw sequence data representing the majority of a bacterial genome can be obtained at a tiny fraction of the cost of a completed sequence. To demonstrate the utility of such a resource, 870 single-stranded M13 clones were sequenced from a shotgun library of the Salmonella typhi Ty2 genome. The sequence reads averaged over 400 bases and sampled the genome with an average spacing of once every 5,000 bases. A total of 339,243 bases of unique sequence was generated (approximately 7% representation). The sample of 870 sequences was compared to the complete Escherichia coli K-12 genome and to the rest of the GenBank database, which can also be considered a collection of sampled sequences. Despite the incomplete S. typhi data set, interesting categories could easily be discerned. Sixteen percent of the sequences determined from S. typhi had close homologs among known Salmonella sequences (P < 1e-40 in BlastX or BlastN), reflecting the proportion of these genomes that have been sequenced previously; 277 sequences (32%) had no apparent orthologs in the complete E. coli K-12 genome (P > 1e-20), of which 155 sequences (18%) had no close similarities to any sequence in the database (P > 1e-5). Eight of the 277 sequences had similarities to genes in other strains of E. coli or plasmids, and six sequences showed evidence of novel phage lysogens or sequence remnants of phage integrations, including a member of the lambda family (P < 1e-15). Twenty-three sample sequences had a significantly closer similarity a sequence in the database from organisms other than the E. coli/Salmonella clade (which includes Shigella and Citrobacter). These sequences are new candidate lateral transfer events to the S. typhi lineage or deletions on the E. coli K-12 lineage. Eleven putative junctions of insertion/deletion events greater than 100 bp were observed in the sample, indicating that well over 150 such events may distinguish S. typhi from E. coli K-12. The need for automatic methods to more effectively exploit sample sequences is discussed.

摘要

代表大多数细菌基因组的原始序列数据,其获取成本仅为完整测序成本的极小一部分。为了证明这种资源的实用性,从伤寒沙门氏菌Ty2基因组的鸟枪法文库中对870个单链M13克隆进行了测序。序列读数平均超过400个碱基,对基因组进行采样,平均间距为每5000个碱基一次。共生成了339,243个碱基的独特序列(约占7%)。将这870个序列的样本与完整的大肠杆菌K-12基因组以及GenBank数据库的其余部分进行了比较,后者也可被视为一个采样序列的集合。尽管伤寒沙门氏菌数据集不完整,但仍能轻松辨别出有趣的类别。从伤寒沙门氏菌确定的序列中有16%在已知沙门氏菌序列中有密切的同源物(在BlastX或BlastN中P < 1e-40),反映了这些基因组中先前已测序的比例;277个序列(32%)在完整的大肠杆菌K-12基因组中没有明显的直系同源物(P > 1e-20),其中155个序列(18%)与数据库中的任何序列都没有密切相似性(P > 1e-5)。277个序列中的8个与其他大肠杆菌菌株或质粒中的基因有相似性,6个序列显示出新型噬菌体溶原菌或噬菌体整合序列残余的证据,包括一个λ家族成员(P < 1e-15)。23个样本序列与数据库中来自大肠杆菌/沙门氏菌进化枝以外的生物体(包括志贺氏菌和柠檬酸杆菌)的序列有明显更密切的相似性。这些序列是向伤寒沙门氏菌谱系的新的横向转移事件候选者或大肠杆菌K-12谱系上的缺失。在样本中观察到11个大于100 bp的插入/缺失事件的推定连接点,表明超过150个这样的事件可能区分伤寒沙门氏菌和大肠杆菌K-12。文中讨论了对更有效利用样本序列的自动方法的需求。

相似文献

6
The genome of Salmonella enterica serovar Typhi.伤寒沙门氏菌血清型 Typhi 的基因组。
Clin Infect Dis. 2007 Jul 15;45 Suppl 1:S29-33. doi: 10.1086/518143.
8
Rearrangements in the genome of the bacterium Salmonella typhi.伤寒沙门氏菌基因组的重排
Proc Natl Acad Sci U S A. 1995 Feb 14;92(4):1018-22. doi: 10.1073/pnas.92.4.1018.

引用本文的文献

5
Genomics and bacterial pathogenesis.基因组学与细菌致病机制
Emerg Infect Dis. 2000 Sep-Oct;6(5):496-504. doi: 10.3201/eid0605.000509.
6
Search and discovery strategies for biotechnology: the paradigm shift.生物技术的搜索与发现策略:范式转变
Microbiol Mol Biol Rev. 2000 Sep;64(3):573-606. doi: 10.1128/MMBR.64.3.573-606.2000.

本文引用的文献

1
Metabolic databases.代谢数据库。
Trends Biochem Sci. 1998 Mar;23(3):114-6. doi: 10.1016/s0968-0004(98)01184-0.
4
The complete genome sequence of Escherichia coli K-12.大肠杆菌K-12的全基因组序列。
Science. 1997 Sep 5;277(5331):1453-62. doi: 10.1126/science.277.5331.1453.
9
The metabolic pathway collection: an update.代谢途径集合:最新进展。
Nucleic Acids Res. 1997 Jan 1;25(1):37-8. doi: 10.1093/nar/25.1.37.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验