Landstorfer Richard, Simon Svenja, Schober Steffen, Keim Daniel, Scherer Siegfried, Neuhaus Klaus
Lehrstuhl für Mikrobielle Ökologie, Wissenschaftszentrum Weihenstephan, Technische Universität München, Weihenstephaner Berg 3, D-85350 Freising, Germany.
BMC Genomics. 2014 May 9;15:353. doi: 10.1186/1471-2164-15-353.
Multiple infection sources for enterohemorrhagic Escherichia coli O157:H7 (EHEC) are known, including animal products, fruit and vegetables. The ecology of this pathogen outside its human host is largely unknown and one third of its annotated genes are still hypothetical. To identify genetic determinants expressed under a variety of environmental factors, we applied strand-specific RNA-sequencing, comparing the SOLiD and Illumina systems.
Transcriptomes of EHEC were sequenced under 11 different biotic and abiotic conditions: LB medium at pH4, pH7, pH9, or at 15°C; LB with nitrite or trimethoprim-sulfamethoxazole; LB-agar surface, M9 minimal medium, spinach leaf juice, surface of living radish sprouts, and cattle feces. Of 5379 annotated genes in strain EDL933 (genome and plasmid), a surprising minority of only 144 had null sequencing reads under all conditions. We therefore developed a statistical method to distinguish weakly transcribed genes from background transcription. We find that 96% of all genes and 91.5% of the hypothetical genes exhibit a significant transcriptional signal under at least one condition. Comparing SOLiD and Illumina systems, we find a high correlation between both approaches for fold-changes of the induced or repressed genes. The pathogenicity island LEE showed highest transcriptional activity in LB medium, minimal medium, and after treatment with antibiotics. Unique sets of genes, including many hypothetical genes, are highly up-regulated on radish sprouts, cattle feces, or in the presence of antibiotics. Furthermore, we observed induction of the shiga-toxin carrying phages by antibiotics and confirmed active biofilm related genes on radish sprouts, in cattle feces, and on agar plates.
Since only a minority of genes (2.7%) were not active under any condition tested (null reads), we suggest that the assumption of significant genome over-annotations is wrong. Environmental transcriptomics uncovered hitherto unknown gene functions and unique regulatory patterns in EHEC. For instance, the environmental function of azoR had been elusive, but this gene is highly active on radish sprouts. Thus, NGS-transcriptomics is an appropriate technique to propose new roles of hypothetical genes and to guide future research.
已知肠出血性大肠杆菌O157:H7(EHEC)有多种感染源,包括动物产品、水果和蔬菜。该病原体在人类宿主之外的生态环境很大程度上未知,其注释基因中有三分之一仍是假设性的。为了鉴定在各种环境因素下表达的遗传决定因素,我们应用链特异性RNA测序,比较了SOLiD和Illumina系统。
在11种不同的生物和非生物条件下对EHEC的转录组进行了测序:pH4、pH7、pH9的LB培养基,或15°C的LB培养基;含亚硝酸盐或甲氧苄啶-磺胺甲恶唑的LB培养基;LB琼脂表面、M9基本培养基、菠菜叶汁、活萝卜芽表面和牛粪。在菌株EDL933(基因组和质粒)的5379个注释基因中,只有144个在所有条件下测序读数为零,这一比例令人惊讶地低。因此,我们开发了一种统计方法来区分弱转录基因和背景转录。我们发现,所有基因的96%和假设基因的91.5%在至少一种条件下表现出显著的转录信号。比较SOLiD和Illumina系统,我们发现两种方法对诱导或抑制基因的倍数变化具有高度相关性。致病岛LEE在LB培养基、基本培养基和抗生素处理后表现出最高的转录活性。包括许多假设基因在内的独特基因集在萝卜芽、牛粪或存在抗生素的情况下高度上调。此外,我们观察到抗生素诱导携带志贺毒素的噬菌体,并证实了萝卜芽、牛粪和琼脂平板上与生物膜相关的活性基因。
由于只有少数基因(2.7%)在任何测试条件下都不活跃(测序读数为零),我们认为基因组过度注释的假设是错误的。环境转录组学揭示了EHEC中迄今未知的基因功能和独特的调控模式。例如,azoR的环境功能一直难以捉摸,但该基因在萝卜芽上高度活跃。因此,NGS转录组学是一种合适的技术,可用于提出假设基因的新作用并指导未来的研究。