Nagahama Institute of Bio-Science and Technology, Tamura-cho 1266, Nagahama-shi, Shiga-ken 526-0829, Japan.
DNA Res. 2011 Apr;18(2):125-36. doi: 10.1093/dnares/dsr005. Epub 2011 Mar 28.
Influenza virus poses a significant threat to public health, as exemplified by the recent introduction of the new pandemic strain H1N1/09 into human populations. Pandemics have been initiated by the occurrence of novel changes in animal sources that eventually adapt to human. One important issue in studies of viral genomes, particularly those of influenza virus, is to predict possible changes in genomic sequence that will become hazardous. We previously established a clustering method termed 'BLSOM' (batch-learning self-organizing map) that does not depend on sequence alignment and can characterize and compare even 1 million genomic sequences in one run. Strategies for comparing a vast number of genomic sequences simultaneously become increasingly important in genome studies because of remarkable progresses in nucleotide sequencing. In this study, we have constructed BLSOMs based on the oligonucleotide and codon composition of all influenza A viral strains available. Without prior information with regard to their hosts, sequences derived from strains isolated from avian or human sources were successfully clustered according to the hosts. Notably, the pandemic H1N1/09 strains have oligonucleotide and codon compositions that are clearly different from those of human seasonal influenza A strains. This enables us to infer future directional changes in the influenza A viral genome.
流感病毒对公共卫生构成重大威胁,最近引入的新型大流行株 H1N1/09 就是一个例证。大流行是由动物源的新型变化引起的,这些变化最终适应了人类。在病毒基因组研究中,一个重要问题是预测可能发生的基因组序列变化,这些变化将变得危险。我们之前建立了一种聚类方法,称为“BLSOM”(批处理学习自组织映射),它不依赖于序列比对,并且可以在一次运行中对即使是 100 万个基因组序列进行特征化和比较。由于核苷酸测序的显著进展,在基因组研究中,同时比较大量基因组序列的策略变得越来越重要。在这项研究中,我们根据所有可用的甲型流感病毒株的寡核苷酸和密码子组成构建了 BLSOM。根据宿主,成功地根据宿主对来自禽源或人源的分离株进行了聚类,而无需关于其宿主的先验信息。值得注意的是,大流行的 H1N1/09 株的寡核苷酸和密码子组成明显不同于人类季节性流感 A 株。这使我们能够推断甲型流感病毒基因组的未来方向性变化。