Yamane Junko, Aburatani Sachiyo, Imanishi Satoshi, Akanuma Hiromi, Nagano Reiko, Kato Tsuyoshi, Sone Hideko, Ohsako Seiichiroh, Fujibuchi Wataru
Center for iPS Cell Research and Application, Kyoto University, 53 Kawahara-cho, Shogoin, Sakyo-ku, Kyoto 606-8507, Japan Center for Disease Biology and Integrative Medicine, Graduate School of Medicine, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-8654, Japan.
Computational Biology Research Center, Advanced Industrial Science and Technology (AIST), 2-4-7 Aomi, Koto-ku, Tokyo 135-0064, Japan.
Nucleic Acids Res. 2016 Jul 8;44(12):5515-28. doi: 10.1093/nar/gkw450. Epub 2016 May 20.
Predictive toxicology using stem cells or their derived tissues has gained increasing importance in biomedical and pharmaceutical research. Here, we show that toxicity category prediction by support vector machines (SVMs), which uses qRT-PCR data from 20 categorized chemicals based on a human embryonic stem cell (hESC) system, is improved by the adoption of gene networks, in which network edge weights are added as feature vectors when noisy qRT-PCR data fail to make accurate predictions. The accuracies of our system were 97.5-100% for three toxicity categories: neurotoxins (NTs), genotoxic carcinogens (GCs) and non-genotoxic carcinogens (NGCs). For two uncategorized chemicals, bisphenol-A and permethrin, our system yielded reasonable results: bisphenol-A was categorized as an NGC, and permethrin was categorized as an NT; both predictions were supported by recently published papers. Our study has two important features: (i) as the first study to employ gene networks without using conventional quantitative structure-activity relationships (QSARs) as input data for SVMs to analyze toxicogenomics data in an hESC validation system, it uses additional information of gene-to-gene interactions to significantly increase prediction accuracies for noisy gene expression data; and (ii) using only undifferentiated hESCs, our study has considerable potential to predict late-onset chemical toxicities, including abnormalities that occur during embryonic development.
在生物医学和制药研究中,利用干细胞或其衍生组织进行预测毒理学研究的重要性日益凸显。在此,我们表明,基于人类胚胎干细胞(hESC)系统,通过支持向量机(SVM)利用20种分类化学品的qRT-PCR数据进行毒性类别预测时,采用基因网络可提高预测准确性。当有噪声的qRT-PCR数据无法做出准确预测时,基因网络的边权重作为特征向量添加到模型中。对于神经毒素(NTs)、遗传毒性致癌物(GCs)和非遗传毒性致癌物(NGCs)这三种毒性类别,我们系统的预测准确率为97.5%-100%。对于两种未分类的化学品双酚A和氯菊酯,我们的系统给出了合理的结果:双酚A被分类为非遗传毒性致癌物,氯菊酯被分类为神经毒素;这两种预测都得到了最近发表论文的支持。我们的研究有两个重要特点:(i)作为第一项在hESC验证系统中不使用传统定量构效关系(QSARs)作为SVM输入数据,而是利用基因网络来分析毒理基因组学数据的研究,它利用基因间相互作用的额外信息显著提高了对有噪声基因表达数据的预测准确率;(ii)仅使用未分化的hESC,我们的研究在预测包括胚胎发育过程中出现的异常在内的迟发性化学毒性方面具有很大潜力。