Department of Computer Science and Engineering, Bagley College of Engineering, Mississippi State University, Mississippi 39762, USA.
Toxicol Sci. 2012 Sep;129(1):57-73. doi: 10.1093/toxsci/kfs201. Epub 2012 Jun 13.
No method has been reported to predict, even approximately, the impact of mild-to-moderate changes in several immunological parameters on resistance to infection. The ability to make such predictions would be useful in risk assessment. In addition, equations that predict host resistance on the basis of changes in components of a complex biological system (the immune system) would fulfill one of the major goals of systems biology. In this study, multiple machine learning classification methods were used to predict the effects of a series of drugs and chemicals on host resistance to Listeria monocytogenes in mice on the basis of changes in several holistic immunological parameters. A data set produced under the sponsorship of the National Toxicology Program (NTP) was used in this study. The NTP data set was found to have a high percentage of missing data and to be noisy (probably due to the intrinsically stochastic nature of immune responses). Data preprocessing steps were used to mitigate these problems. In evaluating the machine learning classifiers, we first randomly partitioned the NTP data set into 10 subsets. Each time, we used nine subsets of the data to train the machine learning classifiers, and the remaining single subset to predict outcomes with regard to host resistance. This process was repeated until all 10 combinations of the 9-1 split of the subsets have been tested. The best of the classifiers predicted host resistance outcome correctly for 94.7% of cases, a result which indicates it is possible to identify mathematical expressions that will be useful for risk assessment and to establish a basis for systems immunology.
尚未有方法可以预测,即使是大致预测,轻度至中度改变几个免疫学参数对感染抵抗力的影响。能够进行这种预测将有助于风险评估。此外,基于复杂生物系统(免疫系统)成分变化来预测宿主抵抗力的方程将实现系统生物学的主要目标之一。在这项研究中,使用了多种机器学习分类方法,根据一系列整体免疫学参数的变化,预测药物和化学品对感染李斯特菌的小鼠宿主抵抗力的影响。本研究使用了由国家毒理学计划(NTP)资助产生的数据集。发现 NTP 数据集具有很高的缺失数据百分比并且存在噪声(可能是由于免疫反应的固有随机性)。使用数据预处理步骤来减轻这些问题。在评估机器学习分类器时,我们首先将 NTP 数据集随机划分为 10 个子集。每次,我们使用数据集的九个子集来训练机器学习分类器,而剩余的单个子集则用于预测宿主抵抗力的结果。此过程重复进行,直到已经测试了所有 9-1 子集划分的 10 种组合。最佳的分类器正确预测了 94.7%的宿主抵抗力结果,这表明有可能确定有助于风险评估的数学表达式,并为系统免疫学建立基础。