Aghayev Zahir, Walker George F, Iseri Funda, Ali Moustafa, Szafran Adam T, Stossi Fabio, Mancini Michael A, Pistikopoulos Efstratios N, Beykal Burcu
Department of Chemical and Biomolecular Engineering, University of Connecticut, Storrs, CT 06269, USA.
Center for Clean Energy Engineering, University of Connecticut, Storrs, CT 06269, USA.
ESCAPE. 2023;52:2631-2636. doi: 10.1016/b978-0-443-15274-0.50418-2. Epub 2023 Jul 18.
We develop a machine learning framework that integrates high content/high throughput image analysis and artificial neural networks (ANNs) to model the separation between chemical compounds based on their estrogenic receptor activity. Natural and man-made chemicals have the potential to disrupt the endocrine system by interfering with hormone actions in people and wildlife. Although numerous studies have revealed new knowledge on the mechanism through which these compounds interfere with various hormone receptors, it is still a very challenging task to comprehensively evaluate the endocrine disrupting potential of all existing chemicals and their mixtures by pure or approaches. Machine learning offers a unique advantage in the rapid evaluation of chemical toxicity through learning the underlying patterns in the experimental biological activity data. Motivated by this, we train and test ANN classifiers for modeling the activity of estrogen receptor-α agonists and antagonists at the single-cell level by using high throughput/high content microscopy descriptors. Our framework preprocesses the experimental data by cleaning, scaling, and feature engineering where only the middle 50% of the values from each sample with detectable receptor-DNA binding is considered in the dataset. Principal component analysis is also used to minimize the effects of experimental noise in modeling where these projected features are used in classification model building. The results show that our ANN-based nonlinear data-driven framework classifies the benchmark agonist and antagonist chemicals with 98.41% accuracy.
我们开发了一个机器学习框架,该框架集成了高内涵/高通量图像分析和人工神经网络(ANN),以基于化合物的雌激素受体活性对其进行分离建模。天然和人造化学物质有可能通过干扰人类和野生动物体内的激素作用来扰乱内分泌系统。尽管众多研究揭示了这些化合物干扰各种激素受体的机制的新知识,但通过纯实验或其他方法全面评估所有现有化学物质及其混合物的内分泌干扰潜力仍然是一项极具挑战性的任务。机器学习在通过学习实验生物活性数据中的潜在模式来快速评估化学毒性方面具有独特优势。受此启发,我们通过使用高通量/高内涵显微镜描述符,训练和测试ANN分类器,以在单细胞水平上对雌激素受体-α激动剂和拮抗剂的活性进行建模。我们的框架通过清理、缩放和特征工程对实验数据进行预处理,数据集中仅考虑每个具有可检测受体-DNA结合的样本中间50%的值。主成分分析也用于在建模中最小化实验噪声的影响,这些投影特征用于分类模型构建。结果表明,我们基于ANN的非线性数据驱动框架对基准激动剂和拮抗剂化学物质的分类准确率为98.41%。