Department of Biomolecular Engineering, Graduate School of Engineering, Nagoya University, Nagoya 464-8603, Japan.
Department of Basic Medicinal Sciences, Graduate School of Pharmaceutical Sciences, Nagoya 464-8601, Japan.
ACS Biomater Sci Eng. 2020 Nov 9;6(11):6117-6125. doi: 10.1021/acsbiomaterials.0c01010. Epub 2020 Oct 2.
We developed a method for efficiently activating functional peptides with a large structural contribution using the peptide-searching method with machine learning. The physicochemical properties of the amino acids were employed as variables. As a model peptide, we used GHWYYRCW, which is a functional peptide that inhibits α-amylase derived from human pancreatic juice. First, training data were acquired. A total of 153 peptides were prepared in which 1 amino acid in GHWYYRCW was replaced to construct a 1-amino acid substitution coverage peptide library. The inhibitory activity of each peptide against α-amylase and α-glucosidase was evaluated. Second, random forest (RF) regression analysis was performed using 120 variables, and the enzyme inhibitory activity of the peptide was related to the physicochemical properties. The constructed model had many features describing the charge of the amino acid (isoelectric point and p). Then, high inhibitory (HI) peptides were predicted using a library of peptides with 2- or 3-amino acid substitution as test data, which were called HI2 and HI3 peptides. As results, the first or seventh amino acid of the HI2 peptide was replaced with Arg, Trp, or Tyr. We found that all 30 HI2 peptides had significantly higher activity than the original sequence (100%) and 26 of the 30 HI3 peptides were significantly active (86.7%). However, the actual inhibitory activity of the HI3 peptides was improved to a lesser extent. The docking simulation clarified that the CDOCKER energy decrease was roughly correlated with the inhibitory activity. The machine learning-based predictive model was a promising tool for design of substituted peptides with high activity values, and it was assumed that the advanced model that forecasts the interaction index such as the CDOCKER energy substituting for the inhibitory activity would be used to design HI peptides, even in the case of the HI3 peptides.
我们开发了一种使用机器学习的肽搜索方法来有效激活具有较大结构贡献的功能肽的方法。将氨基酸的物理化学性质用作变量。作为模型肽,我们使用 GHWYYRCW,它是一种抑制来自人胰液的α-淀粉酶的功能肽。首先,获取训练数据。总共制备了 153 个肽,其中 GHWYYRCW 中的 1 个氨基酸被替换以构建 1 个氨基酸取代覆盖率肽文库。评估了每个肽对α-淀粉酶和α-葡萄糖苷酶的抑制活性。其次,使用 120 个变量进行随机森林(RF)回归分析,并且肽的酶抑制活性与物理化学性质相关。构建的模型具有许多描述氨基酸电荷的特征(等电点和 p)。然后,使用具有 2 或 3 个氨基酸取代的肽文库作为测试数据来预测高抑制(HI)肽,这些肽称为 HI2 和 HI3 肽。结果,HI2 肽的第 1 或第 7 位氨基酸被 Arg、Trp 或 Tyr 取代。我们发现所有 30 个 HI2 肽的活性均明显高于原始序列(100%),并且 30 个 HI3 肽中的 26 个活性明显(86.7%)。但是,HI3 肽的实际抑制活性的提高程度较小。对接模拟表明 CDOCKER 能量降低与抑制活性大致相关。基于机器学习的预测模型是设计具有高活性值的取代肽的有前途的工具,并且假设可以使用预测诸如 CDOCKER 能量的相互作用指数的高级模型来替代抑制活性来设计 HI 肽,即使在 HI3 肽的情况下也是如此。