Li Jianan, Yanagisawa Keisuke, Yoshikawa Yasushi, Ohue Masahito, Akiyama Yutaka
Department of Computer Science, School of Computing, Tokyo Institute of Technology, Meguro-ku, Tokyo 152-8550, Japan.
AIST-TokyoTech Real World Big-Data Computation Open Innovation Laboratory (RWBC-OIL), National Institute of Advanced Industrial Science and Technology, Tsukuba, Ibaraki 305-8560, Japan.
Bioinformatics. 2022 Jan 27;38(4):1110-1117. doi: 10.1093/bioinformatics/btab726.
In recent years, cyclic peptide drugs have been receiving increasing attention because they can target proteins that are difficult to be tackled by conventional small-molecule drugs or antibody drugs. Plasma protein binding rate (%PPB) is a significant pharmacokinetic property of a compound in drug discovery and design. However, due to structural differences, previous computational prediction methods developed for small-molecule compounds cannot be successfully applied to cyclic peptides, and methods for predicting the PPB rate of cyclic peptides with high accuracy are not yet available.
Cyclic peptides are larger than small molecules, and their local structures have a considerable impact on PPB; thus, molecular descriptors expressing residue-level local features of cyclic peptides, instead of those expressing the entire molecule, as well as the circularity of the cyclic peptides should be considered. Therefore, we developed a prediction method named CycPeptPPB using deep learning that considers both factors. First, the macrocycle ring of cyclic peptides was decomposed residue by residue. The residue-based descriptors were arranged according to the sequence information of the cyclic peptide. Furthermore, the circular data augmentation method was used, and the circular convolution method CyclicConv was devised to express the cyclic structure. CycPeptPPB exhibited excellent performance, with mean absolute error (MAE) of 4.79% and correlation coefficient (R) of 0.92 for the public drug dataset, compared to the prediction performance of the existing PPB rate prediction software (MAE=15.08%, R=0.63).
The data underlying this article are available in the online supplementary material. The source code of CycPeptPPB is available at https://github.com/akiyamalab/cycpeptppb.
Supplementary data are available at Bioinformatics online.
近年来,环肽药物越来越受到关注,因为它们可以靶向传统小分子药物或抗体药物难以对付的蛋白质。血浆蛋白结合率(%PPB)是药物发现和设计中化合物的一项重要药代动力学性质。然而,由于结构差异,先前为小分子化合物开发的计算预测方法无法成功应用于环肽,目前尚无高精度预测环肽PPB率的方法。
环肽比小分子大,其局部结构对PPB有相当大的影响;因此,应考虑表达环肽残基水平局部特征而非整个分子特征的分子描述符,以及环肽的环状性。因此,我们利用深度学习开发了一种名为CycPeptPPB的预测方法,该方法考虑了这两个因素。首先,将环肽的大环逐个残基分解。基于残基的描述符根据环肽的序列信息排列。此外,使用了循环数据增强方法,并设计了循环卷积方法CyclicConv来表达环状结构。与现有PPB率预测软件的预测性能(平均绝对误差MAE = 15.08%,相关系数R = 0.63)相比,CycPeptPPB表现出优异的性能,对于公共药物数据集,平均绝对误差为4.79%,相关系数为0.92。
本文的基础数据可在在线补充材料中获取。CycPeptPPB的源代码可在https://github.com/akiyamalab/cycpeptppb获取。
补充数据可在《生物信息学》在线获取。