• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于复合推理的双层模型中的二次描述符和约简方法。

Quadratic descriptors and reduction methods in a two-layered model for compound inference.

作者信息

Zhu Jianshen, Azam Naveed Ahmed, Cao Shengjuan, Ido Ryota, Haraguchi Kazuya, Zhao Liang, Nagamochi Hiroshi, Akutsu Tatsuya

机构信息

Department of Applied Mathematics and Physics, Graduate School of Informatics, Kyoto University, Kyoto, Japan.

Department of Mathematics, Quaid-i-Azam University, Islamabad, Pakistan.

出版信息

Front Genet. 2025 Jan 29;15:1483490. doi: 10.3389/fgene.2024.1483490. eCollection 2024.

DOI:10.3389/fgene.2024.1483490
PMID:39944352
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11814466/
Abstract

Compound inference models are crucial for discovering novel drugs in bioinformatics and chemo-informatics. These models rely heavily on useful descriptors of chemical compounds that effectively capture important information about the underlying compounds for constructing accurate prediction functions. In this article, we introduce quadratic descriptors, the products of two graph-theoretic descriptors, to enhance the learning performance of a novel two-layered compound inference model. A mixed-integer linear programming formulation is designed to approximate these quadratic descriptors for inferring desired compounds with the two-layered model. Furthermore, we introduce different methods to reduce descriptors, aiming to avoid computational complexity and overfitting issues during the learning process caused by the large number of quadratic descriptors. Experimental results show that for 32 chemical properties of monomers and 10 chemical properties of polymers, the prediction functions constructed by the proposed method achieved high test coefficients of determination. Furthermore, our method inferred chemical compounds in a time ranging from a few seconds to approximately 60 s. These results indicate a strong correlation between the properties of chemical graphs and their quadratic graph-theoretic descriptors.

摘要

复合推理模型对于生物信息学和化学信息学中发现新型药物至关重要。这些模型严重依赖于化学化合物的有用描述符,这些描述符能有效捕获有关基础化合物的重要信息,以构建准确的预测函数。在本文中,我们引入二次描述符,即两个图论描述符的乘积,以提高新型两层复合推理模型的学习性能。设计了一种混合整数线性规划公式来近似这些二次描述符,以便用两层模型推断所需化合物。此外,我们引入了不同的方法来减少描述符,旨在避免学习过程中因大量二次描述符导致的计算复杂性和过拟合问题。实验结果表明,对于单体的32种化学性质和聚合物的10种化学性质,所提方法构建的预测函数实现了较高的测试决定系数。此外,我们的方法推断化学化合物的时间从几秒到大约60秒不等。这些结果表明化学图的性质与其二次图论描述符之间存在很强的相关性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6c/11814466/8fed6b1987d3/fgene-15-1483490-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6c/11814466/0dda988be18f/fgene-15-1483490-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6c/11814466/f7462ea71c42/fgene-15-1483490-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6c/11814466/6121aecd9b0a/fgene-15-1483490-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6c/11814466/8fed6b1987d3/fgene-15-1483490-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6c/11814466/0dda988be18f/fgene-15-1483490-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6c/11814466/f7462ea71c42/fgene-15-1483490-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6c/11814466/6121aecd9b0a/fgene-15-1483490-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/aa6c/11814466/8fed6b1987d3/fgene-15-1483490-g004.jpg

相似文献

1
Quadratic descriptors and reduction methods in a two-layered model for compound inference.用于复合推理的双层模型中的二次描述符和约简方法。
Front Genet. 2025 Jan 29;15:1483490. doi: 10.3389/fgene.2024.1483490. eCollection 2024.
2
A unified approach to inferring chemical compounds with the desired aqueous solubility.一种推断具有所需水溶性的化合物的统一方法。
J Cheminform. 2025 Mar 26;17(1):37. doi: 10.1186/s13321-025-00966-w.
3
A novel method for inference of acyclic chemical compounds with bounded branch-height based on artificial neural networks and integer programming.一种基于人工神经网络和整数规划的推断具有有界分支高度的无环化合物的新方法。
Algorithms Mol Biol. 2021 Aug 14;16(1):18. doi: 10.1186/s13015-021-00197-2.
4
An Inverse QSAR Method Based on a Two-Layered Model and Integer Programming.基于双层模型和整数规划的逆定量构效关系方法。
Int J Mol Sci. 2021 Mar 11;22(6):2847. doi: 10.3390/ijms22062847.
5
An Inverse QSAR Method Based on Linear Regression and Integer Programming.基于线性回归和整数规划的逆定量构效关系方法。
Front Biosci (Landmark Ed). 2022 Jun 10;27(6):188. doi: 10.31083/j.fbl2706188.
6
A Method for Inferring Polymers Based on Linear Regression and Integer Programming.一种基于线性回归和整数规划的聚合物推断方法。
IEEE/ACM Trans Comput Biol Bioinform. 2024 Nov-Dec;21(6):1623-1632. doi: 10.1109/TCBB.2024.3447780. Epub 2024 Dec 10.
7
QuBiLS-MAS, open source multi-platform software for atom- and bond-based topological (2D) and chiral (2.5D) algebraic molecular descriptors computations.QuBiLS-MAS,一款用于基于原子和键的拓扑(二维)和手性(2.5维)代数分子描述符计算的开源多平台软件。
J Cheminform. 2017 Jun 7;9(1):35. doi: 10.1186/s13321-017-0211-5.
8
Molecular Design Based on Integer Programming and Splitting Data Sets by Hyperplanes.基于整数规划的分子设计和超平面数据集分割。
IEEE/ACM Trans Comput Biol Bioinform. 2024 Sep-Oct;21(5):1529-1541. doi: 10.1109/TCBB.2024.3402675. Epub 2024 Oct 9.
9
Chemical graphs, molecular matrices and topological indices in chemoinformatics and quantitative structure-activity relationships.化学信息学与定量构效关系中的化学图、分子矩阵和拓扑指数
Curr Comput Aided Drug Des. 2013 Jun;9(2):153-63. doi: 10.2174/1573409911309020002.
10
What are the limits of applicability for graph theoretic descriptors in QSPR/QSAR? Modeling dipole moments of aromatic compounds with TOPS-MODE descriptors.定量构效关系/定量结构活性关系中,图论描述符的适用范围有哪些?使用TOPS-MODE描述符对芳香族化合物的偶极矩进行建模。
J Chem Inf Comput Sci. 2003 Jan-Feb;43(1):75-84. doi: 10.1021/ci025604w.

本文引用的文献

1
A Method for Inferring Polymers Based on Linear Regression and Integer Programming.一种基于线性回归和整数规划的聚合物推断方法。
IEEE/ACM Trans Comput Biol Bioinform. 2024 Nov-Dec;21(6):1623-1632. doi: 10.1109/TCBB.2024.3447780. Epub 2024 Dec 10.
2
Machine learning in preclinical drug discovery.机器学习在临床前药物发现中的应用。
Nat Chem Biol. 2024 Aug;20(8):960-973. doi: 10.1038/s41589-024-01679-1. Epub 2024 Jul 19.
3
Comprehensive Survey of Recent Drug Discovery Using Deep Learning.深度学习在药物发现中的最新应用综述
Int J Mol Sci. 2021 Sep 15;22(18):9983. doi: 10.3390/ijms22189983.
4
A novel method for inference of acyclic chemical compounds with bounded branch-height based on artificial neural networks and integer programming.一种基于人工神经网络和整数规划的推断具有有界分支高度的无环化合物的新方法。
Algorithms Mol Biol. 2021 Aug 14;16(1):18. doi: 10.1186/s13015-021-00197-2.
5
An Inverse QSAR Method Based on a Two-Layered Model and Integer Programming.基于双层模型和整数规划的逆定量构效关系方法。
Int J Mol Sci. 2021 Mar 11;22(6):2847. doi: 10.3390/ijms22062847.
6
Graph neural networks for automated de novo drug design.图神经网络在药物从头设计中的应用。
Drug Discov Today. 2021 Jun;26(6):1382-1393. doi: 10.1016/j.drudis.2021.02.011. Epub 2021 Feb 17.
7
From Big Data to Artificial Intelligence: chemoinformatics meets new challenges.从大数据到人工智能:化学信息学面临新挑战。
J Cheminform. 2020 Dec 18;12(1):74. doi: 10.1186/s13321-020-00475-y.
8
A New Machine-Learning Tool for Fast Estimation of Liquid Viscosity. Application to Cosmetic Oils.一种用于快速估算液体粘度的新机器学习工具。在化妆品油中的应用。
J Chem Inf Model. 2020 Apr 27;60(4):2012-2023. doi: 10.1021/acs.jcim.0c00083. Epub 2020 Apr 13.
9
Calculation of the Isobaric Heat Capacities of the Liquid and Solid Phase of Organic Compounds at and around 298.15 K Based on Their "True" Molecular Volume.基于“真实”分子体积计算有机化合物在 298.15 K 及附近的液相和固相等压热容。
Molecules. 2019 Apr 24;24(8):1626. doi: 10.3390/molecules24081626.
10
Neural network and deep-learning algorithms used in QSAR studies: merits and drawbacks.QSAR研究中使用的神经网络和深度学习算法:优点与缺点。
Drug Discov Today. 2018 Oct;23(10):1784-1790. doi: 10.1016/j.drudis.2018.06.016. Epub 2018 Jun 21.