• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于化学结构编码的N线性代数映射:对原子对方法的适当推广?

N-linear algebraic maps for chemical structure codification: a suitable generalization for atom-pair approaches?

作者信息

Garcia-Jacas Cesar R, Marrero-Ponce Yovani, Barigye Stephen J, Valdes-Martini Jose R, Rivera-Borroto Oscar M, Olivero-Verbel Jesus

机构信息

Unit of Computer-Aided Molecular "Biosilico" Discovery and Bioinformatic Research (CAMD-BIR Unit), Faculty of Chemistry-Pharmacy. Universidad Central "Martha Abreu" de Las Villas, Santa Clara, 54830, Villa Clara, Cuba.

出版信息

Curr Drug Metab. 2014;15(4):441-69. doi: 10.2174/1389200215666140605124506.

DOI:10.2174/1389200215666140605124506
PMID:24909423
Abstract

The present manuscript introduces, for the first time, a novel 3D-QSAR alignment free method (QuBiLS-MIDAS) based on tensor concepts through the use of the three-linear and four-linear algebraic forms as specific cases of n-linear maps. To this end, the k(th) three-tuple and four-tuple spatial-(dis)similarity matrices are defined, as tensors of order 3 and 4, respectively, to represent 3Dinformation among "three and four" atoms of the molecular structures. Several measures (multi-metrics) to establish (dis)-similarity relations among "three and four" atoms are discussed, as well as, normalization schemes proposed for the n-tuple spatial-(dis)similarity matrices based on the simple-stochastic and mutual probability algebraic transformations. To consider specific interactions among atoms, both for the global and local indices, n-tuple path and length cut-off constraints are introduced. This algebraic scaffold can also be seen as a generalization of the vector-matrix-vector multiplication procedure (which is a matrix representation of the traditional linear, quadratic and bilinear forms) for the calculation of molecular descriptors and is thus a new theoretical approach with a methodological contribution. A variability analysis based on Shannon's entropy reveals that the best distributions are achieved with the ternary and quaternary measures corresponding to the bond and dihedral angles. In addition, the proposed indices have superior entropy behavior than the descriptors calculated by other programs used in chemo-informatics studies, such as, DRAGON, PADEL, Mold2, and so on. A principal component analysis shows that the novel 3D n-tuple indices codify the same information captured by the DRAGON 3D-indices, as well as, information not codified by the latter. A QSAR study to obtain deeper criteria on the contribution of the novel molecular parameters was performed for the binding affinity to the corticosteroid-binding globulin, using Cramer's steroid database. The achieved results reveal superior statistical parameters for the Bond Angle and Dihedral Angle approaches, consistent with the results obtained in variability analysis. Finally, the obtained QuBiLS-MIDAS models yield superior performances than all 3D-QSAR methods reported in the literature using the 31 steroids as training set, and for the popular division of Cramer's database in training (1-21) and test (22-31) sets, comparable to superior results in the prediction of the activity of the steroids are obtained. From the results achieved, it can be suggested that the proposed QuBiLS-MIDAS N-tuples indices are a useful tool to be considered in chemo-informatics studies.

摘要

本手稿首次介绍了一种基于张量概念的新型3D-QSAR无对齐方法(QuBiLS-MIDAS),该方法通过使用三线性和四线性代数形式作为n线性映射的具体实例。为此,分别定义了第k个三元组和四元组空间(不)相似性矩阵,作为三阶和四阶张量,以表示分子结构中“三个和四个”原子之间的三维信息。讨论了几种用于建立“三个和四个”原子之间(不)相似性关系的度量(多指标),以及基于简单随机和互概率代数变换为n元组空间(不)相似性矩阵提出的归一化方案。为了考虑原子之间的特定相互作用,对于全局和局部指标,引入了n元组路径和长度截止约束。这种代数框架也可以看作是向量-矩阵-向量乘法过程(它是传统线性、二次和双线性形式的矩阵表示)的推广,用于计算分子描述符,因此是一种具有方法学贡献的新理论方法。基于香农熵的变异性分析表明,与键角和二面角对应的三元和四元度量可实现最佳分布。此外,所提出的指标比化学信息学研究中使用的其他程序(如DRAGON、PADEL、Mold2等)计算的描述符具有更好的熵行为。主成分分析表明,新型3D n元组指标编码了DRAGON 3D指标捕获的相同信息,以及后者未编码的信息。使用克莱默类固醇数据库,进行了一项QSAR研究,以获得关于新型分子参数贡献的更深入标准,用于与皮质类固醇结合球蛋白的结合亲和力。所取得的结果表明,键角和二面角方法具有更好的统计参数,这与变异性分析中获得的结果一致。最后,使用31种类固醇作为训练集,所获得的QuBiLS-MIDAS模型比文献中报道的所有3D-QSAR方法具有更好的性能,并且对于克莱默数据库在训练集(1-21)和测试集(22-31)中的常见划分,在类固醇活性预测方面获得了可比的优异结果。从所取得的结果可以看出,所提出的QuBiLS-MIDAS N元组指标是化学信息学研究中值得考虑的有用工具。

相似文献

1
N-linear algebraic maps for chemical structure codification: a suitable generalization for atom-pair approaches?用于化学结构编码的N线性代数映射:对原子对方法的适当推广?
Curr Drug Metab. 2014;15(4):441-69. doi: 10.2174/1389200215666140605124506.
2
N-tuple topological/geometric cutoffs for 3D N-linear algebraic molecular codifications: variability, linear independence and QSAR analysis.用于3D N线性代数分子编码的N元组拓扑/几何截止值:变异性、线性独立性和定量构效关系分析。
SAR QSAR Environ Res. 2016 Dec;27(12):949-975. doi: 10.1080/1062936X.2016.1231714. Epub 2016 Oct 6.
3
QuBiLS-MAS, open source multi-platform software for atom- and bond-based topological (2D) and chiral (2.5D) algebraic molecular descriptors computations.QuBiLS-MAS,一款用于基于原子和键的拓扑(二维)和手性(2.5维)代数分子描述符计算的开源多平台软件。
J Cheminform. 2017 Jun 7;9(1):35. doi: 10.1186/s13321-017-0211-5.
4
Examining the predictive accuracy of the novel 3D N-linear algebraic molecular codifications on benchmark datasets.检验新型3D N线性代数分子编码在基准数据集上的预测准确性。
J Cheminform. 2016 Feb 25;8:10. doi: 10.1186/s13321-016-0122-x. eCollection 2016.
5
QuBiLS-MIDAS: a parallel free-software for molecular descriptors computation based on multilinear algebraic maps.QuBiLS-MIDAS:一种基于多元线性代数映射的分子描述符计算并行免费软件。
J Comput Chem. 2014 Jul 5;35(18):1395-409. doi: 10.1002/jcc.23640. Epub 2014 Jun 2.
6
LEGO-based generalized set of two linear algebraic 3D bio-macro-molecular descriptors: Theory and validation by QSARs.基于乐高的两组线性代数 3D 生物大分子描述符的广义集:QSAR 的理论和验证。
J Theor Biol. 2020 Jan 21;485:110039. doi: 10.1016/j.jtbi.2019.110039. Epub 2019 Oct 4.
7
Distributed and multicore QuBiLS-MIDAS software v2.0: Computing chiral, fuzzy, weighted and truncated geometrical molecular descriptors based on tensor algebra.分布式多核 QuBiLS-MIDAS 软件 v2.0:基于张量代数计算手性、模糊、加权和截断的几何分子描述符。
J Comput Chem. 2020 May 5;41(12):1209-1227. doi: 10.1002/jcc.26167. Epub 2020 Feb 14.
8
Tensor algebra-based geometric methodology to codify central chirality on organic molecules.基于张量代数的几何方法对有机分子中的中心手性进行编码。
SAR QSAR Environ Res. 2017 Jun;28(6):541-556. doi: 10.1080/1062936X.2017.1344729. Epub 2017 Jul 14.
9
: A Novel Multiplatform Framework to Compute Tensor Algebra-Based Three-Dimensional Protein Descriptors.: 一种用于计算基于张量代数的三维蛋白质描述符的新型多平台框架。
J Chem Inf Model. 2020 Feb 24;60(2):1042-1059. doi: 10.1021/acs.jcim.9b00629. Epub 2019 Oct 30.
10
When global and local molecular descriptors are more than the sum of its parts: Simple, But Not Simpler?当全局和局部分子描述符不仅仅是其各部分之和时:简单,但不更简单?
Mol Divers. 2020 Nov;24(4):913-932. doi: 10.1007/s11030-019-10002-3. Epub 2019 Oct 28.

引用本文的文献

1
Molecular and Descriptor Spaces for Predicting Initial Rate of Catalytic Homogeneous Quinoline Hydrogenation with Ru, Rh, Os, and Ir Catalysts.用于预测钌、铑、锇和铱催化剂催化均相喹啉氢化初始速率的分子和描述符空间
ACS Omega. 2025 Apr 30;10(18):18312-18331. doi: 10.1021/acsomega.4c09503. eCollection 2025 May 13.
2
Tensor Algebra-based Geometrical (3D) Biomacro-Molecular Descriptors for Protein Research: Theory, Applications and Comparison with other Methods.基于张量代数的生物大分子蛋白质的几何(3D)描述符:理论、应用及与其他方法的比较。
Sci Rep. 2019 Aug 6;9(1):11391. doi: 10.1038/s41598-019-47858-2.
3
Choquet integral-based fuzzy molecular characterizations: when global definitions are computed from the dependency among atom/bond contributions (LOVIs/LOEIs).
基于Choquet积分的模糊分子表征:当根据原子/键贡献之间的依赖性(局部重叠价指数/局部重叠电子指数)计算全局定义时。
J Cheminform. 2018 Oct 25;10(1):51. doi: 10.1186/s13321-018-0306-7.
4
QuBiLS-MAS, open source multi-platform software for atom- and bond-based topological (2D) and chiral (2.5D) algebraic molecular descriptors computations.QuBiLS-MAS,一款用于基于原子和键的拓扑(二维)和手性(2.5维)代数分子描述符计算的开源多平台软件。
J Cheminform. 2017 Jun 7;9(1):35. doi: 10.1186/s13321-017-0211-5.
5
Physico-Chemical and Structural Interpretation of Discrete Derivative Indices on N-Tuples Atoms.N元组原子上离散导数指数的物理化学和结构解释
Int J Mol Sci. 2016 May 27;17(6):812. doi: 10.3390/ijms17060812.
6
Examining the predictive accuracy of the novel 3D N-linear algebraic molecular codifications on benchmark datasets.检验新型3D N线性代数分子编码在基准数据集上的预测准确性。
J Cheminform. 2016 Feb 25;8:10. doi: 10.1186/s13321-016-0122-x. eCollection 2016.