Surface similarity-based molecular query-retrieval.

Authors

Singh Rahul

Affiliation

Department of Computer Science, San Francisco State University, San Francisco, CA 94132, USA.

Publication information

BMC Cell Biol. 2007 Jul 10;8 Suppl 1(Suppl 1):S6. doi: 10.1186/1471-2121-8-S1-S6.

DOI:10.1186/1471-2121-8-S1-S6
PMID:17634096
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC1924511/
Abstract

BACKGROUND

Discerning the similarity between molecules is a challenging problem in drug discovery as well as in molecular biology. The importance of this problem is due to the fact that the biochemical characteristics of a molecule are closely related to its structure. Therefore molecular similarity is a key notion in investigations targeting exploration of molecular structural space, query-retrieval in molecular databases, and structure-activity modelling. Determining molecular similarity is related to the choice of molecular representation. Currently, representations with high descriptive power and physical relevance like 3D surface-based descriptors are available. Information from such representations is both surface-based and volumetric. However, most techniques for determining molecular similarity tend to focus on idealized 2D graph-based descriptors due to the complexity that accompanies reasoning with more elaborate representations.

RESULTS

This paper addresses the problem of determining similarity when molecules are described using complex surface-based representations. It proposes an intrinsic, spherical representation that systematically maps points on a molecular surface to points on a standard coordinate system (a sphere). Molecular surface properties such as shape, field strengths, and effects due to field super-positioning can then be captured as distributions on the surface of the sphere. Surface-based molecular similarity is subsequently determined by computing the similarity of the surface-property distributions using a novel formulation of histogram-intersection. The similarity formulation is not only sensitive to the 3D distribution of the surface properties, but is also highly efficient to compute.
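The pipeline described above (map surface points to a sphere, accumulate a surface property into a distribution on that sphere, compare distributions by histogram intersection) can be illustrated with a minimal sketch. The abstract does not specify the mapping or the paper's novel intersection formulation, so this sketch makes its own assumptions: points are projected radially from their centroid, binned on a fixed polar/azimuth grid, and compared with the classic normalised histogram intersection (sum of bin-wise minima). The names `spherical_histogram` and `histogram_intersection` and the grid sizes are hypothetical. Unlike the intrinsic representation the paper proposes, this naive projection depends on molecular orientation.

```python
import math
from typing import List, Sequence, Tuple

Point = Tuple[float, float, float]


def spherical_histogram(points: Sequence[Point], values: Sequence[float],
                        n_theta: int = 6, n_phi: int = 12) -> List[float]:
    """Accumulate a non-negative surface property into bins on a sphere.

    Assumption: each surface point is projected radially from the centroid
    of all points; this is an illustrative mapping, not the paper's.
    """
    n = len(points)
    cx = sum(p[0] for p in points) / n
    cy = sum(p[1] for p in points) / n
    cz = sum(p[2] for p in points) / n
    hist = [0.0] * (n_theta * n_phi)
    for (x, y, z), v in zip(points, values):
        dx, dy, dz = x - cx, y - cy, z - cz
        r = math.sqrt(dx * dx + dy * dy + dz * dz)
        if r == 0.0:
            continue  # a point at the centroid has no direction
        theta = math.acos(max(-1.0, min(1.0, dz / r)))   # polar angle
        phi = math.atan2(dy, dx) % (2.0 * math.pi)       # azimuth
        # Clamp indices so boundary angles fall into the last bin.
        i = min(int(theta / math.pi * n_theta), n_theta - 1)
        j = min(int(phi / (2.0 * math.pi) * n_phi), n_phi - 1)
        hist[i * n_phi + j] += v
    total = sum(hist)
    # Normalise so that histograms of different molecules are comparable.
    return [h / total for h in hist] if total > 0 else hist


def histogram_intersection(h1: Sequence[float], h2: Sequence[float]) -> float:
    """Classic histogram intersection: sum of bin-wise minima."""
    return sum(min(a, b) for a, b in zip(h1, h2))


if __name__ == "__main__":
    # Six points on an octahedron stand in for a sampled molecular surface.
    surface = [(1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1)]
    uniform = spherical_histogram(surface, [1.0] * 6)
    lopsided = spherical_histogram(surface, [3.0, 0.0, 1.0, 1.0, 1.0, 0.0])
    print(histogram_intersection(uniform, lopsided))
```

For normalised, non-negative histograms the score lies between 0 (no overlap) and 1 (identical distributions). Each comparison costs one pass over the bins, which is consistent with the efficiency claim in the abstract, though the actual formulation in the paper may differ.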

CONCLUSION

The proposed method obviates the computationally expensive step of molecular pose-optimisation, can incorporate conformational variations, and facilitates highly efficient determination of similarity by directly comparing molecular surfaces and surface-based properties. Retrieval performance, applications in structure-activity modeling of complex biological properties, and comparisons with existing research and commercial methods demonstrate the validity and effectiveness of the approach.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/613a/1924511/9b38542458bd/1471-2121-8-S1-S6-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/613a/1924511/f574a9225d0a/1471-2121-8-S1-S6-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/613a/1924511/c6b7b40d9e26/1471-2121-8-S1-S6-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/613a/1924511/2f437f686bc1/1471-2121-8-S1-S6-4.jpg


