• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用大量专利进行药物发现:广泛使用匹配和编辑操作的一般策略。

Drug discovery using very large numbers of patents: general strategy with extensive use of match and edit operations.

机构信息

St Matthews University School of Medicine, Grand Cayman, Cayman Islands, The University of Wisconsin-Stout, Menomonie, USA.

出版信息

J Comput Aided Mol Des. 2011 May;25(5):427-41. doi: 10.1007/s10822-011-9429-x. Epub 2011 May 3.

DOI:10.1007/s10822-011-9429-x
PMID:21538091
Abstract

A patent data base of 6.7 million compounds generated by a very high performance computer (Blue Gene) requires new techniques for exploitation when extensive use of chemical similarity is involved. Such exploitation includes the taxonomic classification of chemical themes, and data mining to assess mutual information between themes and companies. Importantly, we also launch candidates that evolve by "natural selection" as failure of partial match against the patent data base and their ability to bind to the protein target appropriately, by simulation on Blue Gene. An unusual feature of our method is that algorithms and workflows rely on dynamic interaction between match-and-edit instructions, which in practice are regular expressions. Similarity testing by these uses SMILES strings and, less frequently, graph or connectivity representations. Examining how this performs in high throughput, we note that chemical similarity and novelty are human concepts that largely have meaning by utility in specific contexts. For some purposes, mutual information involving chemical themes might be a better concept.

摘要

一个由高性能计算机(Blue Gene)生成的包含 670 万种化合物的专利数据库,在涉及广泛使用化学相似性时,需要新的技术来开发利用。这种开发利用包括化学主题的分类学分类,以及数据挖掘以评估主题和公司之间的互信息。重要的是,我们还通过在 Blue Gene 上的模拟,推出了通过“自然选择”进化的候选物,因为它们与专利数据库的部分匹配失败,以及它们与蛋白质靶标适当结合的能力。我们的方法的一个不寻常的特点是,算法和工作流程依赖于匹配和编辑指令之间的动态交互,这些指令在实践中是正则表达式。这些用法通过 SMILES 字符串进行相似性测试,并且不太频繁地使用图形或连通性表示。在考察这种方法在高通量中的表现时,我们注意到化学相似性和新颖性是人类概念,它们在特定上下文中的实用性方面具有很大的意义。对于某些目的而言,涉及化学主题的互信息可能是一个更好的概念。

相似文献

1
Drug discovery using very large numbers of patents: general strategy with extensive use of match and edit operations.利用大量专利进行药物发现:广泛使用匹配和编辑操作的一般策略。
J Comput Aided Mol Des. 2011 May;25(5):427-41. doi: 10.1007/s10822-011-9429-x. Epub 2011 May 3.
2
Graph edit distance from spectral seriation.基于频谱序列化的图编辑距离。
IEEE Trans Pattern Anal Mach Intell. 2005 Mar;27(3):365-378. doi: 10.1109/TPAMI.2005.56.
3
Self-organizing maps for learning the edit costs in graph matching.用于学习图匹配中编辑成本的自组织映射。
IEEE Trans Syst Man Cybern B Cybern. 2005 Jun;35(3):503-14. doi: 10.1109/tsmcb.2005.846635.
4
Symbol recognition via statistical integration of pixel-level constraint histograms: a new descriptor.通过像素级约束直方图的统计积分进行符号识别:一种新的描述符。
IEEE Trans Pattern Anal Mach Intell. 2005 Feb;27(2):278-81. doi: 10.1109/TPAMI.2005.38.
5
A binary linear programming formulation of the graph edit distance.图编辑距离的二元线性规划公式化表述。
IEEE Trans Pattern Anal Mach Intell. 2006 Aug;28(8):1200-14. doi: 10.1109/TPAMI.2006.152.
6
Research on similarity measurement for texture image retrieval.纹理图像检索的相似性度量研究。
PLoS One. 2012;7(9):e45302. doi: 10.1371/journal.pone.0045302. Epub 2012 Sep 25.
7
Effective image retrieval based on hidden concept discovery in image database.基于图像数据库中隐藏概念发现的有效图像检索
IEEE Trans Image Process. 2007 Feb;16(2):562-72. doi: 10.1109/tip.2006.888350.
8
Polynomial-time metrics for attributed trees.属性树的多项式时间度量。
IEEE Trans Pattern Anal Mach Intell. 2005 Jul;27(7):1087-99. doi: 10.1109/tpami.2005.146.
9
Navigation and discovery in 3D CAD repositories.三维计算机辅助设计知识库中的导航与发现
IEEE Comput Graph Appl. 2007 Jul-Aug;27(4):38-47. doi: 10.1109/mcg.2007.87.
10
The Bayes decision rule induced similarity measures.贝叶斯决策规则诱导的相似性度量。
IEEE Trans Pattern Anal Mach Intell. 2007 Jun;29(6):1086-90. doi: 10.1109/TPAMI.2007.1063.

引用本文的文献

1
The New Coronavirus (SARS-CoV-2): A Comprehensive Review on Immunity and the Application of Bioinformatics and Molecular Modeling to the Discovery of Potential Anti-SARS-CoV-2 Agents.新型冠状病毒(SARS-CoV-2):免疫综述及生物信息学和分子建模在发现潜在抗 SARS-CoV-2 药物中的应用。
Molecules. 2020 Sep 7;25(18):4086. doi: 10.3390/molecules25184086.
2
COVID-19 Coronavirus spike protein analysis for synthetic vaccines, a peptidomimetic antagonist, and therapeutic drugs, and analysis of a proposed achilles' heel conserved region to minimize probability of escape mutations and drug resistance.用于合成疫苗、肽模拟拮抗剂和治疗性药物的 COVID-19 冠状病毒刺突蛋白分析,以及对保守区域阿喀琉斯之踵的分析,以最大程度地降低逃逸突变和耐药性的可能性。
Comput Biol Med. 2020 Jun;121:103749. doi: 10.1016/j.compbiomed.2020.103749. Epub 2020 Apr 11.
3

本文引用的文献

1
SIML: a fast SIMD algorithm for calculating LINGO chemical similarities on GPUs and CPUs.SIML:一种在 GPU 和 CPU 上计算 LINGO 化学相似度的快速 SIMD 算法。
J Chem Inf Model. 2010 Apr 26;50(4):560-4. doi: 10.1021/ci100011z.
2
Protein folding revisited.蛋白质折叠再探讨。
Prog Mol Biol Transl Sci. 2008;84:161-202. doi: 10.1016/S0079-6603(08)00405-4.
3
Clinical and pharmacogenomic data mining: 4. The FANO program and command set as an example of tools for biomedical discovery and evidence based medicine.
Computers and viral diseases. Preliminary bioinformatics studies on the design of a synthetic vaccine and a preventative peptidomimetic antagonist against the SARS-CoV-2 (2019-nCoV, COVID-19) coronavirus.计算机与病毒性疾病。针对 SARS-CoV-2(2019-nCoV,COVID-19)冠状病毒的合成疫苗和预防性肽模拟拮抗剂的设计的初步生物信息学研究。
Comput Biol Med. 2020 Apr;119:103670. doi: 10.1016/j.compbiomed.2020.103670. Epub 2020 Feb 26.
J Proteome Res. 2008 Sep;7(9):3922-47. doi: 10.1021/pr800204f. Epub 2008 Aug 13.
4
Identifying targets for drug discovery using bioinformatics.利用生物信息学确定药物研发的靶点。
Expert Opin Ther Targets. 2008 Apr;12(4):383-9. doi: 10.1517/14728222.12.4.383.
5
Lingos, finite state machines, and fast similarity searching.
J Chem Inf Model. 2006 Sep-Oct;46(5):1912-8. doi: 10.1021/ci6002152.
6
Global mapping of pharmacological space.药理空间的全球图谱。
Nat Biotechnol. 2006 Jul;24(7):805-15. doi: 10.1038/nbt1228.
7
Fast 3D molecular superposition and similarity search in databases of flexible molecules.灵活分子数据库中的快速三维分子叠加与相似性搜索
J Comput Aided Mol Des. 2003 Jan;17(1):13-38. doi: 10.1023/a:1024503712135.
8
Is 11beta-hydroxysteroid dehydrogenase type 1 a therapeutic target? Effects of carbenoxolone in lean and obese Zucker rats.11β-羟基类固醇脱氢酶1型是一个治疗靶点吗?甘珀酸对瘦型和肥胖型 Zucker 大鼠的影响。
J Pharmacol Exp Ther. 2003 Apr;305(1):167-72. doi: 10.1124/jpet.102.044842.
9
Studies in the assessment of folding quality for protein modeling and structure prediction.蛋白质建模与结构预测中折叠质量评估的研究。
J Proteome Res. 2002 Mar-Apr;1(2):115-33. doi: 10.1021/pr0155228.
10
A method for rapidly assessing and refining simple solvent treatments in molecular modelling. Example studies on the antigen-combining loop H2 from FAB fragment McPC603.一种在分子建模中快速评估和优化简单溶剂处理的方法。关于FAB片段McPC603中抗原结合环H2的实例研究。
Protein Eng. 1994 Feb;7(2):221-33. doi: 10.1093/protein/7.2.221.