• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

LinChemIn:SynGraph——一种用于分析和比较合成路线的数据模型及工具包。

LinChemIn: SynGraph-a data model and a toolkit to analyze and compare synthetic routes.

作者信息

Pasquini Marta, Stenta Marco

机构信息

Syngenta Crop Protection AG, Schaffhauserstrasse, 4332, Stein, AG, Switzerland.

出版信息

J Cheminform. 2023 Apr 1;15(1):41. doi: 10.1186/s13321-023-00714-y.

DOI:10.1186/s13321-023-00714-y
PMID:37005691
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10067316/
Abstract

BACKGROUND

The increasing amount of chemical reaction data makes traditional ways to navigate its corpus less effective, while the demand for novel approaches and instruments is rising. Recent data science and machine learning techniques support the development of new ways to extract value from the available reaction data. On the one side, Computer-Aided Synthesis Planning tools can predict synthetic routes in a model-driven approach; on the other side, experimental routes can be extracted from the Network of Organic Chemistry, in which reaction data are linked in a network. In this context, the need to combine, compare and analyze synthetic routes generated by different sources arises naturally.

RESULTS

Here we present LinChemIn, a python toolkit that allows chemoinformatics operations on synthetic routes and reaction networks. Wrapping some third-party packages for handling graph arithmetic and chemoinformatics and implementing new data models and functionalities, LinChemIn allows the interconversion between data formats and data models and enables route-level analysis and operations, including route comparison and descriptors calculation. Object-Oriented Design principles inspire the software architecture, and the modules are structured to maximize code reusability and support code testing and refactoring. The code structure should facilitate external contributions, thus encouraging open and collaborative software development.

CONCLUSIONS

The current version of LinChemIn allows users to combine synthetic routes generated from various tools and analyze them, and constitutes an open and extensible framework capable of incorporating contributions from the community and fostering scientific discussion. Our roadmap envisages the development of sophisticated metrics for routes evaluation, a multi-parameter scoring system, and the implementation of an entire "ecosystem" of functionalities operating on synthetic routes. LinChemIn is freely available at https://github.com/syngenta/linchemin.

摘要

背景

化学反应数据量的不断增加使得传统的浏览其语料库的方法效果越来越差,而对新方法和工具的需求却在不断上升。最近的数据科学和机器学习技术支持开发从现有反应数据中提取价值的新方法。一方面,计算机辅助合成规划工具可以通过模型驱动的方法预测合成路线;另一方面,可以从有机化学网络中提取实验路线,其中反应数据在网络中相互关联。在这种背景下,自然就产生了对不同来源生成的合成路线进行组合、比较和分析的需求。

结果

在此,我们展示了LinChemIn,这是一个Python工具包,可对合成路线和反应网络进行化学信息学操作。通过包装一些用于处理图算法和化学信息学的第三方软件包,并实现新的数据模型和功能,LinChemIn允许数据格式和数据模型之间的相互转换,并支持路线级分析和操作,包括路线比较和描述符计算。面向对象设计原则启发了软件架构,模块的结构设计旨在最大限度地提高代码可重用性,并支持代码测试和重构。代码结构应便于外部贡献,从而鼓励开放和协作的软件开发。

结论

当前版本的LinChemIn允许用户组合从各种工具生成的合成路线并进行分析,构成了一个开放且可扩展的框架,能够纳入社区的贡献并促进科学讨论。我们的路线图设想开发用于路线评估的复杂指标、多参数评分系统,以及实现对合成路线进行操作的整个功能“生态系统”。LinChemIn可在https://github.com/syngenta/linchemin上免费获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c442/10067316/2895e2ef1a07/13321_2023_714_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c442/10067316/b38515045c9c/13321_2023_714_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c442/10067316/08229b23edbc/13321_2023_714_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c442/10067316/f3a405384ee1/13321_2023_714_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c442/10067316/ba8a51e9bafe/13321_2023_714_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c442/10067316/900f06f1598e/13321_2023_714_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c442/10067316/1e2dc7e067e8/13321_2023_714_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c442/10067316/594010e8b5ae/13321_2023_714_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c442/10067316/4e730d3e8c0a/13321_2023_714_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c442/10067316/2895e2ef1a07/13321_2023_714_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c442/10067316/b38515045c9c/13321_2023_714_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c442/10067316/08229b23edbc/13321_2023_714_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c442/10067316/f3a405384ee1/13321_2023_714_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c442/10067316/ba8a51e9bafe/13321_2023_714_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c442/10067316/900f06f1598e/13321_2023_714_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c442/10067316/1e2dc7e067e8/13321_2023_714_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c442/10067316/594010e8b5ae/13321_2023_714_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c442/10067316/4e730d3e8c0a/13321_2023_714_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c442/10067316/2895e2ef1a07/13321_2023_714_Fig9_HTML.jpg

相似文献

1
LinChemIn: SynGraph-a data model and a toolkit to analyze and compare synthetic routes.LinChemIn:SynGraph——一种用于分析和比较合成路线的数据模型及工具包。
J Cheminform. 2023 Apr 1;15(1):41. doi: 10.1186/s13321-023-00714-y.
2
LinChemIn: Route Arithmetic─Operations on Digital Synthetic Routes.林克茵:路线算法─数字综合路线的运算。
J Chem Inf Model. 2024 Mar 25;64(6):1765-1771. doi: 10.1021/acs.jcim.3c01819. Epub 2024 Mar 13.
3
NeuroPycon: An open-source python toolbox for fast multi-modal and reproducible brain connectivity pipelines.NeuroPycon:一个开源的 Python 工具包,用于快速进行多模态和可重复的脑连接管道。
Neuroimage. 2020 Oct 1;219:117020. doi: 10.1016/j.neuroimage.2020.117020. Epub 2020 Jun 6.
4
Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).大分子拥挤现象:化学与物理邂逅生物学(瑞士阿斯科纳,2012年6月10日至14日)
Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2.
5
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
6
ExtractionScore: A Quantitative Framework for Evaluating Synthetic Routes on Predicted Liquid-Liquid Extraction Performance.萃取评分:用于评估预测的液液萃取性能的合成路线的定量框架。
J Chem Inf Model. 2021 May 24;61(5):2274-2282. doi: 10.1021/acs.jcim.0c01426. Epub 2021 Apr 21.
7
Clustering of Synthetic Routes Using Tree Edit Distance.使用树编辑距离对合成路线进行聚类
J Chem Inf Model. 2021 Aug 23;61(8):3899-3907. doi: 10.1021/acs.jcim.1c00232. Epub 2021 Aug 3.
8
The connectome viewer toolkit: an open source framework to manage, analyze, and visualize connectomes.连接组学视图工具包:一个用于管理、分析和可视化连接组学的开源框架。
Front Neuroinform. 2011 Jun 6;5:3. doi: 10.3389/fninf.2011.00003. eCollection 2011.
9
GReNaDIne: A Data-Driven Python Library to Infer Gene Regulatory Networks from Gene Expression Data.GReNaDIne:一个基于数据驱动的 Python 库,用于从基因表达数据中推断基因调控网络。
Genes (Basel). 2023 Jan 20;14(2):269. doi: 10.3390/genes14020269.
10
Machine Learning in Computer-Aided Synthesis Planning.计算机辅助合成规划中的机器学习
Acc Chem Res. 2018 May 15;51(5):1281-1289. doi: 10.1021/acs.accounts.8b00087. Epub 2018 May 1.

引用本文的文献

1
ASKCOS: Open-Source, Data-Driven Synthesis Planning.ASKCOS:开源、数据驱动的合成规划。
Acc Chem Res. 2025 Jun 3;58(11):1764-1775. doi: 10.1021/acs.accounts.5c00155. Epub 2025 May 21.
2
Improving route development using convergent retrosynthesis planning.使用汇聚逆合成规划改进路线开发。
J Cheminform. 2025 Feb 27;17(1):26. doi: 10.1186/s13321-025-00953-1.
3
AiZynthFinder 4.0: developments based on learnings from 3 years of industrial application.艾辛思寻径器4.0:基于三年工业应用经验的发展成果

本文引用的文献

1
Predictive chemistry: machine learning for reaction deployment, reaction development, and reaction discovery.预测化学:用于反应部署、反应开发和反应发现的机器学习
Chem Sci. 2022 Nov 28;14(2):226-244. doi: 10.1039/d2sc05089g. eCollection 2023 Jan 4.
2
Merging enzymatic and synthetic chemistry with computational synthesis planning.将酶学和合成化学与计算合成规划相结合。
Nat Commun. 2022 Dec 14;13(1):7747. doi: 10.1038/s41467-022-35422-y.
3
Reaction classification and yield prediction using the differential reaction fingerprint DRFP.
J Cheminform. 2024 May 23;16(1):57. doi: 10.1186/s13321-024-00860-x.
使用微分反应指纹DRFP进行反应分类和产率预测。
Digit Discov. 2022 Jan 21;1(2):91-97. doi: 10.1039/d1dd00006c. eCollection 2022 Apr 11.
4
Computational tools and resources for designing new pathways to small molecules.设计小分子新途径的计算工具和资源。
Curr Opin Biotechnol. 2022 Aug;76:102722. doi: 10.1016/j.copbio.2022.102722. Epub 2022 Apr 26.
5
Machine Learning Applications for Chemical Reactions.机器学习在化学反应中的应用。
Chem Asian J. 2022 Jul 15;17(14):e202200203. doi: 10.1002/asia.202200203. Epub 2022 May 30.
6
Making the collective knowledge of chemistry open and machine actionable.使化学的集体知识开放并可用于机器操作。
Nat Chem. 2022 Apr;14(4):365-376. doi: 10.1038/s41557-022-00910-7. Epub 2022 Apr 4.
7
Autonomous Reaction Network Exploration in Homogeneous and Heterogeneous Catalysis.均相和多相催化中的自主反应网络探索
Top Catal. 2022;65(1-4):6-39. doi: 10.1007/s11244-021-01543-9. Epub 2022 Jan 13.
8
Biocatalysed synthesis planning using data-driven learning.基于数据驱动学习的生物催化合成规划。
Nat Commun. 2022 Feb 18;13(1):964. doi: 10.1038/s41467-022-28536-w.
9
Uncertainty-aware prediction of chemical reaction yields with graph neural networks.基于图神经网络的化学反应产率不确定性感知预测。
J Cheminform. 2022 Jan 10;14(1):2. doi: 10.1186/s13321-021-00579-z.
10
The Evolution of Data-Driven Modeling in Organic Chemistry.有机化学中数据驱动建模的发展历程
ACS Cent Sci. 2021 Oct 27;7(10):1622-1637. doi: 10.1021/acscentsci.1c00535. Epub 2021 Oct 19.