• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于结构的机器学习预测药物 LC 方法开发的保留时间。

Structure Based Machine Learning Prediction of Retention Times for LC Method Development of Pharmaceuticals.

机构信息

Analytical Research & Development, MRL, Merck & Co., Inc., Rahway, NJ, 07065, USA.

出版信息

Pharm Res. 2024 Feb;41(2):365-374. doi: 10.1007/s11095-023-03646-2. Epub 2024 Feb 8.

DOI:10.1007/s11095-023-03646-2
PMID:38332389
Abstract

PURPOSE

Significant resources are spent on developing robust liquid chromatography (LC) methods with optimum conditions for all project in the pipeline. Although, data-driven computer assisted modelling has been implemented to shorten the method development timelines, these modelling approaches require project-specific screening data to model retention time (RT) as function of method parameters. Sometimes method re-development is required, leading to additional investments and redundant laboratory work. Cheminformatics techniques have been successfully used to predict the RT of metabolites & other component mixtures for similar use cases. Here we will show that these techniques can be used to model structurally diverse molecules and predictions of these models trained on multiple LC conditions can be used for downstream data-driven modelling.

METHODS

The Molecular Operating Environment (MOE) was used to calculate over 800 descriptors using the strucutres of the analytes. These descriptors were used to model the RT of the analytes under four chromatographic conditions. These models were then used to create data-driven models using LC-SIM.

RESULTS

A structural-based Random Forest (RF) model outperformed other techniques in cross-validation studies and predicted the RTs of a randomized test set with a median percentage error less than 4% for all LC conditions. RTs predicted by this structure-based model were used to fit a data-driven model that identifies optimum LC conditions without any additional experimental work.

CONCLUSIONS

These results show that small training sets yield pharmaceutically relevant models when used in a combination of structure-based and data-driven model.

摘要

目的

大量资源用于开发具有最佳条件的稳健液相色谱 (LC) 方法,以满足管道中所有项目的需求。尽管已经实施了数据驱动的计算机辅助建模来缩短方法开发时间,但这些建模方法需要项目特定的筛选数据来将保留时间 (RT) 建模为方法参数的函数。有时需要重新开发方法,导致额外的投资和冗余的实验室工作。化学信息学技术已成功用于预测类似用例中代谢物和其他成分混合物的 RT。在这里,我们将展示这些技术可用于建模结构多样的分子,并且可以使用在多种 LC 条件下训练的这些模型的预测值来进行下游数据驱动的建模。

方法

使用分子操作环境 (MOE) 计算了超过 800 个描述符,使用分析物的结构。这些描述符用于在四种色谱条件下对分析物的 RT 进行建模。然后,使用 LC-SIM 为这些模型创建数据驱动模型。

结果

结构基随机森林 (RF) 模型在交叉验证研究中表现优于其他技术,并且对随机测试集的 RT 进行预测,所有 LC 条件下的中位数百分比误差均小于 4%。该结构基模型预测的 RT 用于拟合数据驱动模型,无需进行任何额外的实验工作即可确定最佳 LC 条件。

结论

这些结果表明,当在结构基和数据驱动模型的组合中使用时,小的训练集可以产生具有药物相关性的模型。

相似文献

1
Structure Based Machine Learning Prediction of Retention Times for LC Method Development of Pharmaceuticals.基于结构的机器学习预测药物 LC 方法开发的保留时间。
Pharm Res. 2024 Feb;41(2):365-374. doi: 10.1007/s11095-023-03646-2. Epub 2024 Feb 8.
2
Performance comparison of nonlinear and linear regression algorithms coupled with different attribute selection methods for quantitative structure - retention relationships modelling in micellar liquid chromatography.胶束液相色谱中非线性和线性回归算法与不同属性选择方法相结合的定量结构 - 保留关系建模的性能比较。
J Chromatogr A. 2020 Jul 19;1623:461146. doi: 10.1016/j.chroma.2020.461146. Epub 2020 Apr 29.
3
Cross-column density functional theory-based quantitative structure-retention relationship model development powered by machine learning.基于跨柱密度泛函理论的定量结构-保留关系模型开发:机器学习助力
Anal Bioanal Chem. 2024 May;416(12):2951-2968. doi: 10.1007/s00216-024-05243-7. Epub 2024 Mar 20.
4
Prediction of retention in hydrophilic interaction liquid chromatography using solute molecular descriptors based on chemical structures.基于化学结构的溶质分子描述符预测亲水作用液相色谱中的保留行为。
J Chromatogr A. 2017 Feb 24;1486:59-67. doi: 10.1016/j.chroma.2016.12.025. Epub 2016 Dec 14.
5
Quantitative structure retention relationship (QSRR) modelling for Analytes' retention prediction in LC-HRMS by applying different Machine Learning algorithms and evaluating their performance.应用不同机器学习算法对 LC-HRMS 中分析物保留进行定量结构保留关系 (QSRR) 建模,以预测分析物保留时间,并评估它们的性能。
J Chromatogr B Analyt Technol Biomed Life Sci. 2022 Feb 15;1191:123132. doi: 10.1016/j.jchromb.2022.123132. Epub 2022 Jan 19.
6
Machine learning models and performance dependency on 2D chemical descriptor space for retention time prediction of pharmaceuticals.机器学习模型及其在二维化学描述符空间上的性能对药物保留时间预测的依赖性。
J Chromatogr A. 2024 Aug 16;1730:465109. doi: 10.1016/j.chroma.2024.465109. Epub 2024 Jun 18.
7
Development of a single retention time prediction model integrating multiple liquid chromatography systems: Application to new psychoactive substances.开发一种整合多个液相色谱系统的单一保留时间预测模型:在新精神活性物质中的应用。
Anal Chim Acta. 2021 Nov 1;1184:339035. doi: 10.1016/j.aca.2021.339035. Epub 2021 Sep 6.
8
Prediction of pesticide retention time in reversed-phase liquid chromatography using quantitative-structure retention relationship models: A comparative study of seven molecular descriptors datasets.采用定量结构保留关系模型预测反相液相色谱中的农药保留时间:七种分子描述符数据集的比较研究。
Chemosphere. 2021 Jul;275:130036. doi: 10.1016/j.chemosphere.2021.130036. Epub 2021 Mar 1.
9
Quantitative structure-retention relationships models for prediction of high performance liquid chromatography retention time of small molecules: endogenous metabolites and banned compounds.小分子高效液相色谱保留时间的定量构效关系模型预测:内源性代谢物和禁用化合物。
Anal Chim Acta. 2013 Oct 3;797:13-9. doi: 10.1016/j.aca.2013.08.025. Epub 2013 Aug 20.
10
Towards a chromatographic similarity index to establish localised quantitative structure-retention relationships for retention prediction. II Use of Tanimoto similarity index in ion chromatography.迈向用于建立局部定量结构-保留关系以进行保留预测的色谱相似性指数。II. 塔尼莫托相似性指数在离子色谱中的应用
J Chromatogr A. 2017 Nov 10;1523:173-182. doi: 10.1016/j.chroma.2017.02.054. Epub 2017 Feb 24.

引用本文的文献

1
Active Learning Improves Ionization Efficiency Predictions and Quantification in Nontargeted LC/HRMS.主动学习提高了非靶向液相色谱/高分辨质谱中的电离效率预测和定量分析。
Anal Chem. 2025 Jul 1;97(25):13131-13139. doi: 10.1021/acs.analchem.5c00816. Epub 2025 Jun 13.
2
Predictions of Chromatography Methods by Chemical Structure Similarity to Accelerate High-Throughput Medicinal Chemistry.通过化学结构相似性预测色谱方法以加速高通量药物化学
ACS Med Chem Lett. 2024 Jul 12;15(8):1396-1401. doi: 10.1021/acsmedchemlett.4c00145. eCollection 2024 Aug 8.

本文引用的文献

1
Enantioselective UHPLC Screening Combined with Modeling for Streamlined Development of Ultrafast Enantiopurity Assays.对映体选择性 UHPLC 筛选结合建模,用于超快速对映体纯度分析的简化开发。
Anal Chem. 2022 Jan 25;94(3):1804-1812. doi: 10.1021/acs.analchem.1c04585. Epub 2021 Dec 21.
2
Multifactorial Modeling for Streamlined Development and Optimization of Two-Dimensional Liquid Chromatography.多因素建模用于简化二维液相色谱的开发和优化。
Anal Chem. 2021 Aug 24;93(33):11532-11539. doi: 10.1021/acs.analchem.1c01970. Epub 2021 Aug 10.
3
Graph-based machine learning interprets and predicts diagnostic isomer-selective ion-molecule reactions in tandem mass spectrometry.
基于图的机器学习可解释并预测串联质谱中诊断性异构体选择性离子-分子反应。
Chem Sci. 2020 Oct 5;11(43):11849-11858. doi: 10.1039/d0sc02530e.
4
Perspective on the Future Approaches to Predict Retention in Liquid Chromatography.展望液相色谱保留预测的未来方法。
Anal Chem. 2021 Apr 13;93(14):5653-5664. doi: 10.1021/acs.analchem.0c05078. Epub 2021 Apr 2.
5
Prediction of Liquid Chromatographic Retention Time with Graph Neural Networks to Assist in Small Molecule Identification.利用图神经网络预测液相色谱保留时间以辅助小分子鉴定
Anal Chem. 2021 Feb 2;93(4):2200-2206. doi: 10.1021/acs.analchem.0c04071. Epub 2021 Jan 7.
6
Prediction of Analyte Retention Time in Liquid Chromatography.液相色谱中分析物保留时间的预测
Anal Chem. 2021 Jan 12;93(1):228-256. doi: 10.1021/acs.analchem.0c04190. Epub 2020 Oct 21.
7
Accelerated Reactivity Mechanism and Interpretable Machine Learning Model of -Sulfonylimines toward Fast Multicomponent Reactions.- 磺酰亚胺对快速多组分反应的加速反应机制及可解释机器学习模型。
Org Lett. 2020 Nov 6;22(21):8480-8486. doi: 10.1021/acs.orglett.0c03083. Epub 2020 Oct 19.
8
Retention Index Prediction Using Quantitative Structure-Retention Relationships for Improving Structure Identification in Nontargeted Metabolomics.使用定量结构-保留关系预测保留指数,以改进非靶向代谢组学中的结构鉴定。
Anal Chem. 2018 Aug 7;90(15):9434-9440. doi: 10.1021/acs.analchem.8b02084. Epub 2018 Jul 10.
9
Error measures in quantitative structure-retention relationships studies.定量结构-保留关系研究中的误差度量。
J Chromatogr A. 2017 Nov 17;1524:298-302. doi: 10.1016/j.chroma.2017.09.050. Epub 2017 Sep 23.
10
Identification of small molecules using accurate mass MS/MS search.利用精确质量 MS/MS 搜索鉴定小分子。
Mass Spectrom Rev. 2018 Jul;37(4):513-532. doi: 10.1002/mas.21535. Epub 2017 Apr 24.