• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于分子动力学模拟数据预测天然橡胶拉伸应力的机器学习框架。

A Machine Learning Framework to Predict the Tensile Stress of Natural Rubber: Based on Molecular Dynamics Simulation Data.

作者信息

Huang Yongdi, Chen Qionghai, Zhang Zhiyu, Gao Ke, Hu Anwen, Dong Yining, Liu Jun, Cui Lihong

机构信息

College of Mathematics and Physics, Beijing University of Chemical Technology, Beijing 100029, China.

State Key Laboratory of Organic-Inorganic Composites, Beijing University of Chemical Technology, Beijing 100029, China.

出版信息

Polymers (Basel). 2022 May 6;14(9):1897. doi: 10.3390/polym14091897.

DOI:10.3390/polym14091897
PMID:35567066
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9103449/
Abstract

Natural rubber (NR), with its excellent mechanical properties, has been attracting considerable scientific and technological attention. Through molecular dynamics (MD) simulations, the effects of key structural factors on tensile stress at the molecular level can be examined. However, this high-precision method is computationally inefficient and time-consuming, which limits its application. The combination of machine learning and MD is one of the most promising directions to speed up simulations and ensure the accuracy of results. In this work, a surrogate machine learning method trained with MD data is developed to predict not only the tensile stress of NR but also other mechanical behaviors. We propose a novel idea based on feature processing by combining our previous experience in performing predictions of small samples. The proposed ML method consists of (i) an extreme gradient boosting (XGB) model to predict the tensile stress of NR, and (ii) a data augmentation algorithm based on nearest-neighbor interpolation (NNI) and the synthetic minority oversampling technique (SMOTE) to maximize the use of limited training data. Among the data enhancement algorithms that we design, the NNI algorithm finally achieves the effect of approaching the original data sample distribution by interpolating at the neighborhood of the original sample, and the SMOTE algorithm is used to solve the problem of sample imbalance by interpolating at the clustering boundaries of minority samples. The augmented samples are used to establish the XGB prediction model. Finally, the robustness of the proposed models and their predictive ability are guaranteed by high performance values, which indicate that the obtained regression models have good internal and external predictive capacities.

摘要

天然橡胶(NR)凭借其优异的机械性能,一直备受科技界的关注。通过分子动力学(MD)模拟,可以在分子水平上研究关键结构因素对拉伸应力的影响。然而,这种高精度方法计算效率低且耗时,限制了其应用。机器学习与MD相结合是加速模拟并确保结果准确性的最有前途的方向之一。在这项工作中,开发了一种用MD数据训练的替代机器学习方法,不仅可以预测NR的拉伸应力,还可以预测其他机械行为。我们结合之前进行小样本预测的经验,提出了一种基于特征处理的新思路。所提出的机器学习方法包括:(i)一个用于预测NR拉伸应力的极端梯度提升(XGB)模型,以及(ii)一种基于最近邻插值(NNI)和合成少数过采样技术(SMOTE)的数据增强算法,以最大限度地利用有限的训练数据。在我们设计的数据增强算法中,NNI算法最终通过在原始样本的邻域进行插值实现了接近原始数据样本分布的效果,而SMOTE算法则通过在少数样本的聚类边界进行插值来解决样本不平衡问题。增强后的样本用于建立XGB预测模型。最后,通过高性能值保证了所提出模型的稳健性及其预测能力,这表明所获得的回归模型具有良好的内部和外部预测能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/673e/9103449/9444443542be/polymers-14-01897-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/673e/9103449/a018f3b78ab2/polymers-14-01897-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/673e/9103449/74734dbab13b/polymers-14-01897-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/673e/9103449/9b44b065dfde/polymers-14-01897-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/673e/9103449/b97c63f25f65/polymers-14-01897-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/673e/9103449/28deff16466e/polymers-14-01897-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/673e/9103449/7b91cdb70c1f/polymers-14-01897-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/673e/9103449/3dc31a3b9bd7/polymers-14-01897-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/673e/9103449/ed39d8cfaf17/polymers-14-01897-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/673e/9103449/9444443542be/polymers-14-01897-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/673e/9103449/a018f3b78ab2/polymers-14-01897-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/673e/9103449/74734dbab13b/polymers-14-01897-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/673e/9103449/9b44b065dfde/polymers-14-01897-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/673e/9103449/b97c63f25f65/polymers-14-01897-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/673e/9103449/28deff16466e/polymers-14-01897-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/673e/9103449/7b91cdb70c1f/polymers-14-01897-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/673e/9103449/3dc31a3b9bd7/polymers-14-01897-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/673e/9103449/ed39d8cfaf17/polymers-14-01897-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/673e/9103449/9444443542be/polymers-14-01897-g009.jpg

相似文献

1
A Machine Learning Framework to Predict the Tensile Stress of Natural Rubber: Based on Molecular Dynamics Simulation Data.一种基于分子动力学模拟数据预测天然橡胶拉伸应力的机器学习框架。
Polymers (Basel). 2022 May 6;14(9):1897. doi: 10.3390/polym14091897.
2
Predicting Natural Rubber Crystallinity by a Novel Machine Learning Algorithm Based on Molecular Dynamics Simulation Data.基于分子动力学模拟数据的新型机器学习算法预测天然橡胶结晶度
Langmuir. 2023 Dec 5;39(48):17088-17099. doi: 10.1021/acs.langmuir.3c01878. Epub 2023 Nov 20.
3
A novel method for detecting credit card fraud problems.一种用于检测信用卡欺诈问题的新方法。
PLoS One. 2024 Mar 6;19(3):e0294537. doi: 10.1371/journal.pone.0294537. eCollection 2024.
4
STB: synthetic minority oversampling technique for tree-boosting models for imbalanced datasets of intrusion detection systems.STB:用于入侵检测系统不平衡数据集的树增强模型的合成少数类过采样技术。
PeerJ Comput Sci. 2023 Nov 27;9:e1580. doi: 10.7717/peerj-cs.1580. eCollection 2023.
5
Social Reminiscence in Older Adults' Everyday Conversations: Automated Detection Using Natural Language Processing and Machine Learning.老年人日常对话中的社会怀旧:使用自然语言处理和机器学习的自动检测。
J Med Internet Res. 2020 Sep 15;22(9):e19133. doi: 10.2196/19133.
6
A hybrid sampling algorithm combining synthetic minority over-sampling technique and edited nearest neighbor for missed abortion diagnosis.一种结合合成少数过采样技术和编辑最近邻的混合采样算法,用于诊断漏诊的流产。
BMC Med Inform Decis Mak. 2022 Dec 29;22(1):344. doi: 10.1186/s12911-022-02075-2.
7
Semi-surrogate modelling of droplets evaporation process via XGBoost integrated CFD simulations.通过XGBoost集成计算流体动力学(CFD)模拟对液滴蒸发过程进行半代理建模。
Sci Total Environ. 2023 Oct 15;895:164968. doi: 10.1016/j.scitotenv.2023.164968. Epub 2023 Jun 24.
8
Combining Resampling Strategies and Ensemble Machine Learning Methods to Enhance Prediction of Neonates with a Low Apgar Score After Induction of Labor in Northern Tanzania.结合重采样策略和集成机器学习方法以增强对坦桑尼亚北部引产术后低阿氏评分新生儿的预测
Risk Manag Healthc Policy. 2021 Sep 7;14:3711-3720. doi: 10.2147/RMHP.S331077. eCollection 2021.
9
DeepStack-DTIs: Predicting Drug-Target Interactions Using LightGBM Feature Selection and Deep-Stacked Ensemble Classifier.DeepStack-DTIs:使用 LightGBM 特征选择和深度堆叠集成分类器预测药物-靶标相互作用。
Interdiscip Sci. 2022 Jun;14(2):311-330. doi: 10.1007/s12539-021-00488-7. Epub 2021 Nov 3.
10
CURE-SMOTE algorithm and hybrid algorithm for feature selection and parameter optimization based on random forests.基于随机森林的用于特征选择和参数优化的CURE-SMOTE算法及混合算法。
BMC Bioinformatics. 2017 Mar 14;18(1):169. doi: 10.1186/s12859-017-1578-z.

引用本文的文献

1
A review of machine learning methods for imbalanced data challenges in chemistry.化学中不平衡数据挑战的机器学习方法综述。
Chem Sci. 2025 Apr 22;16(18):7637-7658. doi: 10.1039/d5sc00270b. eCollection 2025 May 7.

本文引用的文献

1
Mimicking the Mechanical Robustness of Natural Rubber Based on a Sacrificial Network Constructed by Phospholipids.基于由磷脂构建的牺牲网络模拟天然橡胶的机械鲁棒性。
ACS Appl Mater Interfaces. 2020 Mar 25;12(12):14468-14475. doi: 10.1021/acsami.0c01994. Epub 2020 Mar 13.
2
Tailoring the mechanical properties of polymer nanocomposites via interfacial engineering.通过界面工程定制聚合物纳米复合材料的机械性能。
Phys Chem Chem Phys. 2019 Aug 28;21(34):18714-18726. doi: 10.1039/c9cp02948f.
3
Biosynthesis of Natural Rubber: Current State and Perspectives.
天然橡胶的生物合成:现状与展望。
Int J Mol Sci. 2018 Dec 22;20(1):50. doi: 10.3390/ijms20010050.
4
Accelerated discovery of stable lead-free hybrid organic-inorganic perovskites via machine learning.通过机器学习加速发现稳定的无铅混合有机-无机钙钛矿。
Nat Commun. 2018 Aug 24;9(1):3405. doi: 10.1038/s41467-018-05761-w.
5
Machine learning for molecular and materials science.机器学习在分子和材料科学中的应用。
Nature. 2018 Jul;559(7715):547-555. doi: 10.1038/s41586-018-0337-2. Epub 2018 Jul 25.
6
Tailoring the mechanical properties by molecular integration of flexible and stiff polymer networks.通过柔性和刚性聚合物网络的分子集成来调整机械性能。
Soft Matter. 2018 Mar 28;14(12):2379-2390. doi: 10.1039/c7sm02282d. Epub 2018 Mar 5.
7
Machine learning of accurate energy-conserving molecular force fields.机器学习精准节能分子力场。
Sci Adv. 2017 May 5;3(5):e1603015. doi: 10.1126/sciadv.1603015. eCollection 2017 May.
8
Data mining of molecular dynamics data reveals Li diffusion characteristics in garnet LiLaZrO.分子动力学数据的数据挖掘揭示了石榴石 LiLaZrO 中 Li 的扩散特性。
Sci Rep. 2017 Jan 17;7:40769. doi: 10.1038/srep40769.
9
Quantum-chemical insights from deep tensor neural networks.基于深度张量神经网络的量子化学研究。
Nat Commun. 2017 Jan 9;8:13890. doi: 10.1038/ncomms13890.
10
Modeling polymer grafted nanoparticle networks reinforced by high-strength chains.高分子接枝纳米粒子网络的建模,由高强度链增强。
Soft Matter. 2014 Mar 7;10(9):1374-83. doi: 10.1039/c3sm52300d.