• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于机器学习应用的纳米催化剂的X射线吸收光谱数据预处理

XAS Data Preprocessing of Nanocatalysts for Machine Learning Applications.

作者信息

Kartashov Oleg O, Chernov Andrey V, Polyanichenko Dmitry S, Butakova Maria A

机构信息

The Smart Materials Research Institute, Southern Federal University, 178/24 Sladkova, 344090 Rostov-on-Don, Russia.

出版信息

Materials (Basel). 2021 Dec 20;14(24):7884. doi: 10.3390/ma14247884.

DOI:10.3390/ma14247884
PMID:34947477
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8709119/
Abstract

Innovative development in the energy and chemical industries is mainly dependent on advances in the accelerated design and development of new functional materials. The success of research in new nanocatalysts mainly relies on modern techniques and approaches for their precise characterization. The existing methods of experimental characterization of nanocatalysts, which make it possible to assess the possibility of using these materials in specific chemical reactions or applications, generate significant amounts of heterogeneous data. The acceleration of new functional materials, including nanocatalysts, directly depends on the speed and quality of extracting hidden dependencies and knowledge from the obtained experimental data. Usually, such experiments involve different characterization techniques and different types of X-ray absorption spectroscopy (XAS) too. Using the machine learning (ML) methods based on XAS data, we can study and predict the atomic-scale structure and another bunch of parameters for the nanocatalyst efficiently. However, before using any ML model, it is necessary to make sure that the XAS raw experimental data is properly pre-processed, cleared, and prepared for ML application. Usually, the XAS preprocessing stage is vaguely presented in scientific studies, and the main efforts of researchers are devoted to the ML description and implementation stage. However, the quality of the input data influences the quality of ML analysis and the prediction results used in the future. This paper fills the gap between the stage of obtaining XAS data from synchrotron facilities and the stage of using and customizing various ML analysis and prediction models. We aimed this study to develop automated tools for the preprocessing and presentation of data from physical experiments and the creation of deposited datasets on the basis of the example of studying palladium-based nanocatalysts using synchrotron radiation facilities. During the study, methods of preliminary processing of XAS data were considered, which can be conditionally divided into X-ray absorption near edge structure (XANES) and extended X-ray absorption fine structure (EXAFS). This paper proposes a software toolkit that implements data preprocessing scenarios in the form of a single pipeline. The main preprocessing methods used in this study proposed are principal component analysis (PCA); z-score normalization; the interquartile method for eliminating outliers in the data; as well as the k-means machine learning method, which makes it possible to clarify the phase of the studied material sample by clustering feature vectors of experiments. Among the results of this study, one should also highlight the obtained deposited datasets of physical experiments on palladium-based nanocatalysts using synchrotron radiation. This will allow for further high-quality data mining to extract new knowledge about materials using artificial intelligence methods and machine learning models, and will ensure the smooth dissemination of these datasets to researchers and their reuse.

摘要

能源和化工行业的创新发展主要依赖于新型功能材料加速设计与开发方面的进展。新型纳米催化剂的研究成功主要依赖于对其进行精确表征的现代技术和方法。现有的纳米催化剂实验表征方法能够评估这些材料在特定化学反应或应用中的使用可能性,但会产生大量异构数据。新型功能材料(包括纳米催化剂)的加速研发直接取决于从所获实验数据中提取隐藏的相关性和知识的速度与质量。通常,此类实验还涉及不同的表征技术以及不同类型的X射线吸收光谱(XAS)。利用基于XAS数据的机器学习(ML)方法,我们可以高效地研究和预测纳米催化剂的原子尺度结构及其他一系列参数。然而,在使用任何ML模型之前,有必要确保XAS原始实验数据经过适当的预处理、清理,并为ML应用做好准备。通常,XAS预处理阶段在科学研究中阐述得较为模糊,研究人员的主要精力都集中在ML描述和实施阶段。然而,输入数据的质量会影响ML分析的质量以及未来使用的预测结果。本文填补了从同步加速器设施获取XAS数据阶段与使用和定制各种ML分析及预测模型阶段之间的空白。我们开展这项研究的目的是,以利用同步辐射设施研究钯基纳米催化剂为例,开发用于物理实验数据预处理和呈现以及创建沉积数据集的自动化工具。在研究过程中,考虑了XAS数据的初步处理方法,这些方法可按条件分为X射线吸收近边结构(XANES)和扩展X射线吸收精细结构(EXAFS)。本文提出了一个软件工具包,该工具包以单一管道的形式实现数据预处理方案。本研究提出的主要预处理方法有主成分分析(PCA);z分数归一化;用于消除数据中异常值的四分位数法;以及k均值机器学习方法,该方法能够通过对实验特征向量进行聚类来明确所研究材料样品 的相。在本研究结果中,还应突出利用同步辐射获得的钯基纳米催化剂物理实验沉积数据集。这将有助于进一步进行高质量的数据挖掘,以利用人工智能方法和机器学习模型提取有关材料的新知识,并将确保这些数据集顺利传播给研究人员并供其重复使用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e60/8709119/135b94c59a83/materials-14-07884-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e60/8709119/a22867e32b30/materials-14-07884-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e60/8709119/7be181ef6192/materials-14-07884-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e60/8709119/8f69d7275a2c/materials-14-07884-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e60/8709119/fb31af947164/materials-14-07884-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e60/8709119/807501daae5a/materials-14-07884-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e60/8709119/6647a3b6b963/materials-14-07884-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e60/8709119/a98aa873a694/materials-14-07884-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e60/8709119/c93fd679570b/materials-14-07884-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e60/8709119/135b94c59a83/materials-14-07884-g009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e60/8709119/a22867e32b30/materials-14-07884-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e60/8709119/7be181ef6192/materials-14-07884-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e60/8709119/8f69d7275a2c/materials-14-07884-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e60/8709119/fb31af947164/materials-14-07884-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e60/8709119/807501daae5a/materials-14-07884-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e60/8709119/6647a3b6b963/materials-14-07884-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e60/8709119/a98aa873a694/materials-14-07884-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e60/8709119/c93fd679570b/materials-14-07884-g008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0e60/8709119/135b94c59a83/materials-14-07884-g009.jpg

相似文献

1
XAS Data Preprocessing of Nanocatalysts for Machine Learning Applications.用于机器学习应用的纳米催化剂的X射线吸收光谱数据预处理
Materials (Basel). 2021 Dec 20;14(24):7884. doi: 10.3390/ma14247884.
2
Structural Analysis of Single-Atom Catalysts by X-ray Absorption Spectroscopy.利用X射线吸收光谱法对单原子催化剂进行结构分析
Acc Chem Res. 2024 Feb 9. doi: 10.1021/acs.accounts.3c00693.
3
XANES and EXAFS of dilute solutions of transition metals at XFELs.X射线自由电子激光条件下过渡金属稀溶液的X射线吸收近边结构和扩展X射线吸收精细结构
J Synchrotron Radiat. 2019 Sep 1;26(Pt 5):1716-1724. doi: 10.1107/S1600577519007550. Epub 2019 Aug 7.
4
X-ray absorption spectroscopy and X-ray diffraction data for molybdenum minerals and compounds.钼矿物与化合物的X射线吸收光谱和X射线衍射数据。
Data Brief. 2022 Sep 13;45:108576. doi: 10.1016/j.dib.2022.108576. eCollection 2022 Dec.
5
Arsenic mineral and compound data as analyzed by X-ray absorption spectroscopy and X-ray diffraction.通过X射线吸收光谱和X射线衍射分析的砷矿物和化合物数据。
Data Brief. 2024 Jun 19;55:110634. doi: 10.1016/j.dib.2024.110634. eCollection 2024 Aug.
6
Machine learning powered by principal component descriptors as the key for sorted structural fit of XANES.由主成分描述符驱动的机器学习作为X射线吸收近边结构(XANES)排序结构拟合的关键。
Phys Chem Chem Phys. 2021 Sep 7;23(33):17873-17887. doi: 10.1039/d1cp01794b. Epub 2021 Aug 11.
7
The EXAFS family tree: a personal history of the development of extended X-ray absorption fine structure.扩展X射线吸收精细结构(EXAFS)的家族谱系:一段个人发展历程
J Synchrotron Radiat. 1999 May 1;6(Pt 3):123-34. doi: 10.1107/S0909049599001260.
8
Neural network assisted analysis of bimetallic nanocatalysts using X-ray absorption near edge structure spectroscopy.利用X射线吸收近边结构光谱对双金属纳米催化剂进行神经网络辅助分析。
Phys Chem Chem Phys. 2020 Sep 8;22(34):18902-18910. doi: 10.1039/d0cp02098b.
9
/ Electrocatalyst Characterization by X-ray Absorption Spectroscopy./ 利用 X 射线吸收光谱学进行电催化剂表征。
Chem Rev. 2021 Jan 27;121(2):882-961. doi: 10.1021/acs.chemrev.0c00396. Epub 2020 Sep 28.
10
AutoTemplate: enhancing chemical reaction datasets for machine learning applications in organic chemistry.自动模板:增强用于有机化学机器学习应用的化学反应数据集。
J Cheminform. 2024 Jun 27;16(1):74. doi: 10.1186/s13321-024-00869-2.

引用本文的文献

1
A new framework for X-ray absorption spectroscopy data analysis based on machine learning: XASDAML.一种基于机器学习的X射线吸收光谱数据分析新框架:XASDAML
J Synchrotron Radiat. 2025 Sep 1;32(Pt 5):1244-1256. doi: 10.1107/S1600577525005351. Epub 2025 Jul 21.
2
Nanoscience in Action: Unveiling Emerging Trends in Materials and Applications.纳米科学在行动:揭示材料与应用的新兴趋势。
ACS Omega. 2025 Feb 17;10(8):7530-7548. doi: 10.1021/acsomega.4c10929. eCollection 2025 Mar 4.
3
Deep Reinforcement Learning Environment Approach Based on Nanocatalyst XAS Diagnostics Graphic Formalization.

本文引用的文献

1
Hydrogenation of ethylene over palladium: evolution of the catalyst structure by operando synchrotron-based techniques.钯催化乙烯加氢反应:基于同步加速器的原位技术对催化剂结构的演变研究
Faraday Discuss. 2021 May 1;229:197-207. doi: 10.1039/c9fd00139e. Epub 2021 Mar 3.
2
Data-Driven Strategies for Accelerated Materials Design.数据驱动的材料设计加速策略。
Acc Chem Res. 2021 Feb 16;54(4):849-860. doi: 10.1021/acs.accounts.0c00785. Epub 2021 Feb 2.
3
/ Electrocatalyst Characterization by X-ray Absorption Spectroscopy./ 利用 X 射线吸收光谱学进行电催化剂表征。
基于纳米催化剂X射线吸收光谱诊断图形形式化的深度强化学习环境方法
Materials (Basel). 2023 Jul 28;16(15):5321. doi: 10.3390/ma16155321.
4
Iterative multiscale and multi-physics computations for operando catalyst nanostructure elucidation and kinetic modeling.用于原位催化剂纳米结构解析和动力学建模的迭代多尺度和多物理场计算
iScience. 2023 Jun 7;26(7):107029. doi: 10.1016/j.isci.2023.107029. eCollection 2023 Jul 21.
Chem Rev. 2021 Jan 27;121(2):882-961. doi: 10.1021/acs.chemrev.0c00396. Epub 2020 Sep 28.
4
Fast computation of absorption spectra for lidar data processing using principal component analysis.利用主成分分析进行激光雷达数据处理时吸收光谱的快速计算。
Opt Lett. 2019 Apr 15;44(8):1900-1903. doi: 10.1364/OL.44.001900.
5
High-throughput computational X-ray absorption spectroscopy.高通量计算 X 射线吸收光谱学。
Sci Data. 2018 Jul 31;5:180151. doi: 10.1038/sdata.2018.151.
6
A biological perspective towards a standard for information exchange and reporting in XAS.一种关于X射线吸收光谱中信息交换与报告标准的生物学视角。
J Synchrotron Radiat. 2018 Jul 1;25(Pt 4):944-952. doi: 10.1107/S1600577518008779. Epub 2018 Jun 27.
7
Polycapillary Optics Based Confocal Micro X-ray Fluorescence and X-ray Absorption Spectroscopy Setup at The European Synchrotron Radiation Facility Collaborative Research Group Dutch-Belgian Beamline, BM26A.基于多毛细管光学的共焦微 X 射线荧光和 X 射线吸收光谱学装置,在欧洲同步辐射设施荷兰-比利时光束线 BM26A 的合作研究组。
Anal Chem. 2018 Feb 6;90(3):2389-2394. doi: 10.1021/acs.analchem.7b05110. Epub 2018 Jan 22.
8
Principal component analysis: a review and recent developments.主成分分析:综述与最新进展
Philos Trans A Math Phys Eng Sci. 2016 Apr 13;374(2065):20150202. doi: 10.1098/rsta.2015.0202.
9
Outlier Detection for Mass Spectrometric Data.质谱数据的异常值检测
Methods Mol Biol. 2016;1362:91-102. doi: 10.1007/978-1-4939-3106-4_5.
10
Applications of extended X-ray absorption fine-structure spectroscopy to studies of bimetallic nanoparticle catalysts.扩展 X 射线吸收精细结构光谱学在双金属纳米粒子催化剂研究中的应用。
Chem Soc Rev. 2012 Dec 21;41(24):8163-78. doi: 10.1039/c2cs35174a.