• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于泥岩中有机生物标志物分析的电子显微镜图像与飞行时间二次离子质谱的机器学习相关性

Machine Learning Correlation of Electron Micrographs and ToF-SIMS for the Analysis of Organic Biomarkers in Mudstone.

作者信息

Pasterski Michael J, Lorenz Matthias, Ievlev Anton V, Wickramasinghe Raveendra C, Hanley Luke, Kenig Fabien

机构信息

Department of Earth and Environmental Sciences, University of Illinois Chicago, Chicago, Illinois 60607, United States.

Center for Nanophase Materials Sciences, Oak Ridge National Laboratory, Oak Ridge, Tennessee 37830, United States.

出版信息

J Am Soc Mass Spectrom. 2025 Jan 1;36(1):58-71. doi: 10.1021/jasms.4c00300. Epub 2024 Dec 19.

DOI:10.1021/jasms.4c00300
PMID:39698945
Abstract

The spatial distribution of organics in geological samples can be used to determine when and how these organics were incorporated into the host rock. Mass spectrometry (MS) imaging can rapidly collect a large amount of data, but ions produced are mixed without discrimination, resulting in complex mass spectra that can be difficult to interpret. Here, we apply unsupervised and supervised machine learning (ML) to help interpret spectra from time-of-flight-secondary ion mass spectrometry (ToF-SIMS) of an organic-carbon-rich mudstone of the Middle Jurassic of England (UK). It was previously shown that the presence of sterane molecular biomarkers in this sample can be detected via ToF-SIMS (Pasterski, M. J. et al., 2023, 23, 936). We use unsupervised ML on scanning electron microscopy-electron dispersive spectroscopy (SEM-EDS) measurements to define compositional categories based on differences in elemental abundances. We then test the ability of four ML algorithms─k-nearest neighbors (KNN), recursive partitioning and regressive trees (RPART), eXtreme gradient boost (XGBoost), and random forest (RF)─to classify the ToF-SIM spectra using (1) the categories assigned via SEM-EDS, (2) organic and inorganic labels assigned via SEM-EDS, and (3) the presence or absence of detectable steranes in ToF-SIMS spectra. In terms of predictive accuracy and balanced accuracy, KNN was the best performing model and RPART the worst. The feature importance, or the specific features of the ToF-SIM spectra used by the models to make classifications, cannot be determined for KNN, preventing posthoc model interpretation. Nevertheless, the feature importance extracted from the other models was useful for interpreting spectra. We determined that some of the organic ions used to classify biomarker containing spectra may be fragment ions derived from kerogen which is abundant in this mudstone sample.

摘要

地质样品中有机物的空间分布可用于确定这些有机物何时以及如何被纳入母岩。质谱成像(MS)可以快速收集大量数据,但产生的离子是混合的,没有区分,导致质谱复杂,难以解释。在这里,我们应用无监督和有监督的机器学习(ML)来帮助解释来自英国英格兰中侏罗世富有机碳泥岩的飞行时间二次离子质谱(ToF-SIMS)的光谱。此前研究表明,通过ToF-SIMS可以检测到该样品中甾烷分子生物标志物的存在(帕斯特斯基,M. J. 等人,2023年,23卷,936页)。我们对扫描电子显微镜-电子色散光谱(SEM-EDS)测量数据使用无监督机器学习,根据元素丰度差异定义成分类别。然后,我们测试了四种机器学习算法——k近邻(KNN)、递归划分和回归树(RPART)、极端梯度提升(XGBoost)和随机森林(RF)——使用(1)通过SEM-EDS分配的类别、(2)通过SEM-EDS分配的有机和无机标签以及(3)ToF-SIMS光谱中可检测甾烷的存在与否对ToF-SIM光谱进行分类的能力。在预测准确性和平衡准确性方面,KNN是表现最佳的模型,而RPART是最差的。对于KNN,无法确定模型用于进行分类的ToF-SIM光谱的特征重要性,这妨碍了事后模型解释。然而,从其他模型中提取的特征重要性对于解释光谱很有用。我们确定,一些用于对含有生物标志物的光谱进行分类的有机离子可能是来自该泥岩样品中丰富的干酪根的碎片离子。

相似文献

1
Machine Learning Correlation of Electron Micrographs and ToF-SIMS for the Analysis of Organic Biomarkers in Mudstone.用于泥岩中有机生物标志物分析的电子显微镜图像与飞行时间二次离子质谱的机器学习相关性
J Am Soc Mass Spectrom. 2025 Jan 1;36(1):58-71. doi: 10.1021/jasms.4c00300. Epub 2024 Dec 19.
2
The Determination of the Spatial Distribution of Indigenous Lipid Biomarkers in an Immature Jurassic Sediment Using Time-of-Flight-Secondary Ion Mass Spectrometry.运用飞行时间二次离子质谱法测定未成熟侏罗纪沉积物中本土脂质生物标志物的空间分布。
Astrobiology. 2023 Sep;23(9):936-950. doi: 10.1089/ast.2022.0145. Epub 2023 Jul 17.
3
Extensive FE-SEM/EDS, HR-TEM/EDS and ToF-SIMS studies of micron- to nano-particles in anthracite fly ash.对无烟煤灰中微米至纳米颗粒的 FE-SEM/EDS、HR-TEM/EDS 和 ToF-SIMS 广泛研究。
Sci Total Environ. 2013 May 1;452-453:98-107. doi: 10.1016/j.scitotenv.2013.02.010. Epub 2013 Mar 15.
4
Applications of multivariate analysis and unsupervised machine learning to ToF-SIMS images of organic, bioorganic, and biological systems.多元分析和无监督机器学习在有机、生物有机和生物系统的飞行时间二次离子质谱图像中的应用。
Biointerphases. 2022 Mar 28;17(2):020802. doi: 10.1116/6.0001590.
5
Investigating activated sludge flocs using microanalytical techniques: demonstration of environmental scanning electron microscopy and time-of-flight secondary ion mass spectrometry for wastewater applications.
Water Environ Res. 2006 Apr;78(4):381-91. doi: 10.2175/106143005x90092.
6
Development of Peptide Identification System for ToF-SIMS Spectra Using Supervised Machine Learning.基于监督式机器学习的飞行时间二次离子质谱光谱肽鉴定系统的开发
J Am Soc Mass Spectrom. 2024 Dec 4;35(12):3057-3062. doi: 10.1021/jasms.4c00310. Epub 2024 Oct 12.
7
The Application of a Random Forest Classifier to ToF-SIMS Imaging Data.随机森林分类器在飞行时间二次离子质谱成像数据中的应用。
J Am Soc Mass Spectrom. 2024 Dec 4;35(12):2801-2814. doi: 10.1021/jasms.4c00324. Epub 2024 Oct 25.
8
Characterisation of 0.22 caliber rimfire gunshot residues by time-of-flight secondary ion mass spectrometry (TOF-SIMS): a preliminary study.通过飞行时间二次离子质谱法(TOF-SIMS)对0.22口径边缘发火枪弹残留物的表征:一项初步研究。
Forensic Sci Int. 2001 Jun 1;119(1):72-81. doi: 10.1016/s0379-0738(00)00421-7.
9
Evaluation of Time-of-Flight Secondary Ion Mass Spectrometry Spectra of Peptides by Random Forest with Amino Acid Labels: Results from a Versailles Project on Advanced Materials and Standards Interlaboratory Study.采用氨基酸标签的随机森林算法对肽段飞行时间二次离子质谱图谱的评估:来自于一个关于先进材料和标准的凡尔赛项目的实验室间研究结果。
Anal Chem. 2021 Mar 9;93(9):4191-4197. doi: 10.1021/acs.analchem.0c04577. Epub 2021 Feb 26.
10
Quantitative analysis of ToF-SIMS data of a two organic compound mixture using an autoencoder and simple artificial neural networks.使用自动编码器和简单人工神经网络对两种有机化合物混合物的 ToF-SIMS 数据进行定量分析。
Rapid Commun Mass Spectrom. 2023 Feb 28;37(4):e9445. doi: 10.1002/rcm.9445.