利用基于神经网络的低维空间增强光谱分类指标。

Enhancing the classification metrics of spectroscopy spectrums using neural network based low dimensional space.

作者信息

Yousuff Mohamed, Babu Rajasekhara

机构信息

School of Computer Science and Engineering, Vellore Institute of Technology, Vellore Campus, Vellore, 632014 Tamilnadu India.

出版信息

Earth Sci Inform. 2023;16(1):825-844. doi: 10.1007/s12145-022-00917-1. Epub 2022 Dec 23.

DOI:10.1007/s12145-022-00917-1

PMID:36575666

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9782283/

Abstract

Spectroscopy is a methodology for gaining knowledge of particles, especially biomolecules, by quantifying the interactions between matter and light. By examining the level of light absorbed, reflected or released by a specimen, its constituents, properties, and volume can be determined. Spectra obtained through spectroscopy procedures are quick, harmless and contactless; hence nowadays preferred in chemometrics. Due to the high dimensional nature of the spectra, it is challenging to build a robust classifier with good performance metrics. Many linear and nonlinear dimensionality reduction-based classification models have been previously implemented to overcome this issue. However, they lack in capturing the subtle details of the spectra into the low dimension space or cannot efficiently handle the nonlinearity present in the spectral data. We propose a graph-based neural network embedding approach to extract appropriate features into latent space and circumvent the spectrums' nonlinearity problem. Our approach performs dimensionality reduction into two phases: constructing a nearest neighbor graph and producing almost linear embedding using a fully connected neural network. Further, the low dimensional embedding is subjected to classification using the Random Forest algorithm. In this paper, we have implemented and compared our technique with four nonlinear dimensionality techniques widely used for spectral data analysis. In this study, we have considered five different spectral datasets belonging to specific applications. The various classification performance metrics of all the techniques are evaluated. The proposed approach is able to perform competitively well on six different low-dimensional spaces for each dataset with an accuracy score above 95% and Matthew's correlation coefficient value close to 1. The trustworthiness score of almost 1 show that the presented dimensionality reduction approach preserves the closest neighbor structure of high dimensional spectral inputs into latent space.

摘要

光谱学是一种通过量化物质与光之间的相互作用来了解粒子，尤其是生物分子的方法。通过检查样本吸收、反射或释放的光的水平，可以确定其成分、性质和体积。通过光谱学程序获得的光谱快速、无害且非接触式；因此，如今在化学计量学中备受青睐。由于光谱具有高维特性，构建一个具有良好性能指标的强大分类器具有挑战性。此前已经实施了许多基于线性和非线性降维的分类模型来解决这个问题。然而，它们在将光谱的细微细节捕捉到低维空间方面存在不足，或者无法有效处理光谱数据中存在的非线性。我们提出了一种基于图的神经网络嵌入方法，以将适当的特征提取到潜在空间中，并规避光谱的非线性问题。我们的方法分两个阶段进行降维：构建最近邻图并使用全连接神经网络生成近似线性嵌入。此外，使用随机森林算法对低维嵌入进行分类。在本文中，我们已经实现了我们的技术，并将其与广泛用于光谱数据分析的四种非线性降维技术进行了比较。在本研究中，我们考虑了属于特定应用的五个不同光谱数据集。评估了所有技术的各种分类性能指标。所提出的方法能够在每个数据集的六个不同低维空间上具有竞争力地良好运行，准确率得分高于95%，马修斯相关系数值接近1。可信度得分接近1表明所提出的降维方法将高维光谱输入的最邻近结构保留到潜在空间中。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a22e/9782283/08d5d646cfb3/12145_2022_917_Fig1_HTML.jpg

相似文献

Enhancing the classification metrics of spectroscopy spectrums using neural network based low dimensional space.利用基于神经网络的低维空间增强光谱分类指标。

Earth Sci Inform. 2023;16(1):825-844. doi: 10.1007/s12145-022-00917-1. Epub 2022 Dec 23.

Dimensionality Reduction of Hyperspectral Images Based on Improved Spatial-Spectral Weight Manifold Embedding.基于改进的空谱加权流形嵌入的高光谱图像降维。

Sensors (Basel). 2020 Aug 7;20(16):4413. doi: 10.3390/s20164413.

Laser-Induced Breakdown Spectroscopy Combined with Nonlinear Manifold Learning for Improvement Aluminum Alloy Classification Accuracy.激光诱导击穿光谱结合非线性流形学习提高铝合金分类精度。

Sensors (Basel). 2022 Apr 20;22(9):3129. doi: 10.3390/s22093129.

Improved classifier for computer-aided polyp detection in CT colonography by nonlinear dimensionality reduction.通过非线性降维改进的计算机辅助CT结肠成像息肉检测分类器。

Med Phys. 2008 Apr;35(4):1377-86. doi: 10.1118/1.2870218.

A Study on Dimensionality Reduction and Parameters for Hyperspectral Imagery Based on Manifold Learning.基于流形学习的高光谱图像降维和参数研究

Sensors (Basel). 2024 Mar 25;24(7):2089. doi: 10.3390/s24072089.

Consensus embedding: theory, algorithms and application to segmentation and classification of biomedical data.共识嵌入：理论、算法及其在生物医学数据分割和分类中的应用。

BMC Bioinformatics. 2012 Feb 8;13:26. doi: 10.1186/1471-2105-13-26.

Exploring nonlinear feature space dimension reduction and data representation in breast Cadx with Laplacian eigenmaps and t-SNE.探讨基于拉普拉斯特征映射和 t-SNE 的乳腺 CADx 非线性特征空间降维和数据表示。

Med Phys. 2010 Jan;37(1):339-51. doi: 10.1118/1.3267037.

Structural Analysis and Classification of Low-Molecular-Weight Hyaluronic Acid by Near-Infrared Spectroscopy: A Comparison between Traditional Machine Learning and Deep Learning.采用近红外光谱对低分子量透明质酸进行结构分析和分类：传统机器学习与深度学习的比较。

Molecules. 2023 Jan 13;28(2):809. doi: 10.3390/molecules28020809.

Numerically stable locality-preserving partial least squares discriminant analysis for efficient dimensionality reduction and classification of high-dimensional data.用于高维数据有效降维和分类的数值稳定的局部保持偏最小二乘判别分析。

Heliyon. 2024 Feb 12;10(4):e26157. doi: 10.1016/j.heliyon.2024.e26157. eCollection 2024 Feb 29.

Machine Learning and Feature Selection for soil spectroscopy. An evaluation of Random Forest wrappers to predict soil organic matter, clay, and carbonates.用于土壤光谱学的机器学习与特征选择。对随机森林包装器预测土壤有机质、黏土和碳酸盐的评估。

Heliyon. 2024 Apr 25;10(9):e30228. doi: 10.1016/j.heliyon.2024.e30228. eCollection 2024 May 15.

引用本文的文献

Synergistic effects of and biochar on the biocontrol of two soil-borne phytopathogens in chickpeas.[原文中“and biochar”前面缺少具体内容，无法准确翻译完整句子，暂且翻译为] [某种物质]与生物炭对鹰嘴豆中两种土传植物病原体的生物防治协同效应。

Front Microbiol. 2025 May 1;16:1583114. doi: 10.3389/fmicb.2025.1583114. eCollection 2025.

Optimizing multi-spectral ore sorting incorporating wavelength selection utilizing neighborhood component analysis for effective arsenic mineral detection.利用邻域成分分析进行波长选择以优化多光谱矿石分选，实现有效的砷矿物检测。

Sci Rep. 2024 May 21;14(1):11544. doi: 10.1038/s41598-024-62166-0.

本文引用的文献

Integration of surface-enhanced Raman spectroscopy (SERS) and machine learning tools for coffee beverage classification.用于咖啡饮品分类的表面增强拉曼光谱（SERS）与机器学习工具的整合。

Digit Chem Eng. 2022 Jun;3. doi: 10.1016/j.dche.2022.100020. Epub 2022 Mar 12.

Comprehensive examination and comparison of machine learning techniques for the quantitative determination of adulterants in honey using Fourier infrared spectroscopy with attenuated total reflectance accessory.采用傅里叶变换衰减全反射红外光谱法结合化学计量学技术定量检测蜂蜜中掺杂物的综合考察与比较。

Spectrochim Acta A Mol Biomol Spectrosc. 2022 Aug 5;276:121186. doi: 10.1016/j.saa.2022.121186. Epub 2022 Mar 29.

Infra-red spectroscopy combined with machine learning algorithms enables early determination of Pseudomonas aeruginosa's susceptibility to antibiotics.红外光谱结合机器学习算法可实现对铜绿假单胞菌对抗生素敏感性的早期测定。

Spectrochim Acta A Mol Biomol Spectrosc. 2022 Jun 5;274:121080. doi: 10.1016/j.saa.2022.121080. Epub 2022 Feb 26.

Machine learning prediction of lignin content in poplar with Raman spectroscopy.基于拉曼光谱的杨树木质素含量的机器学习预测。

Bioresour Technol. 2022 Mar;348:126812. doi: 10.1016/j.biortech.2022.126812. Epub 2022 Feb 4.

The application of machine-learning and Raman spectroscopy for the rapid detection of edible oils type and adulteration.机器学习和拉曼光谱在食用油类型和掺伪快速检测中的应用。

Food Chem. 2022 Mar 30;373(Pt B):131471. doi: 10.1016/j.foodchem.2021.131471. Epub 2021 Oct 26.

Finding reduced Raman spectroscopy fingerprint of skin samples for melanoma diagnosis through machine learning.通过机器学习找到皮肤样本的拉曼光谱指纹，用于黑色素瘤诊断。

Artif Intell Med. 2021 Oct;120:102161. doi: 10.1016/j.artmed.2021.102161. Epub 2021 Aug 28.

Screening ovarian cancers with Raman spectroscopy of blood plasma coupled with machine learning data processing.采用血浆拉曼光谱结合机器学习数据处理筛查卵巢癌。

Spectrochim Acta A Mol Biomol Spectrosc. 2022 Jan 15;265:120355. doi: 10.1016/j.saa.2021.120355. Epub 2021 Sep 4.

Raman spectroscopy and machine learning for the classification of breast cancers.拉曼光谱和机器学习在乳腺癌分类中的应用。

Spectrochim Acta A Mol Biomol Spectrosc. 2022 Jan 5;264:120300. doi: 10.1016/j.saa.2021.120300. Epub 2021 Aug 21.

A random forest model for the classification of wheat and rye leaf rust symptoms based on pure spectra at leaf scale.一种基于叶片尺度纯光谱的小麦和黑麦叶锈病症状分类随机森林模型。

J Photochem Photobiol B. 2021 Oct;223:112278. doi: 10.1016/j.jphotobiol.2021.112278. Epub 2021 Aug 8.

Infrared spectroscopy combined with random forest to determine tylosin residues in powdered milk.利用红外光谱结合随机森林法测定奶粉中的泰乐菌素残留。

Food Chem. 2021 Dec 15;365:130477. doi: 10.1016/j.foodchem.2021.130477. Epub 2021 Jun 27.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

利用基于神经网络的低维空间增强光谱分类指标。

Enhancing the classification metrics of spectroscopy spectrums using neural network based low dimensional space.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献