Rational partitioning of spectral feature space for effective clustering of massive spectral image data.

Author information

Ito Yusei, Takeichi Yasuo, Hino Hideitsu, Ono Kanta

Affiliations

Department of Applied Physics, Osaka University, 2-1 Yamadaoka, Suita, 565-0871, Osaka, Japan.

The Institute of Statistical Mathematics, 10-3 Midori-cho, Tachikawa, Tokyo, 190-8562, Japan.

Publication information

Sci Rep. 2024 Sep 29;14(1):22549. doi: 10.1038/s41598-024-74016-0.

Abstract

We propose and demonstrate a clustering method that overcomes the "needle-in-a-haystack problem": finding minuscule but important regions in massive spectral image data sets. This problem is of central importance in the characterization of materials because, in bulk materials, the properties of a very small region often dominate the overall function. To solve it, we propose that rational partitioning of the spectral feature space in which spectra are distributed, or the definition of the decision boundaries for clustering, can be performed by focusing on the discrimination limit defined by the measurement noise and partitioning the space at intervals of this limit. We verified the proposed method, applied it to actual measurement data, and succeeded in detecting tiny (~0.5%) important regions that were difficult for human researchers and other machine learning methods to detect when searching for unknown phases. The ability to detect these crucial regions helps in understanding materials and in designing more functional materials.
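
The idea of partitioning the feature space at intervals of a noise-defined limit can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `partition_by_noise_limit`, the axis-aligned grid, the choice of the discrimination limit as `k` times the noise standard deviation, and the synthetic two-dimensional features are all assumptions made for illustration.

```python
import numpy as np

def partition_by_noise_limit(features, noise_sigma, k=3.0):
    """Label spectra by the grid cell they fall into.

    features    : (n_spectra, n_features) array of spectral features
    noise_sigma : measurement-noise standard deviation (assumed known)
    k           : hypothetical factor turning noise into a discrimination limit
    """
    features = np.asarray(features, dtype=float)
    limit = k * noise_sigma  # assumed discrimination limit
    # Integer index of the cell along every feature axis.
    cells = np.floor(features / limit).astype(np.int64)
    # Spectra sharing the same cell index receive the same label.
    _, labels = np.unique(cells, axis=0, return_inverse=True)
    return labels.reshape(-1)

# Synthetic example: a dominant phase plus a 0.5% minority phase.
rng = np.random.default_rng(0)
sigma = 0.01
major = rng.normal(0.015, sigma, size=(995, 2))
minor = rng.normal(0.495, sigma, size=(5, 2))
features = np.vstack([major, minor])

labels = partition_by_noise_limit(features, sigma)
major_labels = set(labels[:995].tolist())
minor_labels = set(labels[995:].tolist())
# The minority spectra never share a cell with the dominant phase.
print(minor_labels.isdisjoint(major_labels))
```

Because the cell size is fixed by the noise level rather than by cluster population, the five minority spectra keep their own labels however large the dominant phase is. A fixed grid can also split a single phase across neighbouring cells, so a practical method would need a further step to handle that.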


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/db44/11439947/e587397b36fc/41598_2024_74016_Fig1_HTML.jpg
