Duponchel Ludovic, Guerrini Ruggero, Ferreira Victor H C, Llamas César Alvarez, Dujardin Christophe, Motto-Ros Vincent
Université de Lille, CNRS, UMR 8516 - LASIRE - Laboratoire de Spectroscopie pour les Interactions, la Réactivité et l'Environnement, Lille F-59000, France.
Université Claude Bernard Lyon 1, Institut Lumière Matière, UMR 5306, CNRS, Villeurbanne 69622, France.
Anal Chem. 2024 Mar 12;96(10):3994-3998. doi: 10.1021/acs.analchem.3c05724. Epub 2024 Feb 13.
Analytical chemistry has never yielded such a wealth of experimental data as it does today, and this exponential trend shows no sign of abating. We continually advance the capabilities of our instruments and conceive innovative concepts, all in a concerted effort to naturally push the boundaries of our understanding regarding intricate sample matrices. Spectroscopic imaging, in the broadest sense, is certainly the field where we observe this acceleration even more pronouncedly. Analytical chemistry swiftly grasped the significance of processing acquired data for comprehensive exploration through utilization of chemometrics or machine learning tools. One can assert today that chemometrics undeniably constitutes an integral facet in the advancement of an analytical approach. However, we are now faced with a new challenge, as the experimental data accumulated for certain analytical techniques are so vast and massive that exploring them with such tools has become unfeasible, and this is by no means a computational capacity issue. Analytical chemistry is far from being the sole field affected by this issue, and one could argue that others have grappled with it long before us, such as, for instance, social media, to name just one. The purpose of this paper is to demonstrate that such a domain, which may initially seem distant from our concerns, can offer novel tools capable of overcoming these barriers, even though we are not necessarily dealing with the same objects. More specifically, we delve into the clustering of over 10 million LIBS spectra acquired as part of an imaging experiment aimed at exploring a singular rock sample. This will serve to demonstrate that an open-source library developed by Meta (formerly known as Facebook) can enable us to conduct a comprehensive exploration of this sample, a feat deemed impossible with conventional data analysis approaches.
分析化学从未像如今这样产生过如此丰富的实验数据,而且这种指数增长趋势毫无减弱的迹象。我们不断提升仪器的性能,并构思创新概念,所有这些都是为了共同努力自然地拓展我们对复杂样品基质的理解边界。从最广泛的意义上讲,光谱成像无疑是我们更明显地观察到这种加速发展的领域。分析化学迅速认识到利用化学计量学或机器学习工具处理采集到的数据以进行全面探索的重要性。如今可以断言,化学计量学无疑是分析方法发展中不可或缺的一个方面。然而,我们现在面临一个新挑战,因为某些分析技术积累的实验数据量如此庞大,以至于用这些工具对其进行探索已变得不可行,而这绝不是计算能力的问题。分析化学远不是受这个问题影响的唯一领域,有人可能会说其他领域在我们之前很久就已经在应对这个问题了,比如社交媒体,仅举一例。本文的目的是证明,这样一个乍一看似乎与我们的关注点无关的领域,能够提供新颖的工具来克服这些障碍,尽管我们处理的对象不一定相同。更具体地说,我们深入研究了作为一个旨在探索单个岩石样本的成像实验的一部分而采集的超过1000万条激光诱导击穿光谱(LIBS)的聚类。这将证明Meta(前身为Facebook)开发的一个开源库能够使我们对这个样本进行全面探索,而这是传统数据分析方法无法实现的壮举。