Clough Madeline E, Ochoa Rivera Eduardo, Parham Rebecca L, Ault Andrew P, Zimmerman Paul M, McNeil Anne J, Tewari Ambuj
Department of Chemistry, University of Michigan, 930 North University Avenue, Ann Arbor, Michigan 48109-1055, United States.
Department of Statistics, University of Michigan, 1085 South University Avenue, Ann Arbor, Michigan 48109-1055, United States.
Environ Sci Technol. 2024 Dec 10;58(49):21740-21749. doi: 10.1021/acs.est.4c05167. Epub 2024 Nov 26.
Microplastics are an emerging pollutant of concern, with environmental observations recorded across the world. Identifying the type of microplastic is challenging due to spectral similarities among the most common polymers, necessitating methods that can confidently distinguish plastic identities. In practice, a researcher chooses the reference vibrational spectrum that is most like the unknown spectrum, where the likeness between the two spectra is expressed numerically as the hit quality index (HQI). Despite the widespread use of HQI thresholds in the literature, acceptance of a spectral label often lacks any associated confidence. To address this gap, we apply a machine-learning framework called conformal prediction to output a set of possible labels that contain the true identity of the unknown spectrum with a user-defined probability (e.g., 90%). Microplastic reference libraries of environmentally aged and pristine polymeric materials, as well as unknown environmental plastic spectra, were employed to illustrate the benefits of this approach when used with two similarity metrics to compute HQI. We present an adaptable workflow using our open-access code to ensure spectral matching confidence for the microplastic community, reducing manual inspection of spectral matches and enhancing the robustness of quantification in the field.
微塑料是一种新出现的受关注污染物,在全球范围内都有相关环境观测记录。由于最常见聚合物之间的光谱相似性,识别微塑料的类型具有挑战性,因此需要能够可靠区分塑料身份的方法。在实际操作中,研究人员会选择与未知光谱最相似的参考振动光谱,这两种光谱之间的相似性通过命中质量指数(HQI)以数字形式表示。尽管文献中广泛使用HQI阈值,但对光谱标签的接受往往缺乏任何相关的置信度。为了弥补这一差距,我们应用了一种名为共形预测的机器学习框架,以输出一组可能的标签,这些标签以用户定义的概率(例如90%)包含未知光谱的真实身份。使用环境老化和原始聚合物材料的微塑料参考库以及未知环境塑料光谱,来说明该方法与两种相似性度量一起用于计算HQI时的优势。我们使用开放获取代码展示了一个适应性工作流程,以确保微塑料领域的光谱匹配置信度,减少对光谱匹配的人工检查,并增强现场量化的稳健性。