Southwest Research Institute, San Antonio, Texas 78228, United States.
Million Marker Wellness, Inc., Berkeley, California 94708, United States.
J Am Soc Mass Spectrom. 2023 Aug 2;34(8):1653-1662. doi: 10.1021/jasms.3c00107. Epub 2023 Jul 6.
This work demonstrates high-throughput screening of personal care products to provide an overview of potential exposure. Sixty-seven products from five categories (body/fragrance oil, cleaning product, hair care, hand/body wash, lotion, sunscreen) were rapidly extracted and then analyzed using suspect screening by two-dimensional gas chromatography (GCxGC) high-resolution mass spectrometry (GCxGC-HRT). Initial peak finding and integration were performed using commercial software, followed by batch processing using the machine learning program Highlight. Highlight automatically performs background subtraction, chromatographic alignment, signal quality review, multidilution aggregation, peak grouping, and iterative integration. This data set resulted in 2,195 compound groups and 43,713 individual detections. Compounds of concern (101) were downselected and classified as mild irritants (29%), environmental toxicants/severe irritants (51%) and endocrine disrupting chemicals/carcinogens (20%). High risk compounds such as phthalates, parabens, and avobenzone were detected in 46 out of the 67 products (69%), and only 5 out of the 67 products (7%) listed these compounds on their ingredient labels. The Highlight results for the compounds of concern were compared to commercial software results (ChromaTOF) and 5.3% of the individual detections were discerned only by Highlight, demonstrating the strength of the iterative algorithm to effectively discover low-level signatures. Highlight provides a significant labor advantage, requiring only 2.6% of the time estimated for a largely manual workflow using commercial software. In order to address significant time needed for postprocessing assignment of identification confidence, a new machine-learning-based algorithm was developed to assess the quality of assigned library matches, and a balanced accuracy of 79% was achieved.
本研究通过二维气相色谱-高分辨质谱联用(GCxGC-HRMS)的可疑筛查对 5 类(体香/油、清洁产品、头发护理、手/身体洗液、乳液、防晒霜)67 种个人护理产品进行高通量筛选,以提供潜在暴露的概述。初始峰发现和积分使用商业软件完成,然后使用机器学习程序 Highlight 进行批处理。Highlight 自动执行背景扣除、色谱对齐、信号质量检查、多次稀释聚集、峰分组和迭代积分。该数据集得到 2195 个化合物组和 43713 个单体检测。关注化合物(101)被降级选择并分为轻度刺激物(29%)、环境毒物/严重刺激物(51%)和内分泌干扰化学物质/致癌物(20%)。在 67 种产品中的 46 种(69%)中检测到邻苯二甲酸酯、对羟基苯甲酸酯和阿伏苯宗等高风险化合物,而在 67 种产品中只有 5 种(7%)在其成分标签上列出了这些化合物。关注化合物的 Highlight 结果与商业软件结果(ChromaTOF)进行了比较,有 5.3%的单体检测仅通过 Highlight 发现,这证明了迭代算法在有效发现低水平特征方面的强大优势。Highlight 大大节省了劳动力,仅需使用商业软件的手动工作流程的 2.6%的时间。为了解决后处理鉴定置信度分配所需的大量时间,开发了一种新的基于机器学习的算法来评估分配库匹配的质量,实现了 79%的平衡准确率。