Department of Mathematical Sciences, Middle Tennessee State University, Murfreesboro, TN 37132, U.S.A.
Stat Med. 2011 Mar 30;30(7):753-68. doi: 10.1002/sim.4147. Epub 2010 Dec 29.
Imaging mass spectrometry (IMS) shows great potential for the rapid mapping of protein localization and for detecting of sizeable differences in protein expression. However, data processing remains challenging due to the difficulty of analyzing high dimensionality, the fact that the number of predictors is significantly larger than the number of observations, and the need to consider both spectral and spatial information in order to represent the advantage of IMS technology. Ideally one would like to efficiently analyze all acquired data to find trace features based on both spectral and spatial patterns. Therefore, biomarker selection from IMS data is a problem of global optimization. A recently developed regularization and variable selection method,elastic net (EN), produces a sparse model with admirable prediction accuracy and can be an effective tool for IMS data processing. In this paper, we incorporate a spatial penalty term into the EN model and develop anew tool for IMS data biomarker selection and classification. A comprehensive IMS data processing software package, called EN4IMS, is also presented. The results of applying our method to both simulated and real data show that the EN4IMS algorithm works efficiently and effectively for IMS data processing: producing a more precise listing of selected peaks, helping confirmation of new potential biomarkers discovery, and providing more accurate classification results.
成像质谱 (IMS) 显示出在蛋白质定位的快速映射和检测蛋白质表达的显著差异方面的巨大潜力。然而,由于分析高维数据的困难、预测器的数量明显大于观测值的数量,以及需要考虑光谱和空间信息以代表 IMS 技术的优势,数据处理仍然具有挑战性。理想情况下,人们希望能够有效地分析所有采集的数据,以便根据光谱和空间模式找到痕量特征。因此,从 IMS 数据中选择生物标志物是一个全局优化问题。最近开发的一种正则化和变量选择方法——弹性网络 (EN)——产生了一个具有令人钦佩的预测准确性的稀疏模型,并且可以成为 IMS 数据处理的有效工具。在本文中,我们将空间惩罚项纳入 EN 模型,并开发了一种用于 IMS 数据生物标志物选择和分类的新工具。还提出了一个称为 EN4IMS 的综合 IMS 数据处理软件包。将我们的方法应用于模拟和真实数据的结果表明,EN4IMS 算法能够高效、有效地处理 IMS 数据:生成更精确的选定峰列表,有助于确认新的潜在生物标志物的发现,并提供更准确的分类结果。