Nonnegative principal component analysis for mass spectral serum profiles and biomarker discovery.

Affiliation

Department of Mathematics and Bioinformatics, Eastern Michigan University, Ypsilanti, MI 48109, USA.

Publication information

BMC Bioinformatics. 2010 Jan 18;11 Suppl 1(Suppl 1):S1. doi: 10.1186/1471-2105-11-S1-S1.

DOI:10.1186/1471-2105-11-S1-S1

PMID:20122180

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3009481/

Abstract

BACKGROUND

As a novel cancer diagnostic paradigm, mass spectroscopic serum proteomic pattern diagnostics has been reported to be superior to conventional serologic cancer biomarkers. However, its clinical use has not yet been fully validated. An important factor preventing this young technology from becoming a mainstream cancer diagnostic paradigm is that robustly identifying cancer molecular patterns from high-dimensional protein expression data remains a challenge in machine learning and oncology research. As a well-established dimension reduction technique, principal component analysis (PCA) is widely integrated into pattern recognition analysis to discover cancer molecular patterns. However, its global feature selection mechanism prevents it from capturing local features. This may make high-performance proteomic pattern discovery difficult, because only features that describe global data behavior are used to train a learning machine.

METHODS

In this study, we develop a nonnegative principal component analysis algorithm and present a nonnegative principal component analysis-based support vector machine algorithm with sparse coding to perform high-performance proteomic pattern classification. We also propose a nonnegative principal component analysis-based filter-wrapper biomarker capturing algorithm for mass spectral serum profiles.
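
The abstract gives no algorithmic detail, so the following is only a minimal sketch of the general idea behind a nonnegativity-constrained principal component, not the authors' algorithm. It assumes a simple projected power iteration (multiply by the covariance matrix, clip negative loadings to zero, renormalize) and a toy four-feature dataset. The function names `covariance` and `nonneg_leading_component` are hypothetical, and the support vector machine and sparse coding steps are omitted.

```python
import math
import random


def covariance(rows):
    """Sample covariance matrix of rows (samples x features), as nested lists."""
    n, d = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(d)]
    centered = [[r[j] - means[j] for j in range(d)] for r in rows]
    return [[sum(c[i] * c[j] for c in centered) / (n - 1) for j in range(d)]
            for i in range(d)]


def nonneg_leading_component(cov, iters=200):
    """Projected power iteration: multiply by cov, clip negatives, renormalize.

    Assumption: this is a simple heuristic for a nonnegative leading
    component, used here for illustration only.
    """
    d = len(cov)
    w = [1.0 / math.sqrt(d)] * d  # start from a uniform nonnegative vector
    for _ in range(iters):
        v = [max(0.0, sum(cov[i][j] * w[j] for j in range(d))) for i in range(d)]
        norm = math.sqrt(sum(x * x for x in v))
        if norm == 0.0:  # every loading was clipped; keep the last iterate
            break
        w = [x / norm for x in v]
    return w


# Toy data (an assumption, not from the paper): features 0 and 1 rise with a
# hidden signal s, features 2 and 3 fall with it.
random.seed(0)
data = []
for _ in range(200):
    s = random.gauss(0.0, 1.0)
    noise = [random.gauss(0.0, 0.1) for _ in range(4)]
    data.append([2 * s + noise[0], 2 * s + noise[1], -s + noise[2], -s + noise[3]])

w = nonneg_leading_component(covariance(data))
print([round(x, 3) for x in w])
```

On this toy data, an unconstrained leading principal component would load on all four features with mixed signs, whereas the clipped iteration keeps nonzero weight only on the two positively co-varying features. That is one way to read the abstract's contrast between global and local features.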

RESULTS

We demonstrate the superiority of the proposed algorithm by comparison with six peer algorithms on four benchmark datasets. Moreover, we illustrate that nonnegative principal component analysis can be effectively used to capture meaningful biomarkers.

CONCLUSION

Our analysis suggests that nonnegative principal component analysis effectively conducts local feature selection for mass spectral profiles and contributes to improved sensitivity and specificity in the subsequent classification, as well as to meaningful biomarker discovery.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f21/3009481/baac88a7a119/1471-2105-11-S1-S1-1.jpg

Similar articles

Improving gene expression cancer molecular pattern discovery using nonnegative principal component analysis.

Genome Inform. 2008;21:200-11.

Nonnegative principal component analysis for cancer molecular pattern discovery.

IEEE/ACM Trans Comput Biol Bioinform. 2010 Jul-Sep;7(3):537-49. doi: 10.1109/TCBB.2009.36.

Derivative component analysis for mass spectral serum proteomic profiles.

BMC Med Genomics. 2014;7 Suppl 1(Suppl 1):S5. doi: 10.1186/1755-8794-7-S1-S5. Epub 2014 May 8.

Multi-resolution independent component analysis for high-performance tumor classification and biomarker discovery.

BMC Bioinformatics. 2011 Feb 15;12 Suppl 1(Suppl 1):S7. doi: 10.1186/1471-2105-12-S1-S7.

A high performance profile-biomarker diagnosis for mass spectral profiles.

BMC Syst Biol. 2011;5 Suppl 2(Suppl 2):S5. doi: 10.1186/1752-0509-5-S2-S5. Epub 2011 Dec 14.

Feature selection and nearest centroid classification for protein mass spectrometry.

BMC Bioinformatics. 2005 Mar 23;6:68. doi: 10.1186/1471-2105-6-68.

A novel profile biomarker diagnosis for mass spectral proteomics.

Pac Symp Biocomput. 2014:340-51.

Feature selection and machine learning with mass spectrometry data.

Methods Mol Biol. 2013;1007:237-62. doi: 10.1007/978-1-62703-392-3_10.

A comparison of methods for classifying clinical samples based on proteomics data: a case study for statistical and machine learning approaches.

PLoS One. 2011;6(9):e24973. doi: 10.1371/journal.pone.0024973. Epub 2011 Sep 28.

Cited by

Evaluation of the Biological Effect of a Nicotinamide-Containing Broad-Spectrum Sunscreen on Photodamaged Skin.

Dermatol Ther (Heidelb). 2024 Dec;14(12):3321-3336. doi: 10.1007/s13555-024-01298-7. Epub 2024 Nov 7.

Pseudomonas species prevalence, protein analysis, and antibiotic resistance: an evolving public health challenge.

AMB Express. 2022 May 9;12(1):53. doi: 10.1186/s13568-022-01390-1.

Proteomic characterization and discrimination of Aeromonas species recovered from meat and water samples with a spotlight on the antimicrobial resistance of Aeromonas hydrophila.

Microbiologyopen. 2019 Nov;8(11):e782. doi: 10.1002/mbo3.782. Epub 2019 Jan 6.

Molecular typing of Meningiomas by Desorption Electrospray Ionization Mass Spectrometry Imaging for Surgical Decision-Making.

Int J Mass Spectrom. 2015 Feb 1;377:690-698. doi: 10.1016/j.ijms.2014.06.024.

Overcome support vector machine diagnosis overfitting.

Cancer Inform. 2014 Dec 9;13(Suppl 1):145-58. doi: 10.4137/CIN.S13875. eCollection 2014.

Derivative component analysis for mass spectral serum proteomic profiles.

BMC Med Genomics. 2014;7 Suppl 1(Suppl 1):S5. doi: 10.1186/1755-8794-7-S1-S5. Epub 2014 May 8.

Mass spectrometry imaging as a tool for surgical decision-making.

J Mass Spectrom. 2013 Nov;48(11):1178-87. doi: 10.1002/jms.3295.

A high performance profile-biomarker diagnosis for mass spectral profiles.

BMC Syst Biol. 2011;5 Suppl 2(Suppl 2):S5. doi: 10.1186/1752-0509-5-S2-S5. Epub 2011 Dec 14.

Non-negative matrix factorisation methods for the spectral decomposition of MRS data from human brain tumours.

BMC Bioinformatics. 2012 Mar 8;13:38. doi: 10.1186/1471-2105-13-38.

The Use of Principal Component Analysis in MALDI-TOF MS: a Powerful Tool for Establishing a Mini-optimized Proteomic Profile.

Am J Biomed Sci. 2012;4(1):85-101. doi: 10.5099/aj120100085.
