Suppr超能文献

基于非负主成分分析的血清质谱轮廓研究和生物标志物发现。

Nonnegative principal component analysis for mass spectral serum profiles and biomarker discovery.

机构信息

Department of Mathematics and Bioinformatics, Eastern Michigan University, Ypsilanti, MI 48109, USA.

出版信息

BMC Bioinformatics. 2010 Jan 18;11 Suppl 1(Suppl 1):S1. doi: 10.1186/1471-2105-11-S1-S1.

Abstract

BACKGROUND

As a novel cancer diagnostic paradigm, mass spectroscopic serum proteomic pattern diagnostics was reported superior to the conventional serologic cancer biomarkers. However, its clinical use is not fully validated yet. An important factor to prevent this young technology to become a mainstream cancer diagnostic paradigm is that robustly identifying cancer molecular patterns from high-dimensional protein expression data is still a challenge in machine learning and oncology research. As a well-established dimension reduction technique, PCA is widely integrated in pattern recognition analysis to discover cancer molecular patterns. However, its global feature selection mechanism prevents it from capturing local features. This may lead to difficulty in achieving high-performance proteomic pattern discovery, because only features interpreting global data behavior are used to train a learning machine.

METHODS

In this study, we develop a nonnegative principal component analysis algorithm and present a nonnegative principal component analysis based support vector machine algorithm with sparse coding to conduct a high-performance proteomic pattern classification. Moreover, we also propose a nonnegative principal component analysis based filter-wrapper biomarker capturing algorithm for mass spectral serum profiles.

RESULTS

We demonstrate the superiority of the proposed algorithm by comparison with six peer algorithms on four benchmark datasets. Moreover, we illustrate that nonnegative principal component analysis can be effectively used to capture meaningful biomarkers.

CONCLUSION

Our analysis suggests that nonnegative principal component analysis effectively conduct local feature selection for mass spectral profiles and contribute to improving sensitivities and specificities in the following classification, and meaningful biomarker discovery.

摘要

背景

作为一种新型的癌症诊断范例,质谱血清蛋白质组模式诊断被报道优于传统的血清癌症生物标志物。然而,其临床应用尚未得到充分验证。防止这项年轻技术成为主流癌症诊断范例的一个重要因素是,从高维蛋白质表达数据中稳健地识别癌症分子模式仍然是机器学习和肿瘤学研究中的一个挑战。作为一种成熟的降维技术,PCA 广泛集成在模式识别分析中,以发现癌症分子模式。然而,其全局特征选择机制阻止了它捕获局部特征。这可能导致难以实现高性能蛋白质组模式发现,因为仅使用解释全局数据行为的特征来训练学习机。

方法

在本研究中,我们开发了一种非负主成分分析算法,并提出了一种基于非负主成分分析的支持向量机稀疏编码算法,用于进行高性能蛋白质组模式分类。此外,我们还提出了一种基于非负主成分分析的过滤包装生物标志物捕获算法,用于质谱血清谱。

结果

我们通过与六个同行算法在四个基准数据集上的比较,证明了所提出算法的优越性。此外,我们还表明,非负主成分分析可以有效地用于捕获有意义的生物标志物。

结论

我们的分析表明,非负主成分分析可以有效地对质谱图谱进行局部特征选择,有助于提高后续分类的灵敏度和特异性,并有助于有意义的生物标志物发现。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4f21/3009481/baac88a7a119/1471-2105-11-S1-S1-1.jpg

相似文献

1
Nonnegative principal component analysis for mass spectral serum profiles and biomarker discovery.
BMC Bioinformatics. 2010 Jan 18;11 Suppl 1(Suppl 1):S1. doi: 10.1186/1471-2105-11-S1-S1.
3
Nonnegative principal component analysis for cancer molecular pattern discovery.
IEEE/ACM Trans Comput Biol Bioinform. 2010 Jul-Sep;7(3):537-49. doi: 10.1109/TCBB.2009.36.
4
Derivative component analysis for mass spectral serum proteomic profiles.
BMC Med Genomics. 2014;7 Suppl 1(Suppl 1):S5. doi: 10.1186/1755-8794-7-S1-S5. Epub 2014 May 8.
5
Multi-resolution independent component analysis for high-performance tumor classification and biomarker discovery.
BMC Bioinformatics. 2011 Feb 15;12 Suppl 1(Suppl 1):S7. doi: 10.1186/1471-2105-12-S1-S7.
6
A high performance profile-biomarker diagnosis for mass spectral profiles.
BMC Syst Biol. 2011;5 Suppl 2(Suppl 2):S5. doi: 10.1186/1752-0509-5-S2-S5. Epub 2011 Dec 14.
7
Feature selection and nearest centroid classification for protein mass spectrometry.
BMC Bioinformatics. 2005 Mar 23;6:68. doi: 10.1186/1471-2105-6-68.
9
Feature selection and machine learning with mass spectrometry data.
Methods Mol Biol. 2013;1007:237-62. doi: 10.1007/978-1-62703-392-3_10.

引用本文的文献

1
Evaluation of the Biological Effect of a Nicotinamide-Containing Broad-Spectrum Sunscreen on Photodamaged Skin.
Dermatol Ther (Heidelb). 2024 Dec;14(12):3321-3336. doi: 10.1007/s13555-024-01298-7. Epub 2024 Nov 7.
5
Overcome support vector machine diagnosis overfitting.
Cancer Inform. 2014 Dec 9;13(Suppl 1):145-58. doi: 10.4137/CIN.S13875. eCollection 2014.
6
Derivative component analysis for mass spectral serum proteomic profiles.
BMC Med Genomics. 2014;7 Suppl 1(Suppl 1):S5. doi: 10.1186/1755-8794-7-S1-S5. Epub 2014 May 8.
7
Mass spectrometry imaging as a tool for surgical decision-making.
J Mass Spectrom. 2013 Nov;48(11):1178-87. doi: 10.1002/jms.3295.
8
A high performance profile-biomarker diagnosis for mass spectral profiles.
BMC Syst Biol. 2011;5 Suppl 2(Suppl 2):S5. doi: 10.1186/1752-0509-5-S2-S5. Epub 2011 Dec 14.

本文引用的文献

1
Nonnegative principal component analysis for cancer molecular pattern discovery.
IEEE/ACM Trans Comput Biol Bioinform. 2010 Jul-Sep;7(3):537-49. doi: 10.1109/TCBB.2009.36.
2
Biomarker discovery in MALDI-TOF serum protein profiles using discrete wavelet transformation.
Bioinformatics. 2009 Mar 1;25(5):643-9. doi: 10.1093/bioinformatics/btn662.
3
Independent component analysis for the extraction of reliable protein signal profiles from MALDI-TOF mass spectra.
Bioinformatics. 2008 Jan 1;24(1):63-70. doi: 10.1093/bioinformatics/btm533. Epub 2007 Nov 14.
4
Feature Selection for Classification of SELDI-TOF-MS Proteomic Profiles.
Appl Bioinformatics. 2005;4(4):227-46. doi: 10.2165/00822942-200504040-00003.
5
Analysis of mass spectral serum profiles for biomarker selection.
Bioinformatics. 2005 Nov 1;21(21):4039-45. doi: 10.1093/bioinformatics/bti670. Epub 2005 Sep 13.
6
Ovarian cancer identification based on dimensionality reduction for high-throughput mass spectrometry data.
Bioinformatics. 2005 May 15;21(10):2200-9. doi: 10.1093/bioinformatics/bti370. Epub 2005 Mar 22.
7
Serum proteomics profiling--a young technology begins to mature.
Nat Biotechnol. 2005 Mar;23(3):291-2. doi: 10.1038/nbt0305-291.
8
SELDI-TOF-based serum proteomic pattern diagnostics for early detection of cancer.
Curr Opin Biotechnol. 2004 Feb;15(1):24-30. doi: 10.1016/j.copbio.2004.01.005.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验