基于 SELDI-TOF 数据降维的卵巢癌分类。

Ovarian cancer classification based on dimensionality reduction for SELDI-TOF data.

机构信息

Department of Chemistry, Tongji University, Shanghai, 200092, China.

出版信息

BMC Bioinformatics. 2010 Feb 27;11:109. doi: 10.1186/1471-2105-11-109.

DOI:10.1186/1471-2105-11-109

PMID:20187963

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2846906/

Abstract

BACKGROUND

Recent advances in proteomics technologies such as SELDI-TOF mass spectrometry has shown promise in the detection of early stage cancers. However, dimensionality reduction and classification are considerable challenges in statistical machine learning. We therefore propose a novel approach for dimensionality reduction and tested it using published high-resolution SELDI-TOF data for ovarian cancer.

RESULTS

We propose a method based on statistical moments to reduce feature dimensions. After refining and t-testing, SELDI-TOF data are divided into several intervals. Four statistical moments (mean, variance, skewness and kurtosis) are calculated for each interval and are used as representative variables. The high dimensionality of the data can thus be rapidly reduced. To improve efficiency and classification performance, the data are further used in kernel PLS models. The method achieved average sensitivity of 0.9950, specificity of 0.9916, accuracy of 0.9935 and a correlation coefficient of 0.9869 for 100 five-fold cross validations. Furthermore, only one control was misclassified in leave-one-out cross validation.

CONCLUSION

The proposed method is suitable for analyzing high-throughput proteomics data.

摘要

背景

SELDI-TOF 质谱等蛋白质组学技术的最新进展显示出在检测早期癌症方面的潜力。然而，在统计机器学习中，降维和分类是相当大的挑战。因此，我们提出了一种新的降维方法，并使用已发表的卵巢癌高分辨率 SELDI-TOF 数据对其进行了测试。

结果

我们提出了一种基于统计矩的方法来降低特征维度。经过精炼和 t 检验后，将 SELDI-TOF 数据分为几个区间。为每个区间计算四个统计矩（均值、方差、偏度和峰度），并用作代表变量。因此，可以快速降低数据的高维性。为了提高效率和分类性能，进一步将数据用于核 PLS 模型。该方法在 100 次五重交叉验证中实现了平均灵敏度为 0.9950、特异性为 0.9916、准确性为 0.9935 和相关系数为 0.9869。此外，在留一法交叉验证中只有一个对照被错误分类。

结论

所提出的方法适用于分析高通量蛋白质组学数据。

相似文献

Ovarian cancer classification based on dimensionality reduction for SELDI-TOF data.基于 SELDI-TOF 数据降维的卵巢癌分类。

BMC Bioinformatics. 2010 Feb 27;11:109. doi: 10.1186/1471-2105-11-109.

Ovarian cancer identification based on dimensionality reduction for high-throughput mass spectrometry data.基于高通量质谱数据降维的卵巢癌识别

Bioinformatics. 2005 May 15;21(10):2200-9. doi: 10.1093/bioinformatics/bti370. Epub 2005 Mar 22.

Feature selection and nearest centroid classification for protein mass spectrometry.蛋白质质谱的特征选择与最近质心分类

BMC Bioinformatics. 2005 Mar 23;6:68. doi: 10.1186/1471-2105-6-68.

Application of SELDI-TOF in N-glycopeptides profiling of the urine from patients with endometrial, ovarian and cervical cancer.表面增强激光解吸电离飞行时间质谱（SELDI-TOF）在子宫内膜癌、卵巢癌和宫颈癌患者尿液N-糖肽谱分析中的应用

Arch Physiol Biochem. 2016 Jul;122(3):111-6. doi: 10.3109/13813455.2016.1151441. Epub 2016 Mar 16.

Advances in clinical cancer proteomics: SELDI-ToF-mass spectrometry and biomarker discovery.临床癌症蛋白质组学进展：表面增强激光解吸电离飞行时间质谱与生物标志物发现

Brief Funct Genomic Proteomic. 2005 May;4(1):16-26. doi: 10.1093/bfgp/4.1.16.

SELDI-TOF-MS proteomics of breast cancer.乳腺癌的表面增强激光解吸电离飞行时间质谱蛋白质组学

Clin Chem Lab Med. 2005;43(12):1314-20. doi: 10.1515/CCLM.2005.225.

Application of serum SELDI proteomic patterns in diagnosis of lung cancer.血清表面增强激光解吸电离飞行时间质谱蛋白质组学图谱在肺癌诊断中的应用。

BMC Cancer. 2005 Jul 20;5:83. doi: 10.1186/1471-2407-5-83.

[Proteomic profiling: the potential of Seldi-Tof for the identification of new cancer biomarkers].蛋白质组学分析：表面增强激光解吸电离飞行时间质谱技术用于鉴定新型癌症生物标志物的潜力

Bull Cancer. 2005 Sep;92(9):763-8.

Pancreatic cancer biomarkers discovery by surface-enhanced laser desorption and ionization time-of-flight mass spectrometry.通过表面增强激光解吸电离飞行时间质谱法发现胰腺癌生物标志物

Clin Chem Lab Med. 2009;47(6):713-23. doi: 10.1515/CCLM.2009.158.

Limitations in SELDI-TOF MS whole serum proteomic profiling with IMAC surface to specifically detect colorectal cancer.使用IMAC表面的SELDI-TOF MS全血清蛋白质组分析在特异性检测结直肠癌方面的局限性。

BMC Cancer. 2009 Aug 19;9:287. doi: 10.1186/1471-2407-9-287.

引用本文的文献

Reducing phenotype-structured partial differential equations models of cancer evolution to systems of ordinary differential equations: a generalised moment dynamics approach.将癌症进化的表型结构偏微分方程模型简化为常微分方程组：一种广义矩动力学方法。

J Math Biol. 2025 Jul 28;91(2):22. doi: 10.1007/s00285-025-02246-5.

HSSG: Identification of Cancer Subtypes Based on Heterogeneity Score of A Single Gene.HSSG：基于单个基因异质性得分的癌症亚型识别。

Cells. 2022 Aug 8;11(15):2456. doi: 10.3390/cells11152456.

Biomarker Discovery by Imperialist Competitive Algorithm in Mass Spectrometry Data for Ovarian Cancer Prediction.基于帝国主义竞争算法的质谱数据生物标志物发现用于卵巢癌预测

J Med Signals Sens. 2021 May 24;11(2):108-119. doi: 10.4103/jmss.JMSS_20_20. eCollection 2021 Apr-Jun.

Protein analytical assays for diagnosing, monitoring, and choosing treatment for cancer patients.用于癌症患者诊断、监测和选择治疗方案的蛋白质分析检测方法。

J Healthc Eng. 2012 Dec;3(4):503-534. doi: 10.1260/2040-2295.3.4.503.

GyneScan: an improved online paradigm for screening of ovarian cancer via tissue characterization.GyneScan：一种通过组织特征分析筛查卵巢癌的改进型在线模式。

Technol Cancer Res Treat. 2014 Dec;13(6):529-39. doi: 10.7785/tcrtexpress.2013.600273. Epub 2013 Dec 6.

Availability of MudPIT data for classification of biological samples.用于生物样本分类的多维蛋白质鉴定技术（MudPIT）数据的可用性。

J Clin Bioinforma. 2013 Jan 14;3(1):1. doi: 10.1186/2043-9113-3-1.

An improved dimensionality reduction method for meta-transcriptome indexing based diseases classification.一种基于元转录组索引的疾病分类的改进降维方法。

BMC Syst Biol. 2012;6 Suppl 3(Suppl 3):S12. doi: 10.1186/1752-0509-6-S3-S12. Epub 2012 Dec 17.

Combining proteomics, serum biomarkers and bioinformatics to discriminate between esophageal squamous cell carcinoma and pre-cancerous lesion.运用蛋白质组学、血清生物标志物和生物信息学区分食管鳞状细胞癌和癌前病变。

J Zhejiang Univ Sci B. 2012 Dec;13(12):964-71. doi: 10.1631/jzus.B1200066.

Ovarian tumor characterization and classification using ultrasound-a new online paradigm.基于超声的卵巢肿瘤特征化与分类——一种新的在线范例。

J Digit Imaging. 2013 Jun;26(3):544-53. doi: 10.1007/s10278-012-9553-8.

The application of SELDI-TOF-MS in clinical diagnosis of cancers.表面增强激光解吸电离飞行时间质谱技术在癌症临床诊断中的应用。

J Biomed Biotechnol. 2011;2011:245821. doi: 10.1155/2011/245821. Epub 2011 May 23.

本文引用的文献

Classification of premalignant pancreatic cancer mass-spectrometry data using decision tree ensembles.使用决策树集成对癌前胰腺癌质谱数据进行分类。

BMC Bioinformatics. 2008 Jun 11;9:275. doi: 10.1186/1471-2105-9-275.

A review of feature selection techniques in bioinformatics.生物信息学中特征选择技术综述。

Bioinformatics. 2007 Oct 1;23(19):2507-17. doi: 10.1093/bioinformatics/btm344. Epub 2007 Aug 24.

Assessing the utility of SELDI-TOF and model averaging for serum proteomic biomarker discovery.评估表面增强激光解吸电离飞行时间质谱（SELDI-TOF）及模型平均法在血清蛋白质组生物标志物发现中的效用。

Proteomics. 2006 Dec;6(24):6405-15. doi: 10.1002/pmic.200600420.

Recursive SVM feature selection and sample classification for mass-spectrometry and microarray data.用于质谱和微阵列数据的递归支持向量机特征选择与样本分类

BMC Bioinformatics. 2006 Apr 10;7:197. doi: 10.1186/1471-2105-7-197.

Biomarker discovery for ovarian cancer using SELDI-TOF-MS.使用表面增强激光解吸电离飞行时间质谱技术发现卵巢癌生物标志物

Gynecol Oncol. 2006 Jul;102(1):61-6. doi: 10.1016/j.ygyno.2005.11.029. Epub 2006 Jan 5.

A robust meta-classification strategy for cancer detection from MS data.一种用于从质谱数据中进行癌症检测的强大元分类策略。

Proteomics. 2006 Jan;6(2):592-604. doi: 10.1002/pmic.200500192.

Feature Selection for Classification of SELDI-TOF-MS Proteomic Profiles.用于SELDI-TOF-MS蛋白质组学图谱分类的特征选择

Appl Bioinformatics. 2005;4(4):227-46. doi: 10.2165/00822942-200504040-00003.

Using proteomic approaches to identify new biomarkers for detection and monitoring of ovarian cancer.运用蛋白质组学方法鉴定用于检测和监测卵巢癌的新型生物标志物。

Gynecol Oncol. 2006 Feb;100(2):247-53. doi: 10.1016/j.ygyno.2005.08.051. Epub 2005 Oct 17.

Ovarian cancer identification based on dimensionality reduction for high-throughput mass spectrometry data.基于高通量质谱数据降维的卵巢癌识别

Bioinformatics. 2005 May 15;21(10):2200-9. doi: 10.1093/bioinformatics/bti370. Epub 2005 Mar 22.

Application of the GA/KNN method to SELDI proteomics data.遗传算法/最近邻算法在表面增强激光解吸电离飞行时间质谱蛋白质组学数据中的应用。

Bioinformatics. 2004 Jul 10;20(10):1638-40. doi: 10.1093/bioinformatics/bth098. Epub 2004 Feb 12.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于 SELDI-TOF 数据降维的卵巢癌分类。

Ovarian cancer classification based on dimensionality reduction for SELDI-TOF data.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSION

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献