• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估基于蛋白质组学的生物标志物的统计有效性。

Assessing the statistical validity of proteomics based biomarkers.

作者信息

Smit Suzanne, van Breemen Mariëlle J, Hoefsloot Huub C J, Smilde Age K, Aerts Johannes M F G, de Koster Chris G

机构信息

Swammerdam Institute for Life Sciences, Universiteit van-Amsterdam, Nieuwe Achtergracht 166, 1018 WV Amsterdam, The Netherlands.

出版信息

Anal Chim Acta. 2007 Jun 5;592(2):210-7. doi: 10.1016/j.aca.2007.04.043. Epub 2007 Apr 27.

DOI:10.1016/j.aca.2007.04.043
PMID:17512828
Abstract

A strategy is presented for the statistical validation of discrimination models in proteomics studies. Several existing tools are combined to form a solid statistical basis for biomarker discovery that should precede a biochemical validation of any biomarker. These tools consist of permutation tests, single and double cross-validation. The cross-validation steps can simply be combined with a new variable selection method, called rank products. The strategy is especially suited for the low-samples-to-variables-ratio (undersampling) case, as is often encountered in proteomics and metabolomics studies. As a classification method, principal component discriminant analysis is used; however, the methodology can be used with any classifier. A dataset containing serum samples from Gaucher patients and healthy controls serves as a test case. Double cross-validation shows that the sensitivity of the model is 89% and the specificity 90%. Potential putative biomarkers are identified using the novel variable selection method. Results from permutation tests support the choice of double cross-validation as the tool for determining error rates when the modelling procedure involves a tuneable parameter. This shows that even cross-validation does not guarantee unbiased results. The validation of discrimination models with a combination of permutation tests and double cross-validation helps to avoid erroneous results which may result from the undersampling.

摘要

本文提出了一种蛋白质组学研究中判别模型统计验证的策略。将几种现有工具结合起来,为生物标志物发现形成坚实的统计基础,这应先于任何生物标志物的生化验证。这些工具包括排列检验、单重和双重交叉验证。交叉验证步骤可以简单地与一种称为秩乘积的新变量选择方法相结合。该策略特别适用于蛋白质组学和代谢组学研究中经常遇到的低样本与变量比率(欠采样)情况。作为一种分类方法,使用主成分判别分析;然而,该方法可与任何分类器一起使用。一个包含戈谢病患者和健康对照血清样本的数据集用作测试案例。双重交叉验证表明,该模型的灵敏度为89%,特异性为90%。使用新型变量选择方法识别潜在的假定生物标志物。排列检验的结果支持将双重交叉验证作为在建模过程涉及可调参数时确定错误率的工具。这表明即使是交叉验证也不能保证无偏结果。结合排列检验和双重交叉验证对判别模型进行验证,有助于避免欠采样可能导致的错误结果。

相似文献

1
Assessing the statistical validity of proteomics based biomarkers.评估基于蛋白质组学的生物标志物的统计有效性。
Anal Chim Acta. 2007 Jun 5;592(2):210-7. doi: 10.1016/j.aca.2007.04.043. Epub 2007 Apr 27.
2
A classification model for the Leiden proteomics competition.用于莱顿蛋白质组学竞赛的分类模型。
Stat Appl Genet Mol Biol. 2008;7(2):Article8. doi: 10.2202/1544-6115.1351. Epub 2008 Feb 19.
3
Statistical data processing in clinical proteomics.临床蛋白质组学中的统计数据处理
J Chromatogr B Analyt Technol Biomed Life Sci. 2008 Apr 15;866(1-2):77-88. doi: 10.1016/j.jchromb.2007.10.042. Epub 2007 Nov 4.
4
How to distinguish healthy from diseased? Classification strategy for mass spectrometry-based clinical proteomics.如何区分健康与疾病?基于质谱的临床蛋白质组学分类策略。
Proteomics. 2007 Oct;7(20):3672-80. doi: 10.1002/pmic.200700046.
5
Proteomic data analysis workflow for discovery of candidate biomarker peaks predictive of clinical outcome for patients with acute myeloid leukemia.用于发现预测急性髓性白血病患者临床结局的候选生物标志物峰的蛋白质组学数据分析流程。
J Proteome Res. 2008 Jun;7(6):2332-41. doi: 10.1021/pr070482e. Epub 2008 May 2.
6
Improvements in the search for potential biomarkers by proteomics: application of principal component and discriminant analyses for two-dimensional maps evaluation.蛋白质组学在寻找潜在生物标志物方面的进展:主成分分析和判别分析在二维图谱评估中的应用
J Chromatogr B Analyt Technol Biomed Life Sci. 2007 Apr 15;849(1-2):251-60. doi: 10.1016/j.jchromb.2006.09.021. Epub 2006 Oct 30.
7
Serum peptidome profiling revealed platelet factor 4 as a potential discriminating Peptide associated with pancreatic cancer.血清肽组分析显示血小板因子4是一种与胰腺癌相关的潜在鉴别肽。
Clin Cancer Res. 2009 Jun 1;15(11):3812-9. doi: 10.1158/1078-0432.CCR-08-2701. Epub 2009 May 26.
8
Identification of lung cancer patients by serum protein profiling using surface-enhanced laser desorption/ionization time-of-flight mass spectrometry.使用表面增强激光解吸/电离飞行时间质谱法通过血清蛋白质谱分析鉴定肺癌患者
Am J Clin Oncol. 2008 Apr;31(2):133-9. doi: 10.1097/COC.0b013e318145b98b.
9
Urinary proteomics--a tool for biomarker discovery.尿蛋白质组学——生物标志物发现的工具。
Ren Fail. 2010 Jan;32(2):259-68. doi: 10.3109/08860221003599759.
10
Discrimination analysis of mass spectrometry proteomics for ovarian cancer detection.用于卵巢癌检测的质谱蛋白质组学的判别分析
Acta Pharmacol Sin. 2008 Oct;29(10):1240-6. doi: 10.1111/j.1745-7254.2008.00861.x.

引用本文的文献

1
Headspace Solid-Phase Microextraction/Gas Chromatography-Mass Spectrometry and Chemometric Approach for the Study of Volatile Profile in X-ray Irradiated Surface-Ripened Cheeses.顶空固相微萃取/气相色谱-质谱联用及化学计量学方法用于研究X射线辐照的表面成熟奶酪的挥发性成分
Foods. 2024 Jan 27;13(3):416. doi: 10.3390/foods13030416.
2
Untargeted Lipidomics and Chemometric Tools for the Characterization and Discrimination of Irradiated Camembert Cheese Analyzed by UHPLC-Q-Orbitrap-MS.用于通过超高效液相色谱-四极杆-轨道阱质谱分析辐照卡芒贝尔奶酪表征与鉴别的非靶向脂质组学和化学计量学工具
Foods. 2023 May 30;12(11):2198. doi: 10.3390/foods12112198.
3
Comprehensive biomarker profiles and chemometric filtering of urinary metabolomics for effective discrimination of prostate carcinoma from benign hyperplasia.
综合生物标志物谱和尿液代谢组学的化学计量学过滤,有效区分前列腺癌与良性增生。
Sci Rep. 2022 Mar 14;12(1):4361. doi: 10.1038/s41598-022-08435-2.
4
Can biochemical traits bridge the gap between genomics and plant performance? A study in rice under drought.生化特征能否弥合基因组学和植物表现之间的差距?以干旱条件下的水稻为例的研究。
Plant Physiol. 2022 Jun 1;189(2):1139-1152. doi: 10.1093/plphys/kiac053.
5
Human disease biomarker panels through systems biology.通过系统生物学构建人类疾病生物标志物组合
Biophys Rev. 2021 Oct 13;13(6):1179-1190. doi: 10.1007/s12551-021-00849-y. eCollection 2021 Dec.
6
Data analysis of MS-based clinical lipidomics studies with crossover design: A tutorial mini-review of statistical methods.基于质谱的交叉设计临床脂质组学研究的数据分析:统计方法教程式小型综述
Clin Mass Spectrom. 2019 May 20;13:5-17. doi: 10.1016/j.clinms.2019.05.002. eCollection 2019 Aug.
7
Metabolomic analysis to discriminate drug-induced liver injury (DILI) phenotypes.代谢组学分析用于鉴别药物性肝损伤(DILI)表型。
Arch Toxicol. 2021 Sep;95(9):3049-3062. doi: 10.1007/s00204-021-03114-z. Epub 2021 Jul 17.
8
Elevated Free Phosphatidylcholine Levels in Cerebrospinal Fluid Distinguish Bacterial from Viral CNS Infections.脑脊液中游离磷脂酰胆碱水平升高可区分细菌性和病毒性中枢神经系统感染。
Cells. 2021 May 6;10(5):1115. doi: 10.3390/cells10051115.
9
CMM-Net: Contextual multi-scale multi-level network for efficient biomedical image segmentation.CMM-Net:用于高效生物医学图像分割的上下文多尺度多层次网络。
Sci Rep. 2021 May 13;11(1):10191. doi: 10.1038/s41598-021-89686-3.
10
Fourier-Transform Infrared Spectroscopy of Skeletal Muscle Tissue: Expanding Biomarkers in Primary Mitochondrial Myopathies.傅里叶变换红外光谱分析骨骼肌组织:原发性线粒体肌病中生物标志物的拓展。
Genes (Basel). 2020 Dec 19;11(12):1522. doi: 10.3390/genes11121522.