• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PPLS-DA 的扩展,用于分类和与普通 PLS-DA 的比较。

An extension of PPLS-DA for classification and comparison to ordinary PLS-DA.

机构信息

Institute for Genetics and Biometry, Department of Bioinformatics and Biomathematics, Leibniz Institute for Farm Animal Biology, Dummerstorf, Germany.

出版信息

PLoS One. 2013;8(2):e55267. doi: 10.1371/journal.pone.0055267. Epub 2013 Feb 11.

DOI:10.1371/journal.pone.0055267
PMID:23408965
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3569448/
Abstract

Classification studies are widely applied, e.g. in biomedical research to classify objects/patients into predefined groups. The goal is to find a classification function/rule which assigns each object/patient to a unique group with the greatest possible accuracy (classification error). Especially in gene expression experiments often a lot of variables (genes) are measured for only few objects/patients. A suitable approach is the well-known method PLS-DA, which searches for a transformation to a lower dimensional space. Resulting new components are linear combinations of the original variables. An advancement of PLS-DA leads to PPLS-DA, introducing a so called 'power parameter', which is maximized towards the correlation between the components and the group-membership. We introduce an extension of PPLS-DA for optimizing this power parameter towards the final aim, namely towards a minimal classification error. We compare this new extension with the original PPLS-DA and also with the ordinary PLS-DA using simulated and experimental datasets. For the investigated data sets with weak linear dependency between features/variables, no improvement is shown for PPLS-DA and for the extensions compared to PLS-DA. A very weak linear dependency, a low proportion of differentially expressed genes for simulated data, does not lead to an improvement of PPLS-DA over PLS-DA, but our extension shows a lower prediction error. On the contrary, for the data set with strong between-feature collinearity and a low proportion of differentially expressed genes and a large total number of genes, the prediction error of PPLS-DA and the extensions is clearly lower than for PLS-DA. Moreover we compare these prediction results with results of support vector machines with linear kernel and linear discriminant analysis.

摘要

分类研究被广泛应用,例如在生物医学研究中,将对象/患者分类到预定义的组中。目标是找到一个分类函数/规则,将每个对象/患者分配到一个具有最大可能准确性(分类误差)的唯一组中。特别是在基因表达实验中,通常为少数对象/患者测量了大量变量(基因)。一种合适的方法是众所周知的 PLS-DA 方法,它搜索到一个较低维数的空间的转换。产生的新组件是原始变量的线性组合。PLS-DA 的一个改进是 PPLS-DA,引入了所谓的“幂参数”,该参数在组件和组成员之间的相关性方面最大化。我们引入了 PPLS-DA 的扩展,以优化该幂参数以实现最终目标,即最小化分类误差。我们将这种新扩展与原始 PPLS-DA 进行比较,也与使用模拟和实验数据集的普通 PLS-DA 进行比较。对于所研究的具有特征/变量之间弱线性相关性的数据集,与 PLS-DA 相比,PPLS-DA 和扩展没有显示出改进。对于模拟数据,特征之间的弱线性相关性和差异表达基因的低比例,不会导致 PLS-DA 优于 PPLS-DA,但我们的扩展显示出更低的预测误差。相反,对于具有强特征之间共线性和低比例差异表达基因和大量基因的数据集,PPLS-DA 和扩展的预测误差明显低于 PLS-DA。此外,我们将这些预测结果与具有线性核和线性判别分析的支持向量机的结果进行了比较。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f175/3569448/a9144754ce02/pone.0055267.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f175/3569448/c32f6474c231/pone.0055267.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f175/3569448/8ad945792a61/pone.0055267.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f175/3569448/66ab20b72eab/pone.0055267.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f175/3569448/53e9d32bb044/pone.0055267.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f175/3569448/b0b3686f1a14/pone.0055267.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f175/3569448/1ef1dfc676dd/pone.0055267.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f175/3569448/a9144754ce02/pone.0055267.g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f175/3569448/c32f6474c231/pone.0055267.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f175/3569448/8ad945792a61/pone.0055267.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f175/3569448/66ab20b72eab/pone.0055267.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f175/3569448/53e9d32bb044/pone.0055267.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f175/3569448/b0b3686f1a14/pone.0055267.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f175/3569448/1ef1dfc676dd/pone.0055267.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f175/3569448/a9144754ce02/pone.0055267.g007.jpg

相似文献

1
An extension of PPLS-DA for classification and comparison to ordinary PLS-DA.PPLS-DA 的扩展,用于分类和与普通 PLS-DA 的比较。
PLoS One. 2013;8(2):e55267. doi: 10.1371/journal.pone.0055267. Epub 2013 Feb 11.
2
Local classification: Locally weighted-partial least squares-discriminant analysis (LW-PLS-DA).局部分类:局部加权偏最小二乘判别分析(LW-PLS-DA)。
Anal Chim Acta. 2014 Aug 1;838:20-30. doi: 10.1016/j.aca.2014.05.057. Epub 2014 Jun 13.
3
An extension to the discriminant analysis of near-infrared spectra.一种近红外光谱判别分析的扩展。
Med Eng Phys. 2013 Feb;35(2):172-7. doi: 10.1016/j.medengphy.2012.04.012. Epub 2012 May 29.
4
A tutorial review: Metabolomics and partial least squares-discriminant analysis--a marriage of convenience or a shotgun wedding.一篇教程综述:代谢组学与偏最小二乘判别分析——是权宜结合还是仓促结合。
Anal Chim Acta. 2015 Jun 16;879:10-23. doi: 10.1016/j.aca.2015.02.012. Epub 2015 Feb 11.
5
Complex Chemical Data Classification and Discrimination Using Locality Preserving Partial Least Squares Discriminant Analysis.使用局部保留偏最小二乘判别分析的复杂化学数据分类与判别
ACS Omega. 2020 Oct 9;5(41):26601-26610. doi: 10.1021/acsomega.0c03362. eCollection 2020 Oct 20.
6
Improved multi-class discrimination by Common-Subset-of-Independent-Variables Partial-Least-Squares Discriminant Analysis.基于独立变量子集的偏最小二乘判别分析提高多类判别能力。
Talanta. 2021 Nov 1;234:122595. doi: 10.1016/j.talanta.2021.122595. Epub 2021 Jun 15.
7
Nearest clusters based partial least squares discriminant analysis for the classification of spectral data.基于最近聚类的偏最小二乘判别分析用于光谱数据分类
Anal Chim Acta. 2018 Jun 7;1009:27-38. doi: 10.1016/j.aca.2018.01.023.
8
Classification of structurally related commercial contrast media by near infrared spectroscopy.通过近红外光谱法对结构相关的商业造影剂进行分类。
J Pharm Biomed Anal. 2014 Mar;90:148-60. doi: 10.1016/j.jpba.2013.11.033. Epub 2013 Dec 7.
9
PPLS/D: Parallel Pareto Local Search Based on Decomposition.PPLS/D:基于分解的并行帕累托局部搜索
IEEE Trans Cybern. 2020 Mar;50(3):1060-1071. doi: 10.1109/TCYB.2018.2880256. Epub 2018 Nov 29.
10
Stable feature selection and classification algorithms for multiclass microarray data.用于多类微阵列数据的稳定特征选择和分类算法。
Biol Direct. 2012 Oct 2;7:33. doi: 10.1186/1745-6150-7-33.

引用本文的文献

1
A Comparison of Lipid Contents in Different Types of Peanut Cultivars Using UPLC-Q-TOF-MS-Based Lipidomic Study.基于超高效液相色谱-四极杆飞行时间质谱脂质组学研究对不同类型花生品种脂质含量的比较
Foods. 2021 Dec 21;11(1):4. doi: 10.3390/foods11010004.
2
A new method combining LDA and PLS for dimension reduction.一种结合LDA和PLS进行降维的新方法。
PLoS One. 2014 May 12;9(5):e96944. doi: 10.1371/journal.pone.0096944. eCollection 2014.

本文引用的文献

1
BagBoosting for tumor classification with gene expression data.用于基于基因表达数据的肿瘤分类的BagBoosting算法
Bioinformatics. 2004 Dec 12;20(18):3583-93. doi: 10.1093/bioinformatics/bth447. Epub 2004 Oct 5.
2
Gene expression profiling identifies clinically relevant subtypes of prostate cancer.基因表达谱分析可识别前列腺癌的临床相关亚型。
Proc Natl Acad Sci U S A. 2004 Jan 20;101(3):811-6. doi: 10.1073/pnas.0304146101. Epub 2004 Jan 7.
3
Statistical significance for genomewide studies.全基因组研究的统计学显著性
Proc Natl Acad Sci U S A. 2003 Aug 5;100(16):9440-5. doi: 10.1073/pnas.1530509100. Epub 2003 Jul 25.
4
Boosting for tumor classification with gene expression data.利用基因表达数据进行肿瘤分类的提升算法
Bioinformatics. 2003 Jun 12;19(9):1061-9. doi: 10.1093/bioinformatics/btf867.
5
Gene expression correlates of clinical prostate cancer behavior.临床前列腺癌行为的基因表达相关性
Cancer Cell. 2002 Mar;1(2):203-9. doi: 10.1016/s1535-6108(02)00030-2.
6
Gene expression profiling predicts clinical outcome of breast cancer.基因表达谱分析可预测乳腺癌的临床预后。
Nature. 2002 Jan 31;415(6871):530-6. doi: 10.1038/415530a.
7
Diffuse large B-cell lymphoma outcome prediction by gene-expression profiling and supervised machine learning.通过基因表达谱分析和监督式机器学习预测弥漫性大B细胞淋巴瘤的预后
Nat Med. 2002 Jan;8(1):68-74. doi: 10.1038/nm0102-68.
8
Molecular classification of cancer: class discovery and class prediction by gene expression monitoring.癌症的分子分类:通过基因表达监测进行类别发现和类别预测。
Science. 1999 Oct 15;286(5439):531-7. doi: 10.1126/science.286.5439.531.