• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用加权偏最小二乘法从相关微阵列研究中借用信息进行样本分类。

Borrowing information from relevant microarray studies for sample classification using weighted partial least squares.

作者信息

Huang Xiaohong, Pan Wei, Han Xinqiang, Chen Yingjie, Miller Leslie W, Hall Jennifer

机构信息

Division of Biostatistics, School of Public Health, University of Minnesota, A460 Mayo Building (MMC 303), Minneapolis, MN 55455-0378, USA.

出版信息

Comput Biol Chem. 2005 Jun;29(3):204-11. doi: 10.1016/j.compbiolchem.2005.04.002.

DOI:10.1016/j.compbiolchem.2005.04.002
PMID:15979040
Abstract

With an increasing number of publicly available microarray datasets, it becomes attractive to borrow information from other relevant studies to have more reliable and powerful analysis of a given dataset. We do not assume that subjects in the current study and other relevant studies are drawn from the same population as assumed by meta-analysis. In particular, the set of parameters in the current study may be different from that of the other studies. We consider sample classification based on gene expression profiles in this context. We propose two new methods, a weighted partial least squares (WPLS) method and a weighted penalized partial least squares (WPPLS) method, to build a classifier by a combined use of multiple datasets. The methods can weight the individual datasets depending on their relevance to the current study. A more standard approach is first to build a classifier using each of the individual datasets, then to combine the outputs of the multiple classifiers using a weighted voting. Using two quite different datasets on human heart failure, we show first that WPLS/WPPLS, by borrowing information from the other dataset, can improve the performance of PLS/PPLS built on only a single dataset. Second, WPLS/WPPLS performs better than the standard approach of combining multiple classifiers. Third, WPPLS can improve over WPLS, just as PPLS does over PLS for a single dataset.

摘要

随着公开可用的微阵列数据集越来越多,从其他相关研究中借鉴信息,以便对给定数据集进行更可靠、更有力的分析,变得颇具吸引力。我们并不像荟萃分析那样假设当前研究和其他相关研究中的受试者来自同一总体。特别是,当前研究中的参数集可能与其他研究的不同。在此背景下,我们考虑基于基因表达谱的样本分类。我们提出了两种新方法,即加权偏最小二乘法(WPLS)和加权惩罚偏最小二乘法(WPPLS),通过结合使用多个数据集来构建分类器。这些方法可以根据各个数据集与当前研究的相关性对其进行加权。一种更为标准的方法是,首先使用每个单独的数据集构建一个分类器,然后使用加权投票来组合多个分类器的输出。通过使用两个截然不同的人类心力衰竭数据集,我们首先表明,WPLS/WPPLS通过借鉴另一个数据集的信息,可以提高仅基于单个数据集构建的PLS/PPLS的性能。其次,WPLS/WPPLS的性能优于组合多个分类器的标准方法。第三,WPPLS相对于WPLS有所改进,就如同单个数据集的PPLS相对于PLS有所改进一样。

相似文献

1
Borrowing information from relevant microarray studies for sample classification using weighted partial least squares.利用加权偏最小二乘法从相关微阵列研究中借用信息进行样本分类。
Comput Biol Chem. 2005 Jun;29(3):204-11. doi: 10.1016/j.compbiolchem.2005.04.002.
2
Consensus and Meta-analysis regulatory networks for combining multiple microarray gene expression datasets.用于整合多个微阵列基因表达数据集的共识与荟萃分析调控网络。
J Biomed Inform. 2008 Dec;41(6):914-26. doi: 10.1016/j.jbi.2008.01.011. Epub 2008 Feb 6.
3
Reliable gene signatures for microarray classification: assessment of stability and performance.用于微阵列分类的可靠基因特征:稳定性和性能评估
Bioinformatics. 2006 Oct 1;22(19):2356-63. doi: 10.1093/bioinformatics/btl400. Epub 2006 Jul 31.
4
On combining multiple microarray studies for improved functional classification by whole-dataset feature selection.通过全数据集特征选择合并多个微阵列研究以改进功能分类
Genome Inform. 2003;14:44-53.
5
Predicting patient survival from microarray data by accelerated failure time modeling using partial least squares and LASSO.通过使用偏最小二乘法和套索算法的加速失效时间模型从微阵列数据预测患者生存率。
Biometrics. 2007 Mar;63(1):259-71. doi: 10.1111/j.1541-0420.2006.00660.x.
6
A hierarchical Naïve Bayes Model for handling sample heterogeneity in classification problems: an application to tissue microarrays.一种用于处理分类问题中样本异质性的分层朴素贝叶斯模型:在组织微阵列中的应用。
BMC Bioinformatics. 2006 Nov 24;7:514. doi: 10.1186/1471-2105-7-514.
7
Integrated analysis of independent gene expression microarray datasets improves the predictability of breast cancer outcome.独立基因表达微阵列数据集的综合分析提高了乳腺癌预后的可预测性。
BMC Genomics. 2007 Sep 20;8:331. doi: 10.1186/1471-2164-8-331.
8
Evaluating methods for classifying expression data.评估用于分类表达数据的方法。
J Biopharm Stat. 2004 Nov;14(4):1065-84. doi: 10.1081/BIP-200035491.
9
Modeling the relationship between LVAD support time and gene expression changes in the human heart by penalized partial least squares.通过惩罚偏最小二乘法建立左心室辅助装置支持时间与人类心脏基因表达变化之间的关系模型。
Bioinformatics. 2004 Apr 12;20(6):888-94. doi: 10.1093/bioinformatics/btg499. Epub 2004 Jan 29.
10
Dimension reduction for classification with gene expression microarray data.利用基因表达微阵列数据进行分类的降维方法。
Stat Appl Genet Mol Biol. 2006;5:Article6. doi: 10.2202/1544-6115.1147. Epub 2006 Feb 24.

引用本文的文献

1
Identifying the factors associated with cesarean section modeled with categorical correlation coefficients in partial least squares.用偏最小二乘法中的类别相关系数对剖宫产的影响因素进行建模分析。
PLoS One. 2019 Jul 26;14(7):e0219427. doi: 10.1371/journal.pone.0219427. eCollection 2019.
2
Adaptive prediction model in prospective molecular signature-based clinical studies.前瞻性分子特征临床研究中的自适应预测模型。
Clin Cancer Res. 2014 Feb 1;20(3):531-9. doi: 10.1158/1078-0432.CCR-13-2127. Epub 2013 Dec 9.
3
Probe mapping across multiple microarray platforms.
在多个微阵列平台上进行探针映射。
Brief Bioinform. 2012 Sep;13(5):547-54. doi: 10.1093/bib/bbr076. Epub 2011 Dec 23.
4
SlimPLS: a method for feature selection in gene expression-based disease classification.SlimPLS:一种基于基因表达的疾病分类中特征选择的方法。
PLoS One. 2009 Jul 29;4(7):e6416. doi: 10.1371/journal.pone.0006416.
5
A comparative study of discriminating human heart failure etiology using gene expression profiles.利用基因表达谱鉴别人类心力衰竭病因的比较研究。
BMC Bioinformatics. 2005 Aug 24;6:205. doi: 10.1186/1471-2105-6-205.