Supervised principal component analysis for gene set enrichment of microarray data with continuous or survival outcomes.

Authors

Chen Xi, Wang Lily, Smith Jonathan D, Zhang Bing

Affiliation

Department of Quantitative Health Sciences, The Cleveland Clinic, 9500 Euclid Ave. Cleveland, OH 44195, USA.

Publication information

Bioinformatics. 2008 Nov 1;24(21):2474-81. doi: 10.1093/bioinformatics/btn458. Epub 2008 Aug 27.

DOI:10.1093/bioinformatics/btn458
PMID:18753155
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC2732277/
Abstract

MOTIVATION

Gene set analysis allows formal testing of subtle but coordinated changes in a group of genes, such as those defined by the Gene Ontology (GO) or KEGG Pathway databases. We propose a new method for gene set analysis that is based on principal component analysis (PCA) of the gene expression values in the gene set. PCA is an effective method for reducing high dimensionality and capturing variation in gene expression values. However, one limitation of PCA is that the latent variable identified by the first PC may be unrelated to the outcome.
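
The limitation noted above can be illustrated with a minimal sketch on simulated data. Everything here is an assumption made for illustration (the helper name `first_pc_scores`, the sample and gene counts, and the effect sizes); it is not the authors' code. A strong nuisance factor drives ten genes, a weaker outcome-related signal drives three others, and the first PC follows the nuisance factor.

```python
import numpy as np

def first_pc_scores(expr):
    """Return each sample's score on the first principal component.

    expr: array of shape (samples, genes).
    """
    centered = expr - expr.mean(axis=0)
    # Rows of vt are the principal directions, ordered by explained variance.
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[0]

# Hypothetical simulated gene set: 50 samples, 20 genes.
rng = np.random.default_rng(0)
n_samples, n_genes = 50, 20
nuisance = rng.normal(size=n_samples)   # strong factor unrelated to outcome
outcome = rng.normal(size=n_samples)    # continuous outcome of interest
expr = rng.normal(size=(n_samples, n_genes))
expr[:, :10] += 3.0 * nuisance[:, None]    # 10 genes driven by the nuisance factor
expr[:, 10:13] += 0.8 * outcome[:, None]   # 3 genes weakly tied to the outcome

pc1 = first_pc_scores(expr)
corr_nuisance = abs(np.corrcoef(pc1, nuisance)[0, 1])
corr_outcome = abs(np.corrcoef(pc1, outcome)[0, 1])
print(f"|corr(PC1, nuisance)| = {corr_nuisance:.2f}")
print(f"|corr(PC1, outcome)|  = {corr_outcome:.2f}")
```

In this setup the first PC is dominated by the largest source of variance, which here has nothing to do with the outcome.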

RESULTS

In the proposed supervised PCA (SPCA) model for gene set analysis, the PCs are estimated from a selected subset of genes that are associated with the outcome. Because outcome information is used in the gene selection step, the method is supervised, hence the name Supervised PCA. Because of the gene selection step, the test statistic in the SPCA model can no longer be approximated well by a t-distribution. We propose a two-component mixture distribution based on Gumbel extreme value distributions to account for the gene selection step. Using simulated and real microarray data, we show that the proposed method compares favorably with currently available gene set analysis methods.
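
The selection-then-PCA idea, and why it invalidates the t-distribution reference, can be sketched as follows. This is a simplified illustration under stated assumptions, not the authors' implementation. The function names `univariate_t` and `spca_statistic`, the rule of keeping a fixed number of top-ranked genes, and the simulation sizes are all hypothetical, and the Gumbel mixture correction itself is not reproduced. Under a null with no gene-outcome association, the sketch estimates how often the naive t-test on the first PC of the selected genes rejects at the nominal 5% level.

```python
import numpy as np

def univariate_t(expr, outcome):
    """t-statistic for the slope of outcome regressed on each column of expr."""
    n = len(outcome)
    xc = expr - expr.mean(axis=0)
    yc = outcome - outcome.mean()
    r = (xc.T @ yc) / (np.linalg.norm(xc, axis=0) * np.linalg.norm(yc))
    # For simple linear regression, t = r * sqrt((n - 2) / (1 - r^2)).
    return r * np.sqrt((n - 2) / (1.0 - r**2))

def spca_statistic(expr, outcome, n_keep):
    """Select the n_keep genes most associated with outcome, then test PC1.

    Assumption: selection keeps a fixed number of top-ranked genes.
    """
    t_genes = univariate_t(expr, outcome)
    keep = np.argsort(np.abs(t_genes))[-n_keep:]
    sub = expr[:, keep] - expr[:, keep].mean(axis=0)
    _, _, vt = np.linalg.svd(sub, full_matrices=False)
    pc1 = sub @ vt[0]
    return univariate_t(pc1[:, None], outcome)[0], keep

# Null simulation: genes and outcome are independent noise.
rng = np.random.default_rng(1)
n_samples, n_genes, n_keep, n_sims = 50, 50, 5, 300
t_crit = 2.0106  # two-sided 5% critical value of t with 48 degrees of freedom
null_t = np.array([
    spca_statistic(rng.normal(size=(n_samples, n_genes)),
                   rng.normal(size=n_samples), n_keep)[0]
    for _ in range(n_sims)
])
null_rejection_rate = np.mean(np.abs(null_t) > t_crit)
print(f"naive rejection rate under the null: {null_rejection_rate:.2f}")
```

A rejection rate well above 0.05 under the null indicates that the selection step inflates the statistic, which is the reason a reference distribution other than the t-distribution is needed.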

SOFTWARE

The R code for the analysis used in this article is available upon request; we are currently working on implementing the proposed method in an R package.
