Duan Fenghai, Xu Ye
Department of Biostatistics and Center for Statistical Sciences, School of Public Health, Brown University, Providence, RI, USA.
StubHub, San Francisco, CA, USA.
Cancer Inform. 2017 May 4;16:1176935117705381. doi: 10.1177/1176935117705381. eCollection 2017.
To analyze a microarray experiment to identify the genes with expressions varying after the diagnosis of breast cancer.
A total of 44 928 probe sets in an Affymetrix microarray data publicly available on Gene Expression Omnibus from 249 patients with breast cancer were analyzed by the nonparametric multivariate adaptive splines. Then, the identified genes with turning points were grouped by K-means clustering, and their network relationship was subsequently analyzed by the Ingenuity Pathway Analysis.
In total, 1640 probe sets (genes) were reliably identified to have turning points along with the age at diagnosis in their expression profiling, of which 927 expressed lower after turning points and 713 expressed higher after the turning points. K-means clustered them into 3 groups with turning points centering at 54, 62.5, and 72, respectively. The pathway analysis showed that the identified genes were actively involved in various cancer-related functions or networks.
In this article, we applied the nonparametric multivariate adaptive splines method to a publicly available gene expression data and successfully identified genes with expressions varying before and after breast cancer diagnosis.
分析一项微阵列实验,以鉴定乳腺癌诊断后表达发生变化的基因。
通过非参数多变量自适应样条法,对基因表达综合数据库(Gene Expression Omnibus)上公开的249例乳腺癌患者的Affymetrix微阵列数据中的总共44928个探针集进行分析。然后,对识别出的有转折点的基因进行K均值聚类分组,随后通过Ingenuity通路分析对它们的网络关系进行分析。
总共可靠地鉴定出1640个探针集(基因)在其表达谱中随诊断年龄有转折点,其中927个在转折点后表达降低,713个在转折点后表达升高。K均值聚类将它们分为3组,转折点分别集中在54、62.5和72。通路分析表明,识别出的基因积极参与各种癌症相关功能或网络。
在本文中,我们将非参数多变量自适应样条法应用于公开可用的基因表达数据,并成功鉴定出乳腺癌诊断前后表达发生变化的基因。