• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用微阵列数据对病理状态下失调基因的功能类别进行统计学评估。

Statistical assessment of functional categories of genes deregulated in pathological conditions by using microarray data.

作者信息

Maglietta R, Piepoli A, Catalano D, Licciulli F, Carella M, Liuni S, Pesole G, Perri F, Ancona N

机构信息

Istituto di Studi sui Sistemi Intelligenti per l'Automazione, CNR, Via Amendola 122/D-I, 70126 Bari, Italy.

出版信息

Bioinformatics. 2007 Aug 15;23(16):2063-72. doi: 10.1093/bioinformatics/btm289. Epub 2007 May 31.

DOI:10.1093/bioinformatics/btm289
PMID:17540679
Abstract

MOTIVATION

A major challenge in current biomedical research is the identification of cellular processes deregulated in a given pathology through the analysis of gene expression profiles. To this end, predefined lists of genes, coding specific functions, are compared with a list of genes ordered according to their values of differential expression measured by suitable univariate statistics.

RESULTS

We propose a statistically well-founded method for measuring the relevance of predefined lists of genes and for assessing their statistical significance starting from their raw expression levels as recorded on the microarray. We use prediction accuracy as a measure of relevance of the list. The rationale is that a functional category, coded through a list of genes, is perturbed in a given pathology if it is possible to correctly predict the occurrence of the disease in new subjects on the basis of the expression levels of the genes belonging to the list only. The accuracy is estimated with multiple random validation strategy and its statistical significance is assessed against a couple of null hypothesis, by using two independent permutation tests. The utility of the proposed methodology is illustrated by analyzing the relevance of Gene Ontology terms belonging to biological process category in colon and prostate cancer, by using three different microarray data sets and by comparing it with current approaches.

AVAILABILITY

Source code for the algorithms is available from author upon request.

SUPPLEMENTARY INFORMATION

Colon cancer data set and a complete description of experimental results are available at: ftp://bioftp:76bioftpxxx@marx.ba.issia.cnr.it/supp-info.htm.

摘要

动机

当前生物医学研究中的一个主要挑战是通过基因表达谱分析来识别在特定病理状态下失调的细胞过程。为此,将编码特定功能的预定义基因列表与根据通过合适的单变量统计测量的差异表达值排序的基因列表进行比较。

结果

我们提出了一种基于统计学的方法,用于从微阵列记录的原始表达水平开始测量预定义基因列表的相关性并评估其统计显著性。我们使用预测准确性作为列表相关性的度量。基本原理是,如果仅基于属于该列表的基因的表达水平就能够正确预测新受试者中疾病的发生,那么通过基因列表编码的功能类别在给定病理状态下就会受到干扰。通过多重随机验证策略估计准确性,并通过使用两个独立的置换检验针对几个零假设评估其统计显著性。通过使用三个不同的微阵列数据集并将其与当前方法进行比较,分析属于生物学过程类别的基因本体术语在结肠癌和前列腺癌中的相关性,说明了所提出方法的实用性。

可用性

可根据作者要求提供算法的源代码。

补充信息

结肠癌数据集和实验结果的完整描述可在以下网址获取:ftp://bioftp:76bioftpxxx@marx.ba.issia.cnr.it/supp-info.htm。

相似文献

1
Statistical assessment of functional categories of genes deregulated in pathological conditions by using microarray data.利用微阵列数据对病理状态下失调基因的功能类别进行统计学评估。
Bioinformatics. 2007 Aug 15;23(16):2063-72. doi: 10.1093/bioinformatics/btm289. Epub 2007 May 31.
2
Selection of relevant genes in cancer diagnosis based on their prediction accuracy.基于相关基因的预测准确性进行癌症诊断中的基因选择。
Artif Intell Med. 2007 May;40(1):29-44. doi: 10.1016/j.artmed.2006.06.002. Epub 2006 Aug 22.
3
Empirical Bayes screening of many p-values with applications to microarray studies.用于微阵列研究的多p值经验贝叶斯筛选。
Bioinformatics. 2005 May 1;21(9):1987-94. doi: 10.1093/bioinformatics/bti301. Epub 2005 Feb 2.
4
Pathway recognition and augmentation by computational analysis of microarray expression data.通过微阵列表达数据的计算分析进行通路识别与增强
Bioinformatics. 2006 Jan 15;22(2):233-41. doi: 10.1093/bioinformatics/bti764. Epub 2005 Nov 8.
5
Clustering threshold gradient descent regularization: with applications to microarray studies.聚类阈值梯度下降正则化:及其在微阵列研究中的应用
Bioinformatics. 2007 Feb 15;23(4):466-72. doi: 10.1093/bioinformatics/btl632. Epub 2006 Dec 20.
6
Gaining confidence in biological interpretation of the microarray data: the functional consistence of the significant GO categories.增强对微阵列数据生物学解读的信心:显著GO类别的功能一致性。
Bioinformatics. 2008 Jan 15;24(2):265-71. doi: 10.1093/bioinformatics/btm558. Epub 2007 Nov 15.
7
Algebraic stability indicators for ranked lists in molecular profiling.分子谱分析中排序列表的代数稳定性指标
Bioinformatics. 2008 Jan 15;24(2):258-64. doi: 10.1093/bioinformatics/btm550. Epub 2007 Nov 16.
8
Selecting differentially expressed genes using minimum probability of classification error.使用最小分类错误概率选择差异表达基因。
J Biomed Inform. 2007 Dec;40(6):775-86. doi: 10.1016/j.jbi.2007.07.006. Epub 2007 Aug 29.
9
Constructing the gene regulation-level representation of microarray data for cancer classification.构建用于癌症分类的微阵列数据的基因调控水平表示。
J Biomed Inform. 2008 Feb;41(1):95-105. doi: 10.1016/j.jbi.2007.04.002. Epub 2007 Apr 11.
10
Integration of GO annotations in Correspondence Analysis: facilitating the interpretation of microarray data.在对应分析中整合基因本体论注释:助力微阵列数据的解读
Bioinformatics. 2005 May 15;21(10):2424-9. doi: 10.1093/bioinformatics/bti367. Epub 2005 Mar 3.

引用本文的文献

1
Pathway expression analysis.通路表达分析。
Sci Rep. 2022 Dec 17;12(1):21839. doi: 10.1038/s41598-022-26381-x.
2
Integrating biological knowledge and gene expression data using pathway-guided random forests: a benchmarking study.使用通路引导的随机森林整合生物学知识和基因表达数据:一项基准研究
Bioinformatics. 2020 Aug 1;36(15):4301-4308. doi: 10.1093/bioinformatics/btaa483.
3
Contextualizing the Genes Altered in Bladder Neoplasms in Pediatric andTeen Patients Allows Identifying Two Main Classes of Biological ProcessesInvolved and New Potential Therapeutic Targets.
将小儿和青少年膀胱肿瘤中改变的基因置于背景中考虑,有助于识别所涉及的两类主要生物学过程以及新的潜在治疗靶点。
Curr Genomics. 2016 Feb;17(1):33-61. doi: 10.2174/1389202916666151014222603.
4
Advanced colorectal adenoma related gene expression signature may predict prognostic for colorectal cancer patients with adenoma-carcinoma sequence.高级别结直肠腺瘤相关基因表达特征可能预测具有腺瘤-癌序列的结直肠癌患者的预后。
Int J Clin Exp Med. 2015 Apr 15;8(4):4883-98. eCollection 2015.
5
Genome-wide Pathway Analysis Using Gene Expression Data of Colonic Mucosa in Patients with Inflammatory Bowel Disease.利用炎症性肠病患者结肠黏膜基因表达数据进行全基因组通路分析
Inflamm Bowel Dis. 2015 Jun;21(6):1260-8. doi: 10.1097/MIB.0000000000000370.
6
GSAASeqSP: a toolset for gene set association analysis of RNA-Seq data.GSAASeqSP:一种用于RNA测序数据基因集关联分析的工具集。
Sci Rep. 2014 Sep 12;4:6347. doi: 10.1038/srep06347.
7
Knowledge Driven Variable Selection (KDVS) - a new approach to enrichment analysis of gene signatures obtained from high-throughput data.知识驱动的变量选择(KDVS)——一种对从高通量数据中获得的基因特征进行富集分析的新方法。
Source Code Biol Med. 2013 Jan 9;8(1):2. doi: 10.1186/1751-0473-8-2.
8
Molecular pathways undergoing dramatic transcriptomic changes during tumor development in the human colon.人类结肠肿瘤发生过程中经历显著转录组变化的分子途径。
BMC Cancer. 2012 Dec 19;12:608. doi: 10.1186/1471-2407-12-608.
9
A predictive framework for integrating disparate genomic data types using sample-specific gene set enrichment analysis and multi-task learning.使用样本特异性基因集富集分析和多任务学习整合不同基因组数据类型的预测框架。
PLoS One. 2012;7(9):e44635. doi: 10.1371/journal.pone.0044635. Epub 2012 Sep 13.
10
Integrating genetic and gene expression evidence into genome-wide association analysis of gene sets.将遗传和基因表达证据整合到基因集的全基因组关联分析中。
Genome Res. 2012 Feb;22(2):386-97. doi: 10.1101/gr.124370.111. Epub 2011 Sep 22.