• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于识别微阵列数据中差异表达基因的非参数方法。

Nonparametric methods for identifying differentially expressed genes in microarray data.

作者信息

Troyanskaya Olga G, Garber Mitchell E, Brown Patrick O, Botstein David, Altman Russ B

机构信息

Department of Genetics, Stanford University School of Medicine, Stanford, CA, USA.

出版信息

Bioinformatics. 2002 Nov;18(11):1454-61. doi: 10.1093/bioinformatics/18.11.1454.

DOI:10.1093/bioinformatics/18.11.1454
PMID:12424116
Abstract

MOTIVATION

Gene expression experiments provide a fast and systematic way to identify disease markers relevant to clinical care. In this study, we address the problem of robust identification of differentially expressed genes from microarray data. Differentially expressed genes, or discriminator genes, are genes with significantly different expression in two user-defined groups of microarray experiments. We compare three model-free approaches: (1). nonparametric t-test, (2). Wilcoxon (or Mann-Whitney) rank sum test, and (3). a heuristic method based on high Pearson correlation to a perfectly differentiating gene ('ideal discriminator method'). We systematically assess the performance of each method based on simulated and biological data under varying noise levels and p-value cutoffs.

RESULTS

All methods exhibit very low false positive rates and identify a large fraction of the differentially expressed genes in simulated data sets with noise level similar to that of actual data. Overall, the rank sum test appears most conservative, which may be advantageous when the computationally identified genes need to be tested biologically. However, if a more inclusive list of markers is desired, a higher p-value cutoff or the nonparametric t-test may be appropriate. When applied to data from lung tumor and lymphoma data sets, the methods identify biologically relevant differentially expressed genes that allow clear separation of groups in question. Thus the methods described and evaluated here provide a convenient and robust way to identify differentially expressed genes for further biological and clinical analysis.

摘要

动机

基因表达实验提供了一种快速且系统的方法来识别与临床护理相关的疾病标志物。在本研究中,我们解决了从微阵列数据中稳健识别差异表达基因的问题。差异表达基因,即鉴别基因,是在两组用户定义的微阵列实验中表达有显著差异的基因。我们比较了三种无模型方法:(1)非参数t检验,(2)威尔科克森(或曼 - 惠特尼)秩和检验,以及(3)一种基于与完美区分基因的高皮尔逊相关性的启发式方法(“理想鉴别方法”)。我们基于模拟数据和生物数据,在不同噪声水平和p值截止值下系统地评估了每种方法的性能。

结果

在噪声水平与实际数据相似的模拟数据集中,所有方法均表现出非常低的假阳性率,并识别出了大部分差异表达基因。总体而言,秩和检验似乎最为保守,当需要对通过计算识别出的基因进行生物学测试时,这可能具有优势。然而,如果需要更具包容性的标志物列表,则较高的p值截止值或非参数t检验可能更为合适。当应用于肺癌和淋巴瘤数据集的数据时,这些方法识别出了具有生物学相关性的差异表达基因,这些基因能够清晰地区分相关组。因此,本文所述及评估的方法为识别差异表达基因以进行进一步的生物学和临床分析提供了一种便捷且稳健的方式。

相似文献

1
Nonparametric methods for identifying differentially expressed genes in microarray data.用于识别微阵列数据中差异表达基因的非参数方法。
Bioinformatics. 2002 Nov;18(11):1454-61. doi: 10.1093/bioinformatics/18.11.1454.
2
Modified nonparametric approaches to detecting differentially expressed genes in replicated microarray experiments.在重复微阵列实验中检测差异表达基因的改良非参数方法。
Bioinformatics. 2003 Jun 12;19(9):1046-54. doi: 10.1093/bioinformatics/btf879.
3
A generalized likelihood ratio test to identify differentially expressed genes from microarray data.一种用于从微阵列数据中识别差异表达基因的广义似然比检验。
Bioinformatics. 2004 Jan 1;20(1):100-4. doi: 10.1093/bioinformatics/btg384.
4
Noise sampling method: an ANOVA approach allowing robust selection of differentially regulated genes measured by DNA microarrays.噪声采样方法:一种方差分析方法,可用于通过DNA微阵列测量的差异调节基因的稳健选择。
Bioinformatics. 2003 Jul 22;19(11):1348-59. doi: 10.1093/bioinformatics/btg165.
5
An associative analysis of gene expression array data.基因表达阵列数据的关联分析。
Bioinformatics. 2003 Jan 22;19(2):204-11. doi: 10.1093/bioinformatics/19.2.204.
6
Comparison of seven methods for producing Affymetrix expression scores based on False Discovery Rates in disease profiling data.基于疾病谱数据中错误发现率的七种生成Affymetrix表达分数方法的比较。
BMC Bioinformatics. 2005 Feb 10;6:26. doi: 10.1186/1471-2105-6-26.
7
Identifying periodically expressed transcripts in microarray time series data.在微阵列时间序列数据中识别周期性表达的转录本。
Bioinformatics. 2004 Jan 1;20(1):5-20. doi: 10.1093/bioinformatics/btg364.
8
A nonparametric likelihood ratio test to identify differentially expressed genes from microarray data.一种用于从微阵列数据中识别差异表达基因的非参数似然比检验。
Appl Bioinformatics. 2006;5(4):267-76. doi: 10.2165/00822942-200605040-00009.
9
Identifying differentially expressed genes from microarray experiments via statistic synthesis.通过统计合成从微阵列实验中识别差异表达基因。
Bioinformatics. 2005 Apr 1;21(7):1084-93. doi: 10.1093/bioinformatics/bti108. Epub 2004 Oct 28.
10
The balance of reproducibility, sensitivity, and specificity of lists of differentially expressed genes in microarray studies.微阵列研究中差异表达基因列表的可重复性、敏感性和特异性之间的平衡。
BMC Bioinformatics. 2008 Aug 12;9 Suppl 9(Suppl 9):S10. doi: 10.1186/1471-2105-9-S9-S10.

引用本文的文献

1
lncRNA Profiling of Exosomes and Its Communication Role in Regulating Silica-Stimulated Macrophage Apoptosis and Fibroblast Activation.外泌体中的长链非编码 RNA 谱及其在调控二氧化硅刺激的巨噬细胞凋亡和成纤维细胞活化中的通讯作用。
Biomolecules. 2024 Jan 24;14(2):146. doi: 10.3390/biom14020146.
2
Amphiregulin couples IL1RL1 regulatory T cells and cancer-associated fibroblasts to impede antitumor immunity. Amphiregulin 连接 IL1RL1 调节性 T 细胞和癌相关成纤维细胞,以阻碍抗肿瘤免疫。
Sci Adv. 2023 Aug 25;9(34):eadd7399. doi: 10.1126/sciadv.add7399. Epub 2023 Aug 23.
3
Tests and classification methods in adaptive designs with applications.
具有应用的自适应设计中的检验与分类方法。
J Appl Stat. 2022 Jan 21;50(6):1334-1357. doi: 10.1080/02664763.2022.2026898. eCollection 2023.
4
AIscEA: unsupervised integration of single-cell gene expression and chromatin accessibility via their biological consistency.AIscEA:通过单细胞基因表达和染色质可及性的生物学一致性进行无监督整合。
Bioinformatics. 2022 Nov 30;38(23):5236-5244. doi: 10.1093/bioinformatics/btac683.
5
New Insights into the Regulatory Role of Ferroptosis in Ankylosing Spondylitis via Consensus Clustering of Ferroptosis-Related Genes and Weighted Gene Co-Expression Network Analysis.通过铁死亡相关基因共识聚类和加权基因共表达网络分析探讨铁死亡在强直性脊柱炎中的调控作用的新见解。
Genes (Basel). 2022 Jul 31;13(8):1373. doi: 10.3390/genes13081373.
6
An Integrated Bioinformatics Analysis towards the Identification of Diagnostic, Prognostic, and Predictive Key Biomarkers for Urinary Bladder Cancer.用于鉴定膀胱癌诊断、预后和预测关键生物标志物的综合生物信息学分析
Cancers (Basel). 2022 Jul 10;14(14):3358. doi: 10.3390/cancers14143358.
7
Selective mediation of ovarian cancer SKOV3 cells death by pristine carbon quantum dots/CuO composite through targeting matrix metalloproteinases, angiogenic cytokines and cytoskeleton.通过靶向基质金属蛋白酶、血管生成细胞因子和细胞骨架,对原始碳量子点/CuO 复合材料对卵巢癌细胞 SKOV3 死亡的选择性介导。
J Nanobiotechnology. 2021 Mar 4;19(1):68. doi: 10.1186/s12951-021-00813-8.
8
Modified Significance Analysis of Microarrays in Heterogeneous Diseases.异质性疾病中微阵列的改良显著性分析
J Pers Med. 2021 Jan 20;11(2):62. doi: 10.3390/jpm11020062.
9
Multi-view feature selection for identifying gene markers: a diversified biological data driven approach.多视角特征选择用于鉴定基因标志物:一种多样化的生物数据驱动方法。
BMC Bioinformatics. 2020 Dec 30;21(Suppl 18):483. doi: 10.1186/s12859-020-03810-0.
10
Statistical Approach for Biologically Relevant Gene Selection from High-Throughput Gene Expression Data.从高通量基因表达数据中选择生物学相关基因的统计方法
Entropy (Basel). 2020 Oct 25;22(11):1205. doi: 10.3390/e22111205.