• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在肺腺癌中寻找差异表达基因的竞争性检测方法的效率分析

Efficiency analysis of competing tests for finding differentially expressed genes in lung adenocarcinoma.

作者信息

Jordan Rick, Patel Satish, Hu Hai, Lyons-Weiler James

机构信息

Department of Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA 15260, USA.

出版信息

Cancer Inform. 2008;6:389-421. doi: 10.4137/cin.s791. Epub 2008 Jul 14.

DOI:10.4137/cin.s791
PMID:19259419
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2623303/
Abstract

In this study, we introduce and use Efficiency Analysis to compare differences in the apparent internal and external consistency of competing normalization methods and tests for identifying differentially expressed genes. Using publicly available data, two lung adenocarcinoma datasets were analyzed using caGEDA (http://bioinformatics2.pitt.edu/GE2/GEDA.html) to measure the degree of differential expression of genes existing between two populations. The datasets were randomly split into at least two subsets, each analyzed for differentially expressed genes between the two sample groups, and the gene lists compared for overlapping genes. Efficiency Analysis is an intuitive method that compares the differences in the percentage of overlap of genes from two or more data subsets, found by the same test over a range of testing methods. Tests that yield consistent gene lists across independently analyzed splits are preferred to those that yield less consistent inferences. For example, a method that exhibits 50% overlap in the 100 top genes from two studies should be preferred to a method that exhibits 5% overlap in the top 100 genes. The same procedure was performed using all available normalization and transformation methods that are available through caGEDA. The 'best' test was then further evaluated using internal cross-validation to estimate generalizable sample classification errors using a Naïve Bayes classification algorithm. A novel test, termed D1 (a derivative of the J5 test) was found to be the most consistent, and to exhibit the lowest overall classification error, and highest sensitivity and specificity. The D1 test relaxes the assumption that few genes are differentially expressed. Efficiency Analysis can be misleading if the tests exhibit a bias in any particular dimension (e.g. expression intensity); we therefore explored intensity-scaled and segmented J5 tests using data in which all genes are scaled to share the same intensity distribution range. Efficiency Analysis correctly predicted the 'best' test and normalization method using the Beer dataset and also performed well with the Bhattacharjee dataset based on both efficiency and classification accuracy criteria.

摘要

在本研究中,我们引入并使用效率分析来比较竞争性标准化方法和用于鉴定差异表达基因的检验在表观内部和外部一致性方面的差异。利用公开可用的数据,使用caGEDA(http://bioinformatics2.pitt.edu/GE2/GEDA.html)对两个肺腺癌数据集进行分析,以测量两个群体之间存在的基因差异表达程度。将数据集随机分成至少两个子集,每个子集分析两个样本组之间的差异表达基因,并比较基因列表中的重叠基因。效率分析是一种直观的方法,可比较通过一系列测试方法由相同检验在两个或更多数据子集中发现的基因重叠百分比差异。在独立分析的拆分中产生一致基因列表的检验优于那些产生较不一致推断的检验。例如,在两项研究的前100个基因中显示出50%重叠的方法应优于在前100个基因中显示出5%重叠的方法。使用caGEDA中可用的所有标准化和转换方法执行相同的程序。然后使用朴素贝叶斯分类算法通过内部交叉验证进一步评估“最佳”检验,以估计可推广的样本分类错误。发现一种名为D1(J5检验的衍生物)的新型检验最一致,总体分类错误最低,灵敏度和特异性最高。D1检验放宽了少数基因差异表达的假设。如果检验在任何特定维度(例如表达强度)上表现出偏差,效率分析可能会产生误导;因此,我们使用所有基因都按比例缩放以共享相同强度分布范围的数据探索了强度缩放和分段的J5检验。基于效率和分类准确性标准,效率分析使用Beer数据集正确预测了“最佳”检验和标准化方法,并且在Bhattacharjee数据集上也表现良好。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/fdf4934f6f34/cin-6-0389f11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/21d2e30a5e32/cin-6-0389f12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/9b5c9d6a4192/cin-6-0389f13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/a6eaedccffda/cin-6-0389f14.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/829816e2376e/cin-6-0389f15.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/79d878fe698c/cin-6-0389f16.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/c2ad539d84d6/cin-6-0389f17.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/bbca6a430d17/cin-6-0389f18.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/858c8b7047da/cin-6-0389f19.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/b2c542c9bdc3/cin-6-0389f20.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/03f1ba8f9564/cin-6-0389f21.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/a9ce0b67cf8c/cin-6-0389f22.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/fa9f638feb93/cin-6-0389f23.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/3bd9799295a7/cin-6-0389f24.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/b12ca55a0ddf/cin-6-0389f25.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/488ad1300317/cin-6-0389f26.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/61f40a04f201/cin-6-0389f27.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/4ce9fd454ab5/cin-6-0389f28.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/67a81da8dbd4/cin-6-0389f29.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/abb29ca6bc93/cin-6-0389f30.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/3e0edf5dd37d/cin-6-0389f31.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/12960f64cc62/cin-6-0389f32.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/6b55336eedd5/cin-6-0389f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/b963b9565b6f/cin-6-0389f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/ba2bcc16776e/cin-6-0389f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/49801e618613/cin-6-0389f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/1c637a96c27e/cin-6-0389f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/52928e9b0205/cin-6-0389f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/46a86f7d7a14/cin-6-0389f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/bfbeeac3bd55/cin-6-0389f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/2a07bf77a2f6/cin-6-0389f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/e3e8486f6297/cin-6-0389f10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/fdf4934f6f34/cin-6-0389f11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/21d2e30a5e32/cin-6-0389f12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/9b5c9d6a4192/cin-6-0389f13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/a6eaedccffda/cin-6-0389f14.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/829816e2376e/cin-6-0389f15.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/79d878fe698c/cin-6-0389f16.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/c2ad539d84d6/cin-6-0389f17.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/bbca6a430d17/cin-6-0389f18.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/858c8b7047da/cin-6-0389f19.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/b2c542c9bdc3/cin-6-0389f20.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/03f1ba8f9564/cin-6-0389f21.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/a9ce0b67cf8c/cin-6-0389f22.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/fa9f638feb93/cin-6-0389f23.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/3bd9799295a7/cin-6-0389f24.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/b12ca55a0ddf/cin-6-0389f25.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/488ad1300317/cin-6-0389f26.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/61f40a04f201/cin-6-0389f27.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/4ce9fd454ab5/cin-6-0389f28.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/67a81da8dbd4/cin-6-0389f29.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/abb29ca6bc93/cin-6-0389f30.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/3e0edf5dd37d/cin-6-0389f31.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/12960f64cc62/cin-6-0389f32.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/6b55336eedd5/cin-6-0389f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/b963b9565b6f/cin-6-0389f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/ba2bcc16776e/cin-6-0389f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/49801e618613/cin-6-0389f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/1c637a96c27e/cin-6-0389f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/52928e9b0205/cin-6-0389f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/46a86f7d7a14/cin-6-0389f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/bfbeeac3bd55/cin-6-0389f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/2a07bf77a2f6/cin-6-0389f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/e3e8486f6297/cin-6-0389f10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/65da/2623303/fdf4934f6f34/cin-6-0389f11.jpg

相似文献

1
Efficiency analysis of competing tests for finding differentially expressed genes in lung adenocarcinoma.在肺腺癌中寻找差异表达基因的竞争性检测方法的效率分析
Cancer Inform. 2008;6:389-421. doi: 10.4137/cin.s791. Epub 2008 Jul 14.
2
caGEDA: a web application for the integrated analysis of global gene expression patterns in cancer.caGEDA:一个用于癌症中全球基因表达模式综合分析的网络应用程序。
Appl Bioinformatics. 2004;3(1):49-62. doi: 10.2165/00822942-200403010-00007.
3
Identification of suitable genes contributes to lung adenocarcinoma clustering by multiple meta-analysis methods.通过多种荟萃分析方法鉴定合适的基因有助于肺腺癌聚类。
Clin Respir J. 2016 Sep;10(5):631-46. doi: 10.1111/crj.12271. Epub 2015 Mar 3.
4
Retrospective analysis: reproducibility of interblastomere differences of mRNA expression in 2-cell stage mouse embryos is remarkably poor due to combinatorial mechanisms of blastomere diversification.回顾性分析:由于囊胚细胞多样化的组合机制,2 细胞期小鼠胚胎中 mRNA 表达的卵裂球间差异的可重复性极差。
Mol Hum Reprod. 2018 Jul 1;24(7):388-400. doi: 10.1093/molehr/gay021.
5
Empirical Bayes screening of many p-values with applications to microarray studies.用于微阵列研究的多p值经验贝叶斯筛选。
Bioinformatics. 2005 May 1;21(9):1987-94. doi: 10.1093/bioinformatics/bti301. Epub 2005 Feb 2.
6
A weighted average difference method for detecting differentially expressed genes from microarray data.一种用于从微阵列数据中检测差异表达基因的加权平均差异法。
Algorithms Mol Biol. 2008 Jun 26;3:8. doi: 10.1186/1748-7188-3-8.
7
MetTailor: dynamic block summary and intensity normalization for robust analysis of mass spectrometry data in metabolomics.MetTailor:用于代谢组学中质谱数据分析的动态块摘要和强度归一化,以实现稳健分析。
Bioinformatics. 2015 Nov 15;31(22):3645-52. doi: 10.1093/bioinformatics/btv434. Epub 2015 Jul 27.
8
Finding differentially expressed genes in two-channel DNA microarray datasets: how to increase reliability of data preprocessing.在双通道DNA微阵列数据集中寻找差异表达基因:如何提高数据预处理的可靠性。
OMICS. 2008 Sep;12(3):171-82. doi: 10.1089/omi.2008.0032.
9
An empirical bayes adjustment to increase the sensitivity of detecting differentially expressed genes in microarray experiments.一种经验贝叶斯调整方法,用于提高微阵列实验中检测差异表达基因的灵敏度。
Bioinformatics. 2004 Jan 22;20(2):235-42. doi: 10.1093/bioinformatics/btg396.
10
A comparative study of different machine learning methods on microarray gene expression data.不同机器学习方法对微阵列基因表达数据的比较研究。
BMC Genomics. 2008;9 Suppl 1(Suppl 1):S13. doi: 10.1186/1471-2164-9-S1-S13.

引用本文的文献

1
Analysis of the Manganese and MntR Regulon in Corynebacterium diphtheriae.分析白喉棒状杆菌中的锰和 MntR 调控子。
J Bacteriol. 2021 Sep 23;203(20):e0027421. doi: 10.1128/JB.00274-21. Epub 2021 Aug 9.
2
Prediction of severity and subtype of fibrosing disease using model informed by inflammation and extracellular matrix gene index.基于炎症和细胞外基质基因指标模型预测纤维化疾病的严重程度和亚型。
PLoS One. 2020 Oct 23;15(10):e0240986. doi: 10.1371/journal.pone.0240986. eCollection 2020.
3
Transcriptome-Based Modeling Reveals that Oxidative Stress Induces Modulation of the AtfA-Dependent Signaling Networks in .

本文引用的文献

1
Assessing the statistical significance of the achieved classification error of classifiers constructed using serum peptide profiles, and a prescription for random sampling repeated studies for massive high-throughput genomic and proteomic studies.评估使用血清肽谱构建的分类器所实现的分类误差的统计显著性,以及针对大规模高通量基因组和蛋白质组研究进行随机抽样重复研究的方案。
Cancer Inform. 2005;1(1):53-77.
2
Non-overlapping and non-cell-type-specific gene expression signatures predict lung cancer survival.非重叠且非细胞类型特异性的基因表达特征可预测肺癌生存率。
J Clin Oncol. 2008 Feb 20;26(6):877-83. doi: 10.1200/JCO.2007.13.1516.
3
基于转录组的建模揭示氧化应激诱导了. 中AtfA依赖性信号网络的调控
Int J Genomics. 2017;2017:6923849. doi: 10.1155/2017/6923849. Epub 2017 Jul 9.
4
HSPs drive dichotomous T-cell immune responses via DNA methylome remodelling in antigen presenting cells.热休克蛋白通过抗原呈递细胞中的 DNA 甲基组重塑来驱动二态 T 细胞免疫应答。
Nat Commun. 2017 May 31;8:15648. doi: 10.1038/ncomms15648.
5
Epigenetic and Transcriptomic Regulation of Lung Repair during Recovery from Influenza Infection.流感感染恢复过程中肺修复的表观遗传和转录组调控
Am J Pathol. 2017 Apr;187(4):851-863. doi: 10.1016/j.ajpath.2016.12.012. Epub 2017 Feb 10.
6
DNA methylation in the pathophysiology of hyperphenylalaninemia in the PAH(enu2) mouse model of phenylketonuria.苯丙酮尿症PAH(enu2)小鼠模型中高苯丙氨酸血症病理生理学中的DNA甲基化
Mol Genet Metab. 2016 Sep;119(1-2):1-7. doi: 10.1016/j.ymgme.2016.01.001. Epub 2016 Jan 14.
7
Expression of three topologically distinct membrane proteins elicits unique stress response pathways in the yeast Saccharomyces cerevisiae.三种拓扑结构不同的膜蛋白的表达在酿酒酵母中引发独特的应激反应途径。
Physiol Genomics. 2015 Jun;47(6):198-214. doi: 10.1152/physiolgenomics.00101.2014. Epub 2015 Mar 10.
8
Differential hippocampal gene expression and pathway analysis in an etiology-based mouse model of major depressive disorder.基于病因的重度抑郁症小鼠模型中海马基因表达差异及通路分析
Am J Med Genet B Neuropsychiatr Genet. 2014 Sep;165B(6):457-66. doi: 10.1002/ajmg.b.32257. Epub 2014 Jul 25.
9
Beyond the genome: epigenetic mechanisms in lung remodeling.超越基因组:肺重塑中的表观遗传机制。
Physiology (Bethesda). 2014 May;29(3):177-85. doi: 10.1152/physiol.00048.2013.
10
Environmental enrichment as a viable neurorehabilitation strategy for experimental traumatic brain injury.环境丰富作为实验性创伤性脑损伤的可行神经康复策略。
J Neurotrauma. 2014 May 15;31(10):873-88. doi: 10.1089/neu.2014.3328. Epub 2014 Apr 17.
Three-gene prognostic classifier for early-stage non small-cell lung cancer.
早期非小细胞肺癌的三基因预后分类器
J Clin Oncol. 2007 Dec 10;25(35):5562-9. doi: 10.1200/JCO.2007.12.0352.
4
Survival prediction of stage I lung adenocarcinomas by expression of 10 genes.通过10种基因的表达对Ⅰ期肺腺癌进行生存预测
J Clin Invest. 2007 Nov;117(11):3436-44. doi: 10.1172/JCI32007.
5
A five-gene signature and clinical outcome in non-small-cell lung cancer.非小细胞肺癌中的五基因特征与临床结局
N Engl J Med. 2007 Jan 4;356(1):11-20. doi: 10.1056/NEJMoa060096.
6
A gene expression signature predicts survival of patients with stage I non-small cell lung cancer.一种基因表达特征可预测I期非小细胞肺癌患者的生存率。
PLoS Med. 2006 Dec;3(12):e467. doi: 10.1371/journal.pmed.0030467.
7
Rat toxicogenomic study reveals analytical consistency across microarray platforms.大鼠毒理基因组学研究揭示了不同微阵列平台间的分析一致性。
Nat Biotechnol. 2006 Sep;24(9):1162-9. doi: 10.1038/nbt1238.
8
caGEDA: a web application for the integrated analysis of global gene expression patterns in cancer.caGEDA:一个用于癌症中全球基因表达模式综合分析的网络应用程序。
Appl Bioinformatics. 2004;3(1):49-62. doi: 10.2165/00822942-200403010-00007.
9
Prediction of lymph node metastasis by analysis of gene expression profiles in primary lung adenocarcinomas.通过分析原发性肺腺癌的基因表达谱预测淋巴结转移
Clin Cancer Res. 2005 Jun 1;11(11):4128-35. doi: 10.1158/1078-0432.CCR-04-2525.
10
Preferred analysis methods for Affymetrix GeneChips revealed by a wholly defined control dataset.由完全定义的对照数据集揭示的Affymetrix基因芯片的首选分析方法。
Genome Biol. 2005;6(2):R16. doi: 10.1186/gb-2005-6-2-r16. Epub 2005 Jan 28.