• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于半学生 t 检验检测异质疾病中的差异表达基因。

Detecting differentially expressed genes in heterogeneous diseases using half Student's t-test.

机构信息

Research Centre for Genes, Environment and Human Health, College of Public Health, National Taiwan University, Taipei, Taiwan ROC.

出版信息

Int J Epidemiol. 2010 Dec;39(6):1597-604. doi: 10.1093/ije/dyq093. Epub 2010 Jun 2.

DOI:10.1093/ije/dyq093
PMID:20519335
Abstract

BACKGROUND

Microarray technology provides information about hundreds and thousands of gene-expression data in a single experiment. To search for disease-related genes, researchers test for those genes that are differentially expressed between the case subjects and the control subjects.

METHODS

The authors propose a new test, the 'half Student's t-test', specifically for detecting differentially expressed genes in heterogeneous diseases. Monte-Carlo simulation shows that the test maintains the nominal α level quite well for both normal and non-normal distributions. Power of the half Student's t is higher than that of the conventional 'pooled' Student's t when there is heterogeneity in the disease under study. The power gain by using the half Student's t can reach ∼10% when the standard deviation of the case group is 50% larger than that of the control group.

RESULTS

Application to a colon cancer data reveals that when the false discovery rate (FDR) is controlled at 0.05, the half Student's t can detect 344 differentially expressed genes, whereas the pooled Student's t can detect only 65 genes. Or alternatively, if only 50 genes are to be selected, the FDR for the pooled Student's t has to be set at 0.0320 (false positive rate of ∼3%), but for the half Student's t, it can be at as low as 0.0001 (false positive rate of about one per ten thousands).

CONCLUSIONS

The half Student's t-test is to be recommended for the detection of differentially expressed genes in heterogeneous diseases.

摘要

背景

微阵列技术在单次实验中提供了数百甚至数千个基因表达数据的信息。为了寻找与疾病相关的基因,研究人员测试那些在病例组和对照组之间表达差异的基因。

方法

作者提出了一种新的测试方法,即“半学生 t 检验”,专门用于检测异质性疾病中的差异表达基因。蒙特卡罗模拟表明,该检验在正态和非正态分布下都能很好地保持名义α水平。当研究中的疾病存在异质性时,半学生 t 的功效高于传统的“合并”学生 t。当病例组的标准差比对照组大 50%时,使用半学生 t 的功效增益可达约 10%。

结果

应用于结肠癌数据表明,当控制错误发现率(FDR)为 0.05 时,半学生 t 可以检测到 344 个差异表达基因,而合并学生 t 只能检测到 65 个基因。或者,如果只选择 50 个基因,合并学生 t 的 FDR 必须设置为 0.0320(假阳性率约为 3%),但对半学生 t 而言,它可以低至 0.0001(假阳性率约为万分之一)。

结论

建议在异质性疾病中使用半学生 t 检验来检测差异表达基因。

相似文献

1
Detecting differentially expressed genes in heterogeneous diseases using half Student's t-test.基于半学生 t 检验检测异质疾病中的差异表达基因。
Int J Epidemiol. 2010 Dec;39(6):1597-604. doi: 10.1093/ije/dyq093. Epub 2010 Jun 2.
2
Detecting differentially expressed genes in heterogeneous diseases using control-only analysis of variance.使用仅对照方差分析检测异质疾病中的差异表达基因。
Ann Epidemiol. 2012 Aug;22(8):598-602. doi: 10.1016/j.annepidem.2012.04.017. Epub 2012 May 31.
3
Improved statistical tests for differential gene expression by shrinking variance components estimates.通过收缩方差分量估计改进差异基因表达的统计检验。
Biostatistics. 2005 Jan;6(1):59-75. doi: 10.1093/biostatistics/kxh018.
4
Comparison of seven methods for producing Affymetrix expression scores based on False Discovery Rates in disease profiling data.基于疾病谱数据中错误发现率的七种生成Affymetrix表达分数方法的比较。
BMC Bioinformatics. 2005 Feb 10;6:26. doi: 10.1186/1471-2105-6-26.
5
On the identification of differentially expressed genes: improving the generalized F-statistics for Affymetrix microarray gene expression data.关于差异表达基因的鉴定:改进用于Affymetrix微阵列基因表达数据的广义F统计量
Comput Biol Chem. 2006 Oct;30(5):321-6. doi: 10.1016/j.compbiolchem.2006.06.002. Epub 2006 Sep 18.
6
Detecting differentially expressed genes by relative entropy.通过相对熵检测差异表达基因。
J Theor Biol. 2005 Jun 7;234(3):395-402. doi: 10.1016/j.jtbi.2004.11.039. Epub 2005 Jan 24.
7
A new efficient statistical test for detecting variability in the gene expression data.一种用于检测基因表达数据变异性的新型高效统计检验方法。
Stat Methods Med Res. 2008 Aug;17(4):405-19. doi: 10.1177/0962280206078643. Epub 2007 Aug 14.
8
Evaluation of a statistical equivalence test applied to microarray data.应用于微阵列数据的统计等效性检验的评估。
J Biopharm Stat. 2010 Mar;20(2):240-66. doi: 10.1080/10543400903572738.
9
A new statistical screening approach for finding pharmacokinetics-related genes in genome-wide studies.一种在全基因组研究中寻找与药代动力学相关基因的新型统计筛选方法。
Pharmacogenomics J. 2009 Apr;9(2):137-46. doi: 10.1038/tpj.2008.17. Epub 2008 Dec 23.
10
Detecting SNP-expression associations: a comparison of mutual information and median test with standard statistical approaches.检测 SNP 表达关联:互信息和中位数检验与标准统计方法的比较。
Stat Med. 2009 Dec 20;28(29):3581-96. doi: 10.1002/sim.3695.

引用本文的文献

1
mA demethylation of FOSL1 mRNA protects hepatoma cells against necrosis under glucose deprivation.mA 去甲基化 FOSL1 mRNA 可保护肝癌细胞免受葡萄糖剥夺引起的坏死。
Cell Death Differ. 2024 Aug;31(8):1029-1043. doi: 10.1038/s41418-024-01308-3. Epub 2024 May 18.
2
Identification of mitophagy-related biomarkers in human osteoporosis based on a machine learning model.基于机器学习模型的人类骨质疏松症中自噬相关生物标志物的鉴定
Front Physiol. 2024 Jan 8;14:1289976. doi: 10.3389/fphys.2023.1289976. eCollection 2023.
3
Modified Significance Analysis of Microarrays in Heterogeneous Diseases.
异质性疾病中微阵列的改良显著性分析
J Pers Med. 2021 Jan 20;11(2):62. doi: 10.3390/jpm11020062.
4
Expression and prognostic analyses of early growth response proteins (EGRs) in human breast carcinoma based on database analysis.基于数据库分析的人乳腺癌早期生长反应蛋白(EGRs)的表达及预后分析
PeerJ. 2019 Dec 9;7:e8183. doi: 10.7717/peerj.8183. eCollection 2019.
5
Expression patterns and the prognostic value of the SMYD family members in human breast carcinoma using integrative bioinformatics analysis.利用整合生物信息学分析SMYD家族成员在人乳腺癌中的表达模式及预后价值。
Oncol Lett. 2019 Apr;17(4):3851-3861. doi: 10.3892/ol.2019.10054. Epub 2019 Feb 19.
6
Analysis of mechanical ventilation and lipopolysaccharide‑induced acute lung injury using DNA microarray analysis.使用DNA微阵列分析对机械通气和脂多糖诱导的急性肺损伤进行分析。
Mol Med Rep. 2015 Jun;11(6):4239-45. doi: 10.3892/mmr.2015.3335. Epub 2015 Feb 11.
7
Predicting breast cancer using an expression values weighted clinical classifier.使用表达值加权临床分类器预测乳腺癌。
BMC Bioinformatics. 2014 Dec 31;15(1):411. doi: 10.1186/s12859-014-0411-1.
8
New bandwidth selection criterion for Kernel PCA: approach to dimensionality reduction and classification problems.核主成分分析的新带宽选择准则:降维和分类问题的方法。
BMC Bioinformatics. 2014 May 10;15:137. doi: 10.1186/1471-2105-15-137.