• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在存在异方差的情况下,用于检验基因表达分析中交互项的广义收缩 F 似然统计量。

Generalized shrinkage F-like statistics for testing an interaction term in gene expression analysis in the presence of heteroscedasticity.

机构信息

Department of Preventive Medicine, Stony Brook University, Stony Brook, NY 11794, USA.

出版信息

BMC Bioinformatics. 2011 Nov 1;12:427. doi: 10.1186/1471-2105-12-427.

DOI:10.1186/1471-2105-12-427
PMID:22044602
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3221690/
Abstract

BACKGROUND

Many analyses of gene expression data involve hypothesis tests of an interaction term between two fixed effects, typically tested using a residual variance. In expression studies, the issue of variance heteroscedasticity has received much attention, and previous work has focused on either between-gene or within-gene heteroscedasticity. However, in a single experiment, heteroscedasticity may exist both within and between genes. Here we develop flexible shrinkage error estimators considering both between-gene and within-gene heteroscedasticity and use them to construct F-like test statistics for testing interactions, with cutoff values obtained by permutation. These permutation tests are complicated, and several permutation tests are investigated here.

RESULTS

Our proposed test statistics are compared with other existing shrinkage-type test statistics through extensive simulation studies and a real data example. The results show that the choice of permutation procedures has dramatically more influence on detection power than the choice of F or F-like test statistics. When both types of gene heteroscedasticity exist, our proposed test statistics can control preselected type-I errors and are more powerful. Raw data permutation is not valid in this setting. Whether unrestricted or restricted residual permutation should be used depends on the specific type of test statistic.

CONCLUSIONS

The F-like test statistic that uses the proposed flexible shrinkage error estimator considering both types of gene heteroscedasticity and unrestricted residual permutation can provide a statistically valid and powerful test. Therefore, we recommended that it should always applied in the analysis of real gene expression data analysis to test an interaction term.

摘要

背景

许多基因表达数据分析都涉及对两个固定效应之间的交互项的假设检验,通常使用残差方差进行检验。在表达研究中,方差异方差性问题受到了广泛关注,先前的工作主要集中在基因间或基因内异方差性上。然而,在单个实验中,基因内和基因间可能存在异方差性。在这里,我们开发了考虑基因间和基因内异方差性的灵活收缩误差估计器,并使用它们构建用于检验交互作用的 F 类似检验统计量,使用置换获得截断值。这些置换检验很复杂,这里研究了几种置换检验。

结果

通过广泛的模拟研究和一个真实数据示例,将我们提出的检验统计量与其他现有的收缩型检验统计量进行比较。结果表明,置换程序的选择对检测能力的影响远远大于 F 或 F 类似检验统计量的选择。当存在两种基因异方差性时,我们提出的检验统计量可以控制预先选择的 I 型错误,并且更有效。在这种情况下,原始数据置换是无效的。是否应使用无限制或受限残差置换取决于特定的检验统计量类型。

结论

使用考虑两种基因异方差性和无限制残差置换的灵活收缩误差估计器的 F 类似检验统计量可以提供一种具有统计学意义和强大的检验方法。因此,我们建议在分析真实基因表达数据时,应始终应用它来检验交互项。

相似文献

1
Generalized shrinkage F-like statistics for testing an interaction term in gene expression analysis in the presence of heteroscedasticity.在存在异方差的情况下,用于检验基因表达分析中交互项的广义收缩 F 似然统计量。
BMC Bioinformatics. 2011 Nov 1;12:427. doi: 10.1186/1471-2105-12-427.
2
Construction of null statistics in permutation-based multiple testing for multi-factorial microarray experiments.基于排列的多因素微阵列实验多重检验中零统计量的构建。
Bioinformatics. 2006 Jun 15;22(12):1486-94. doi: 10.1093/bioinformatics/btl109. Epub 2006 Mar 30.
3
Robust nonparametric tests of general linear model coefficients: A comparison of permutation methods and test statistics.稳健的一般线性模型系数的非参数检验:置换方法和检验统计量的比较。
Neuroimage. 2019 Nov 1;201:116030. doi: 10.1016/j.neuroimage.2019.116030. Epub 2019 Jul 19.
4
To permute or not to permute.是否进行置换。
Bioinformatics. 2006 Sep 15;22(18):2244-8. doi: 10.1093/bioinformatics/btl383. Epub 2006 Jul 26.
5
Monte Carlo simulation of OLS and linear mixed model inference of phenotypic effects on gene expression.基于普通最小二乘法(OLS)和线性混合模型的表型对基因表达影响推断的蒙特卡罗模拟
PeerJ. 2016 Oct 11;4:e2575. doi: 10.7717/peerj.2575. eCollection 2016.
6
Estimating p-values in small microarray experiments.在小型微阵列实验中估计p值。
Bioinformatics. 2007 Jan 1;23(1):38-43. doi: 10.1093/bioinformatics/btl548. Epub 2006 Oct 30.
7
Improved statistical tests for differential gene expression by shrinking variance components estimates.通过收缩方差分量估计改进差异基因表达的统计检验。
Biostatistics. 2005 Jan;6(1):59-75. doi: 10.1093/biostatistics/kxh018.
8
Two-part permutation tests for DNA methylation and microarray data.针对DNA甲基化和微阵列数据的两部分排列检验
BMC Bioinformatics. 2005 Feb 22;6:35. doi: 10.1186/1471-2105-6-35.
9
Permutation and parametric bootstrap tests for gene-gene and gene-environment interactions.基因-基因和基因-环境相互作用的排列检验和参数自抽样检验
Ann Hum Genet. 2011 Jan;75(1):36-45. doi: 10.1111/j.1469-1809.2010.00572.x.
10
A non-parametric statistic for testing conditional heteroscedasticity for unobserved component models.一种用于检验未观测成分模型条件异方差性的非参数统计量。
J Appl Stat. 2020 Feb 25;48(3):471-497. doi: 10.1080/02664763.2020.1732885. eCollection 2021.

本文引用的文献

1
Sex-specific and lineage-specific alternative splicing in primates.灵长类动物中性别特异性和谱系特异性的可变剪接。
Genome Res. 2010 Feb;20(2):180-9. doi: 10.1101/gr.099226.109. Epub 2009 Dec 15.
2
Global analysis of allele-specific expression in Arabidopsis thaliana.拟南芥基因座特异性表达的全局分析。
Genetics. 2009 Aug;182(4):943-54. doi: 10.1534/genetics.109.103499. Epub 2009 May 27.
3
Global analysis of genetic, epigenetic and transcriptional polymorphisms in Arabidopsis thaliana using whole genome tiling arrays.利用全基因组平铺阵列对拟南芥的遗传、表观遗传和转录多态性进行全局分析。
PLoS Genet. 2008 Mar 21;4(3):e1000032. doi: 10.1371/journal.pgen.1000032.
4
Naive application of permutation testing leads to inflated type I error rates.单纯应用置换检验会导致第一类错误率膨胀。
Genetics. 2008 Jan;178(1):609-10. doi: 10.1534/genetics.107.074609.
5
Sequentially testing for a gene-drug interaction in a genomewide analysis.在全基因组分析中对基因-药物相互作用进行序贯检验。
Stat Med. 2008 May 20;27(11):2022-34. doi: 10.1002/sim.3059.
6
A structural mixed model for variances in differential gene expression studies.差异基因表达研究中方差的结构混合模型。
Genet Res. 2007 Feb;89(1):19-25. doi: 10.1017/S0016672307008646.
7
Genetic diversity contribution to errors in short oligonucleotide microarray analysis.遗传多样性对短寡核苷酸微阵列分析中误差的影响
Plant Biotechnol J. 2006 Sep;4(5):489-98. doi: 10.1111/j.1467-7652.2006.00198.x.
8
Sex-specific expression of alternative transcripts in Drosophila.果蝇中可变转录本的性别特异性表达。
Genome Biol. 2006;7(8):R79. doi: 10.1186/gb-2006-7-8-R79. Epub 2006 Aug 25.
9
Linear models and empirical bayes methods for assessing differential expression in microarray experiments.用于评估微阵列实验中差异表达的线性模型和经验贝叶斯方法。
Stat Appl Genet Mol Biol. 2004;3:Article3. doi: 10.2202/1544-6115.1027. Epub 2004 Feb 12.
10
Comparison of various statistical methods for identifying differential gene expression in replicated microarray data.用于识别重复微阵列数据中差异基因表达的各种统计方法的比较。
Stat Methods Med Res. 2006 Feb;15(1):3-20. doi: 10.1191/0962280206sm423oa.