• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

纵向基因表达数据的统计学显著性分析。

Statistical significance analysis of longitudinal gene expression data.

作者信息

Guo Xu, Qi Huilin, Verfaillie Catherine M, Pan Wei

机构信息

Division of Biostatistics, School of Public Health, University of Minnesota, A460 Mayo Building, MMC 303, Minneapolis, MN 55455-0378, USA.

出版信息

Bioinformatics. 2003 Sep 1;19(13):1628-35. doi: 10.1093/bioinformatics/btg206.

DOI:10.1093/bioinformatics/btg206
PMID:12967958
Abstract

MOTIVATION

Time-course microarray experiments are designed to study biological processes in a temporal fashion. Longitudinal gene expression data arise when biological samples taken from the same subject at different time points are used to measure the gene expression levels. It has been observed that the gene expression patterns of samples of a given tumor measured at different time points are likely to be much more similar to each other than are the expression patterns of tumor samples of the same type taken from different subjects. In statistics, this phenomenon is called the within-subject correlation of repeated measurements on the same subject, and the resulting data are called longitudinal data. It is well known in other applications that valid statistical analyses have to appropriately take account of the possible within-subject correlation in longitudinal data.

RESULTS

We apply estimating equation techniques to construct a robust statistic, which is a variant of the robust Wald statistic and accounts for the potential within-subject correlation of longitudinal gene expression data, to detect genes with temporal changes in expression. We associate significance levels to the proposed statistic by either incorporating the idea of the significance analysis of microarrays method or using the mixture model method to identify significant genes. The utility of the statistic is demonstrated by applying it to an important study of osteoblast lineage-specific differentiation. Using simulated data, we also show pitfalls in drawing statistical inference when the within-subject correlation in longitudinal gene expression data is ignored.

摘要

动机

时间进程微阵列实验旨在以时间方式研究生物过程。当在不同时间点从同一受试者采集的生物样本用于测量基因表达水平时,就会产生纵向基因表达数据。据观察,在不同时间点测量的给定肿瘤样本的基因表达模式彼此之间可能比从不同受试者采集的相同类型肿瘤样本的表达模式更为相似。在统计学中,这种现象被称为同一受试者重复测量的受试者内相关性,由此产生的数据被称为纵向数据。在其他应用中众所周知,有效的统计分析必须适当考虑纵向数据中可能存在的受试者内相关性。

结果

我们应用估计方程技术构建一个稳健统计量,它是稳健Wald统计量的一种变体,考虑了纵向基因表达数据潜在的受试者内相关性,以检测表达随时间变化的基因。我们通过纳入微阵列方法的显著性分析思想或使用混合模型方法来识别显著基因,将显著性水平与所提出的统计量相关联。通过将该统计量应用于成骨细胞谱系特异性分化的一项重要研究,证明了其效用。使用模拟数据,我们还展示了在忽略纵向基因表达数据中的受试者内相关性时进行统计推断的陷阱。

相似文献

1
Statistical significance analysis of longitudinal gene expression data.纵向基因表达数据的统计学显著性分析。
Bioinformatics. 2003 Sep 1;19(13):1628-35. doi: 10.1093/bioinformatics/btg206.
2
Noise sampling method: an ANOVA approach allowing robust selection of differentially regulated genes measured by DNA microarrays.噪声采样方法:一种方差分析方法,可用于通过DNA微阵列测量的差异调节基因的稳健选择。
Bioinformatics. 2003 Jul 22;19(11):1348-59. doi: 10.1093/bioinformatics/btg165.
3
Statistical analysis of high-density oligonucleotide arrays: a multiplicative noise model.高密度寡核苷酸阵列的统计分析:一种乘性噪声模型。
Bioinformatics. 2002 Dec;18(12):1633-40. doi: 10.1093/bioinformatics/18.12.1633.
4
Statistical tests for identifying differentially expressed genes in time-course microarray experiments.用于在时间进程微阵列实验中识别差异表达基因的统计测试。
Bioinformatics. 2003 Apr 12;19(6):694-703. doi: 10.1093/bioinformatics/btg068.
5
Comparisons and validation of statistical clustering techniques for microarray gene expression data.微阵列基因表达数据统计聚类技术的比较与验证
Bioinformatics. 2003 Mar 1;19(4):459-66. doi: 10.1093/bioinformatics/btg025.
6
Bayesian hierarchical error model for analysis of gene expression data.用于基因表达数据分析的贝叶斯分层误差模型。
Bioinformatics. 2004 Sep 1;20(13):2016-25. doi: 10.1093/bioinformatics/bth192. Epub 2004 Mar 25.
7
Using weighted permutation scores to detect differential gene expression with microarray data.使用加权排列分数通过微阵列数据检测差异基因表达。
J Bioinform Comput Biol. 2005 Aug;3(4):989-1006. doi: 10.1142/s021972000500134x.
8
Identifying periodically expressed transcripts in microarray time series data.在微阵列时间序列数据中识别周期性表达的转录本。
Bioinformatics. 2004 Jan 1;20(1):5-20. doi: 10.1093/bioinformatics/btg364.
9
Statistical analysis of a small set of time-ordered gene expression data using linear splines.
Bioinformatics. 2002 Nov;18(11):1477-85. doi: 10.1093/bioinformatics/18.11.1477.
10
Robust estimators for expression analysis.用于表达分析的稳健估计量。
Bioinformatics. 2002 Dec;18(12):1585-92. doi: 10.1093/bioinformatics/18.12.1585.

引用本文的文献

1
Pathway testing for longitudinal metabolomics.纵向代谢组学的途径检测。
Stat Med. 2021 Jun 15;40(13):3053-3065. doi: 10.1002/sim.8957. Epub 2021 Mar 26.
2
Time-Course Gene Set Analysis for Longitudinal Gene Expression Data.纵向基因表达数据的时间进程基因集分析
PLoS Comput Biol. 2015 Jun 25;11(6):e1004310. doi: 10.1371/journal.pcbi.1004310. eCollection 2015 Jun.
3
The analytical landscape of static and temporal dynamics in transcriptome data.转录组数据中静态和时间动态的分析格局。
Front Genet. 2014 Feb 20;5:35. doi: 10.3389/fgene.2014.00035. eCollection 2014.
4
Gene set analysis for longitudinal gene expression data.基因集分析用于纵向基因表达数据。
BMC Bioinformatics. 2011 Jul 3;12:273. doi: 10.1186/1471-2105-12-273.
5
Statistical methods for integrating multiple types of high-throughput data.整合多种类型高通量数据的统计方法。
Methods Mol Biol. 2010;620:511-29. doi: 10.1007/978-1-60761-580-4_19.
6
An improved empirical bayes approach to estimating differential gene expression in microarray time-course data: BETR (Bayesian Estimation of Temporal Regulation).一种改进的经验贝叶斯方法,用于估计微阵列时间序列数据中的差异基因表达:BETR(时间调节的贝叶斯估计)。
BMC Bioinformatics. 2009 Dec 10;10:409. doi: 10.1186/1471-2105-10-409.
7
Functional assessment of time course microarray data.时间进程微阵列数据的功能评估
BMC Bioinformatics. 2009 Jun 16;10 Suppl 6(Suppl 6):S9. doi: 10.1186/1471-2105-10-S6-S9.
8
Term-tissue specific models for prediction of gene ontology biological processes using transcriptional profiles of aging in drosophila melanogaster.使用黑腹果蝇衰老转录谱预测基因本体生物学过程的组织特异性模型。
BMC Bioinformatics. 2008 Feb 28;9:129. doi: 10.1186/1471-2105-9-129.
9
Post hoc pattern matching: assigning significance to statistically defined expression patterns in single channel microarray data.事后模式匹配:赋予单通道微阵列数据中统计定义的表达模式以显著性
BMC Bioinformatics. 2007 Jul 5;8:240. doi: 10.1186/1471-2105-8-240.
10
A mixture model approach to detecting differentially expressed genes with microarray data.一种利用微阵列数据检测差异表达基因的混合模型方法。
Funct Integr Genomics. 2003 Jul;3(3):117-24. doi: 10.1007/s10142-003-0085-7. Epub 2003 Jul 1.