• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于有序样本的基因表达数据的广义相关度量的计数统计

Generalized correlation measure using count statistics for gene expression data with ordered samples.

机构信息

School of Mathematics and Statistics, University of Sydney, NSW 2006, Australia.

Department of Statistics, University of California, Berkeley, CA 94720, USA.

出版信息

Bioinformatics. 2018 Feb 15;34(4):617-624. doi: 10.1093/bioinformatics/btx641.

DOI:10.1093/bioinformatics/btx641
PMID:29040382
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5860612/
Abstract

MOTIVATION

Capturing association patterns in gene expression levels under different conditions or time points is important for inferring gene regulatory interactions. In practice, temporal changes in gene expression may result in complex association patterns that require more sophisticated detection methods than simple correlation measures. For instance, the effect of regulation may lead to time-lagged associations and interactions local to a subset of samples. Furthermore, expression profiles of interest may not be aligned or directly comparable (e.g. gene expression profiles from two species).

RESULTS

We propose a count statistic for measuring association between pairs of gene expression profiles consisting of ordered samples (e.g. time-course), where correlation may only exist locally in subsequences separated by a position shift. The statistic is simple and fast to compute, and we illustrate its use in two applications. In a cross-species comparison of developmental gene expression levels, we show our method not only measures association of gene expressions between the two species, but also provides alignment between different developmental stages. In the second application, we applied our statistic to expression profiles from two distinct phenotypic conditions, where the samples in each profile are ordered by the associated phenotypic values. The detected associations can be useful in building correspondence between gene association networks under different phenotypes. On the theoretical side, we provide asymptotic distributions of the statistic for different regions of the parameter space and test its power on simulated data.

AVAILABILITY AND IMPLEMENTATION

The code used to perform the analysis is available as part of the Supplementary Material.

CONTACT

msw@usc.edu or hhuang@stat.berkeley.edu.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

捕捉不同条件或时间点下基因表达水平的关联模式对于推断基因调控相互作用非常重要。实际上,基因表达的时间变化可能导致复杂的关联模式,需要比简单的相关度量更复杂的检测方法。例如,调节的影响可能导致时间滞后的关联和局部到样本子集的相互作用。此外,感兴趣的表达谱可能没有对齐或直接可比(例如,来自两个物种的基因表达谱)。

结果

我们提出了一种用于测量由有序样本(例如时间序列)组成的基因表达谱对之间关联的计数统计量,其中相关性仅可能在由位置偏移分隔的子序列中局部存在。该统计量计算简单且快速,我们在两个应用中说明了其使用。在跨物种发育基因表达水平的比较中,我们表明我们的方法不仅测量了两个物种之间基因表达的关联,而且还提供了不同发育阶段之间的对齐。在第二个应用中,我们将我们的统计量应用于两个不同表型条件的表达谱,其中每个谱中的样本按相关表型值进行排序。检测到的关联可用于在不同表型下构建基因关联网络之间的对应关系。在理论方面,我们为不同参数空间区域的统计量提供了渐近分布,并在模拟数据上测试了其功效。

可用性和实现

执行分析所用的代码可作为补充材料的一部分获得。

联系人

msw@usc.edu 或 hhuang@stat.berkeley.edu。

补充信息

补充数据可在生物信息学在线获得。

相似文献

1
Generalized correlation measure using count statistics for gene expression data with ordered samples.基于有序样本的基因表达数据的广义相关度量的计数统计
Bioinformatics. 2018 Feb 15;34(4):617-624. doi: 10.1093/bioinformatics/btx641.
2
VCNet: vector-based gene co-expression network construction and its application to RNA-seq data.VCNet:基于向量的基因共表达网络构建及其在 RNA-seq 数据中的应用。
Bioinformatics. 2017 Jul 15;33(14):2173-2181. doi: 10.1093/bioinformatics/btx131.
3
LEAP: constructing gene co-expression networks for single-cell RNA-sequencing data using pseudotime ordering.LEAP:使用伪时间排序为单细胞RNA测序数据构建基因共表达网络。
Bioinformatics. 2017 Mar 1;33(5):764-766. doi: 10.1093/bioinformatics/btw729.
4
Differential regulation enrichment analysis via the integration of transcriptional regulatory network and gene expression data.通过整合转录调控网络和基因表达数据进行差异调控富集分析。
Bioinformatics. 2015 Feb 15;31(4):563-71. doi: 10.1093/bioinformatics/btu672. Epub 2014 Oct 15.
5
Gene coexpression measures in large heterogeneous samples using count statistics.使用计数统计量在大型异质样本中进行基因共表达测量。
Proc Natl Acad Sci U S A. 2014 Nov 18;111(46):16371-6. doi: 10.1073/pnas.1417128111. Epub 2014 Oct 6.
6
Splice Expression Variation Analysis (SEVA) for inter-tumor heterogeneity of gene isoform usage in cancer.用于分析癌症中基因异构体使用的肿瘤间异质性的剪接表达变异分析(SEVA)。
Bioinformatics. 2018 Jun 1;34(11):1859-1867. doi: 10.1093/bioinformatics/bty004.
7
EBSeq-HMM: a Bayesian approach for identifying gene-expression changes in ordered RNA-seq experiments.EBSeq-HMM:一种用于在有序RNA测序实验中识别基因表达变化的贝叶斯方法。
Bioinformatics. 2015 Aug 15;31(16):2614-22. doi: 10.1093/bioinformatics/btv193. Epub 2015 Apr 5.
8
ImpulseDE: detection of differentially expressed genes in time series data using impulse models.ImpulseDE:使用脉冲模型检测时间序列数据中的差异表达基因。
Bioinformatics. 2017 Mar 1;33(5):757-759. doi: 10.1093/bioinformatics/btw665.
9
Guidance for RNA-seq co-expression network construction and analysis: safety in numbers.RNA测序共表达网络构建与分析指南:数量带来的安全性
Bioinformatics. 2015 Jul 1;31(13):2123-30. doi: 10.1093/bioinformatics/btv118. Epub 2015 Feb 24.
10
Inferring data-specific micro-RNA function through the joint ranking of micro-RNA and pathways from matched micro-RNA and gene expression data.通过对匹配的微小RNA和基因表达数据中的微小RNA与信号通路进行联合排序来推断特定数据的微小RNA功能。
Bioinformatics. 2015 Sep 1;31(17):2822-8. doi: 10.1093/bioinformatics/btv220. Epub 2015 Apr 24.

引用本文的文献

1
Gene Co-Expression Network Tools and Databases for Crop Improvement.用于作物改良的基因共表达网络工具和数据库
Plants (Basel). 2022 Jun 21;11(13):1625. doi: 10.3390/plants11131625.
2
Network Modeling in Biology: Statistical Methods for Gene and Brain Networks.生物学中的网络建模:基因与大脑网络的统计方法
Stat Sci. 2021 Feb;36(1):89-108. doi: 10.1214/20-sts792.
3
Anisotropic Gaussian kernel adaptive filtering by Lie-group dictionary learning.基于李群字典学习的各向异性高斯核自适应滤波。
PLoS One. 2020 Aug 14;15(8):e0237654. doi: 10.1371/journal.pone.0237654. eCollection 2020.
4
TimeMeter assesses temporal gene expression similarity and identifies differentially progressing genes.TimeMeter 评估基因表达的时间相似性,并识别差异表达基因。
Nucleic Acids Res. 2020 May 21;48(9):e51. doi: 10.1093/nar/gkaa142.
5
GeneFishing to reconstruct context specific portraits of biological processes.基因钓取重建生物过程的上下文特异图谱。
Proc Natl Acad Sci U S A. 2019 Sep 17;116(38):18943-18950. doi: 10.1073/pnas.1820340116. Epub 2019 Sep 4.

本文引用的文献

1
Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2.使用DESeq2对RNA测序数据的倍数变化和离散度进行适度估计。
Genome Biol. 2014;15(12):550. doi: 10.1186/s13059-014-0550-8.
2
Gene coexpression measures in large heterogeneous samples using count statistics.使用计数统计量在大型异质样本中进行基因共表达测量。
Proc Natl Acad Sci U S A. 2014 Nov 18;111(46):16371-6. doi: 10.1073/pnas.1417128111. Epub 2014 Oct 6.
3
Reconstruction of gene co-expression network from microarray data using local expression patterns.基于局部表达模式从基因芯片数据中重建基因共表达网络。
BMC Bioinformatics. 2014;15 Suppl 7(Suppl 7):S10. doi: 10.1186/1471-2105-15-S7-S10. Epub 2014 May 28.
4
Identifying local co-regulation relationships in gene expression data.识别基因表达数据中的局部共调控关系。
J Theor Biol. 2014 Nov 7;360:200-207. doi: 10.1016/j.jtbi.2014.06.032. Epub 2014 Jul 18.
5
Comparison of D. melanogaster and C. elegans developmental stages, tissues, and cells by modENCODE RNA-seq data.通过modENCODE RNA测序数据对黑腹果蝇和秀丽隐杆线虫的发育阶段、组织和细胞进行比较。
Genome Res. 2014 Jul;24(7):1086-101. doi: 10.1101/gr.170100.113.
6
Efficient statistical significance approximation for local similarity analysis of high-throughput time series data.高通量时间序列数据局部相似性分析的高效统计显著性逼近。
Bioinformatics. 2013 Jan 15;29(2):230-7. doi: 10.1093/bioinformatics/bts668. Epub 2012 Nov 23.
7
Biclustering of linear patterns in gene expression data.基因表达数据中线性模式的双聚类分析
J Comput Biol. 2012 Jun;19(6):619-31. doi: 10.1089/cmb.2012.0032.
8
Using probabilistic estimation of expression residuals (PEER) to obtain increased power and interpretability of gene expression analyses.使用表达残差的概率估计(PEER)提高基因表达分析的能力和可解释性。
Nat Protoc. 2012 Feb 16;7(3):500-7. doi: 10.1038/nprot.2011.457.
9
Development and application of a modified dynamic time warping algorithm (DTW-S) to analyses of primate brain expression time series.改进动态时间规整算法(DTW-S)的开发与应用于灵长类动物大脑表达时间序列分析。
BMC Bioinformatics. 2011 Aug 18;12:347. doi: 10.1186/1471-2105-12-347.
10
FABIA: factor analysis for bicluster acquisition.FABIA:双聚类因子分析。
Bioinformatics. 2010 Jun 15;26(12):1520-7. doi: 10.1093/bioinformatics/btq227. Epub 2010 Apr 23.