• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

时间序列基因表达数据的连续表示。

Continuous representations of time-series gene expression data.

作者信息

Bar-Joseph Ziv, Gerber Georg K, Gifford David K, Jaakkola Tommi S, Simon Itamar

机构信息

MIT Laboratory for Computer Science, 200 Technology Square, Cambridge, MA 02139, USA.

出版信息

J Comput Biol. 2003;10(3-4):341-56. doi: 10.1089/10665270360688057.

DOI:10.1089/10665270360688057
PMID:12935332
Abstract

We present algorithms for time-series gene expression analysis that permit the principled estimation of unobserved time points, clustering, and dataset alignment. Each expression profile is modeled as a cubic spline (piecewise polynomial) that is estimated from the observed data and every time point influences the overall smooth expression curve. We constrain the spline coefficients of genes in the same class to have similar expression patterns, while also allowing for gene specific parameters. We show that unobserved time points can be reconstructed using our method with 10-15% less error when compared to previous best methods. Our clustering algorithm operates directly on the continuous representations of gene expression profiles, and we demonstrate that this is particularly effective when applied to nonuniformly sampled data. Our continuous alignment algorithm also avoids difficulties encountered by discrete approaches. In particular, our method allows for control of the number of degrees of freedom of the warp through the specification of parameterized functions, which helps to avoid overfitting. We demonstrate that our algorithm produces stable low-error alignments on real expression data and further show a specific application to yeast knock-out data that produces biologically meaningful results.

摘要

我们提出了用于时间序列基因表达分析的算法,这些算法允许对未观察到的时间点进行有原则的估计、聚类以及数据集对齐。每个表达谱被建模为一个三次样条(分段多项式),它是根据观察到的数据估计出来的,并且每个时间点都会影响整体平滑的表达曲线。我们将同一类基因的样条系数约束为具有相似的表达模式,同时也允许基因特异性参数。我们表明,与之前的最佳方法相比,使用我们的方法可以以低10 - 15%的误差重建未观察到的时间点。我们的聚类算法直接对基因表达谱的连续表示进行操作,并且我们证明,当应用于非均匀采样数据时,这特别有效。我们的连续对齐算法也避免了离散方法所遇到的困难。特别是,我们的方法允许通过参数化函数的指定来控制扭曲的自由度数量,这有助于避免过拟合。我们证明我们的算法在真实表达数据上产生稳定的低误差对齐,并进一步展示了其在酵母基因敲除数据上的具体应用,该应用产生了具有生物学意义的结果。

相似文献

1
Continuous representations of time-series gene expression data.时间序列基因表达数据的连续表示。
J Comput Biol. 2003;10(3-4):341-56. doi: 10.1089/10665270360688057.
2
Interpolation based consensus clustering for gene expression time series.基于插值的基因表达时间序列一致性聚类
BMC Bioinformatics. 2015 Apr 16;16:117. doi: 10.1186/s12859-015-0541-0.
3
Beyond synexpression relationships: local clustering of time-shifted and inverted gene expression profiles identifies new, biologically relevant interactions.超越共表达关系:时移和反向基因表达谱的局部聚类可识别新的生物学相关相互作用。
J Mol Biol. 2001 Dec 14;314(5):1053-66. doi: 10.1006/jmbi.2000.5219.
4
A computational approach to the functional clustering of periodic gene-expression profiles.一种用于周期性基因表达谱功能聚类的计算方法。
Genetics. 2008 Oct;180(2):821-34. doi: 10.1534/genetics.108.093690. Epub 2008 Sep 9.
5
Aligning gene expression time series with time warping algorithms.使用时间规整算法对基因表达时间序列进行对齐。
Bioinformatics. 2001 Jun;17(6):495-508. doi: 10.1093/bioinformatics/17.6.495.
6
Nonlinear differential equation model for quantification of transcriptional regulation applied to microarray data of Saccharomyces cerevisiae.用于转录调控定量分析的非线性微分方程模型应用于酿酒酵母的微阵列数据
Nucleic Acids Res. 2007;35(1):279-87. doi: 10.1093/nar/gkl1001. Epub 2006 Dec 14.
7
Clustering of change patterns using Fourier coefficients.使用傅里叶系数对变化模式进行聚类。
Bioinformatics. 2008 Jan 15;24(2):184-91. doi: 10.1093/bioinformatics/btm568. Epub 2007 Nov 19.
8
Finding explained groups of time-course gene expression profiles with predictive clustering trees.使用预测聚类树寻找时间进程基因表达谱的可解释分组。
Mol Biosyst. 2010 Apr;6(4):729-40. doi: 10.1039/b913690h. Epub 2010 Feb 19.
9
Time Delayed Causal Gene Regulatory Network Inference with Hidden Common Causes.具有隐藏共同原因的时间延迟因果基因调控网络推理
PLoS One. 2015 Sep 22;10(9):e0138596. doi: 10.1371/journal.pone.0138596. eCollection 2015.
10
TA-clustering: cluster analysis of gene expression profiles through Temporal Abstractions.TA聚类:通过时间抽象对基因表达谱进行聚类分析。
Int J Med Inform. 2005 Aug;74(7-8):505-17. doi: 10.1016/j.ijmedinf.2005.03.014.

引用本文的文献

1
Integrating Gene Expression Data into Single-Step Method (ssBLUP) Improves Genomic Prediction Accuracy for Complex Traits of Duroc × Erhualian F Pig Population.将基因表达数据整合到单步方法(ssBLUP)中可提高杜洛克×二花脸F猪群体复杂性状的基因组预测准确性。
Curr Issues Mol Biol. 2024 Dec 3;46(12):13713-13724. doi: 10.3390/cimb46120819.
2
Integrating patients in time series clinical transcriptomics data.整合时间序列临床转录组学数据中的患者信息。
Bioinformatics. 2024 Jun 28;40(Suppl 1):i151-i159. doi: 10.1093/bioinformatics/btae241.
3
Cell-specific imputation of drug connectivity mapping with incomplete data.
基于不完全数据的药物连通性映射的细胞特异性推断。
PLoS One. 2023 Feb 16;18(2):e0278289. doi: 10.1371/journal.pone.0278289. eCollection 2023.
4
A machine-learning approach for long-term prediction of experimental cardiac action potential time series using an autoencoder and echo state networks.一种使用自动编码器和回声状态网络对实验性心脏动作电位时间序列进行长期预测的机器学习方法。
Chaos. 2022 Jun;32(6):063117. doi: 10.1063/5.0087812.
5
Prediction of chaotic time series using recurrent neural networks and reservoir computing techniques: A comparative study.使用递归神经网络和储层计算技术预测混沌时间序列:一项比较研究。
Mach Learn Appl. 2022 Jun 15;8. doi: 10.1016/j.mlwa.2022.100300. Epub 2022 Apr 9.
6
Inferring directional relationships in microbial communities using signed Bayesian networks.使用带符号贝叶斯网络推断微生物群落中的方向性关联。
BMC Genomics. 2020 Dec 21;21(Suppl 6):663. doi: 10.1186/s12864-020-07065-0.
7
Metabolomics and Multi-Omics Integration: A Survey of Computational Methods and Resources.代谢组学与多组学整合:计算方法与资源综述
Metabolites. 2020 May 15;10(5):202. doi: 10.3390/metabo10050202.
8
Lag penalized weighted correlation for time series clustering.滞后惩罚加权相关的时间序列聚类。
BMC Bioinformatics. 2020 Jan 16;21(1):21. doi: 10.1186/s12859-019-3324-1.
9
Wearables and the Quantified Self: Systematic Benchmarking of Physiological Sensors.可穿戴设备和量化自我:生理传感器的系统基准测试。
Sensors (Basel). 2019 Oct 14;19(20):4448. doi: 10.3390/s19204448.
10
Etiopathogenesis of Suicide: A Conceptual Analysis of Risk and Prevention Within a Comprehensive, Deterministic Model.自杀的病因学:在一个全面的、确定性模型中对风险与预防的概念性分析。
Front Psychol. 2019 Sep 12;10:2087. doi: 10.3389/fpsyg.2019.02087. eCollection 2019.