Suppr超能文献

基于测量动力学的质量评估和选择基因表达信号。

Assessing and selecting gene expression signals based upon the quality of the measured dynamics.

作者信息

Yang Eric, Androulakis Ioannis P

机构信息

Biomedical Engineering Department, Rutgers University, Piscataway, NJ, USA.

出版信息

BMC Bioinformatics. 2009 Feb 10;10:55. doi: 10.1186/1471-2105-10-55.

Abstract

BACKGROUND

One of the challenges with modeling the temporal progression of biological signals is dealing with the effect of noise and the limited number of replicates at each time point. Given the rising interest in utilizing predictive mathematical models to describe the biological response of an organism or analysis such as clustering and gene ontology enrichment, it is important to determine whether the dynamic progression of the data has been accurately captured despite the limited number of replicates, such that one can have confidence that the results of the analysis are capturing important salient dynamic features.

RESULTS

By pre-selecting genes based upon quality before the identification of differential expression via algorithm such as EDGE, it was found that the percentage of statistically enriched ontologies (p < .05) was improved. Furthermore, it was found that a majority of the genes found via the proposed technique were also selected via an EDGE selection though the reverse was not necessarily true. It was also found that improvements offered by the proposed algorithm are anti-correlated with improvements in the various microarray platforms and the number of replicates. This is illustrated by the fact that newer arrays and experiments with more replicates show less improvement when the filtering for quality is first run before the selection of differentially expressed genes. This suggests that the increase in the number of replicates as well as improvements in array technologies are increase the confidence one has in the dynamics obtained from the experiment.

CONCLUSION

We have developed an algorithm that quantifies the quality of temporal biological signal rather than whether the signal illustrates a significant change over the experimental time course. Because the use of these temporal signals, whether it is in mathematical modeling or clustering, focuses upon the entire time series, it is necessary to develop a method to quantify and select for signals which conform to this ideal. By doing this, we have demonstrated a marked and consistent improvement in the results of a clustering exercise over multiple experiments, microarray platforms, and experimental designs.

摘要

背景

对生物信号的时间进程进行建模面临的挑战之一是处理噪声的影响以及每个时间点重复样本数量有限的问题。鉴于利用预测性数学模型来描述生物体的生物反应或进行聚类和基因本体富集等分析的兴趣日益增加,尽管重复样本数量有限,但确定数据的动态进程是否已被准确捕捉非常重要,这样人们才能相信分析结果捕捉到了重要的显著动态特征。

结果

通过在使用诸如EDGE等算法识别差异表达之前基于质量对基因进行预选择,发现统计学上富集的本体的百分比(p <.05)有所提高。此外,发现通过所提出的技术找到的大多数基因也通过EDGE选择被选中,尽管反之不一定成立。还发现所提出的算法带来的改进与各种微阵列平台的改进以及重复样本数量呈负相关。这一点通过以下事实得到说明:当在选择差异表达基因之前先进行质量过滤时,更新的阵列和重复样本更多的实验显示出的改进较少。这表明重复样本数量的增加以及阵列技术的改进增加了人们对从实验中获得的动态结果的信心。

结论

我们开发了一种算法,该算法量化时间生物信号的质量,而不是信号是否在实验时间进程中显示出显著变化。因为这些时间信号的使用,无论是在数学建模还是聚类中,都关注整个时间序列,所以有必要开发一种方法来量化和选择符合这一理想的信号。通过这样做,我们在多个实验、微阵列平台和实验设计的聚类练习结果中展示了显著且一致的改进。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a391/2653486/e0a4d14c0c91/1471-2105-10-55-1.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验