Suppr超能文献

MOPAC:通过对微阵列进行预处理和凝聚聚类来寻找基序

MOPAC: motif finding by preprocessing and agglomerative clustering from microarrays.

作者信息

Ganesh R, Siegele Deborah A, Ioerger Thomas R

机构信息

Department of Computer Science, Texas A&M University, College Station, TX 77840, USA.

出版信息

Pac Symp Biocomput. 2003:41-52.

Abstract

We propose a novel strategy for discovering motifs from gene expression data. The gene expression data in our experiments comes from DNA Microarray analysis of the bacterium E. coli in response to recovery from nutrient starvation. We have annotated the data and identified the upregulated genes. Our interest is to find common regulatory motifs that are responsible for the upregulation of these specific genes. We assume that a common motif that a regulatory protein can bind to will be present in the upstream region of the upregulated genes and will not be present in the upstream regions of genes that showed a constant level of expression over time. Our objective is to find the common motifs that are present in at least some of the upstream sequences of upregulated genes and not present in the control set, which is the set of genes whose expression remained the same. Because it is possible that there could be several subsets of co-regulated genes under different control mechanisms among the co-expressed genes, we do not want to require motifs to be present in all upregulated sequences. Therefore, we propose a new algorithm for finding such motifs through stages of pre-processing, denoising, agglomerative clustering and consensus checking. Through this process, we have found some motifs that are good candidates for further validation.

摘要

我们提出了一种从基因表达数据中发现基序的新策略。我们实验中的基因表达数据来自对大肠杆菌从营养饥饿恢复过程的DNA微阵列分析。我们已对数据进行注释并识别出上调基因。我们感兴趣的是找到导致这些特定基因上调的共同调控基序。我们假设调控蛋白能够结合的共同基序会出现在上调基因的上游区域,而不会出现在随时间表达水平恒定的基因的上游区域。我们的目标是找到至少存在于一些上调基因上游序列中且不存在于对照组(即表达保持不变的基因集)中的共同基序。因为在共表达基因中,不同控制机制下可能存在多个共调控基因子集,所以我们不要求基序存在于所有上调序列中。因此,我们提出了一种通过预处理、去噪、凝聚聚类和一致性检查阶段来寻找此类基序的新算法。通过这个过程,我们发现了一些有望进一步验证的基序。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验