Suppr超能文献

重复序列数据库及挖掘基因启动子区域的潜在调控元件。

The repetitive sequence database and mining putative regulatory elements in gene promoter regions.

作者信息

Horng Jorng-Tzong, Huang Hsien-Da, Jin Ming-Hui, Wu Li-Cheng, Huang Shir-Ly

机构信息

Department of Computer Science and Information Engineering, National Central University, Taiwan.

出版信息

J Comput Biol. 2002;9(4):621-40. doi: 10.1089/106652702760277354.

Abstract

At least 43% of the human genome is occupied by repetitive elements. Moreover, around 51% of the rice genome is occupied by repetitive elements. The analysis of repetitive elements reveals that repetitive elements in our genome may have been very important in the evolutionary genomics. The first part of this study is to describe a database of repetitive elements - RSDB. The RSDB database contains repetitive elements, which are classified into the following categories: exact, tandem, and similar. The interfaces needed to query and show the results and statistical data, such as the relationship between repetitive elements and genes, cross-references of repetitive elements among different organisms, and so on, are provided. The second part of this study then attempts to mine the putative binding site for information on how combinations of the known regulatory sites and overrepresented repetitive elements in RSDB are distributed in the promoter regions of groups of functionally related genes. The overrepresented repetitive elements appearing in the associations are possible transcription factor binding sites. Our proposed approach is applied to Saccharomyces cerevisiae and the promoter regions of Yeast ORFs. The complete contents of RSDB and partial putative binding sites are available to the public at www.rsdb.csie.ncu.edu.tw. The readers may download partial query results.

摘要

至少43%的人类基因组由重复元件占据。此外,约51%的水稻基因组由重复元件占据。对重复元件的分析表明,我们基因组中的重复元件在进化基因组学中可能非常重要。本研究的第一部分是描述一个重复元件数据库——RSDB。RSDB数据库包含重复元件,这些重复元件被分为以下几类:精确型、串联型和相似型。提供了查询和显示结果及统计数据所需的界面,例如重复元件与基因之间的关系、不同生物体中重复元件的交叉引用等。本研究的第二部分接着尝试挖掘推定的结合位点,以了解已知调控位点与RSDB中过度富集的重复元件的组合在功能相关基因组的启动子区域中的分布情况。在这些关联中出现的过度富集的重复元件可能是转录因子结合位点。我们提出的方法应用于酿酒酵母和酵母开放阅读框的启动子区域。RSDB的完整内容和部分推定结合位点可在www.rsdb.csie.ncu.edu.tw上向公众提供。读者可以下载部分查询结果。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验