
SMASHing regulatory sites in DNA by human-mouse sequence comparisons.

Authors

Zavolan Mihaela, Socci Nicholas D, Rajewsky Nikolaus, Gaasterland Terry

Affiliation

Laboratory for Computational Genomics, The Rockefeller University, New York, NY 10021, USA.

Publication

Proc IEEE Comput Soc Bioinform Conf. 2003;2:277-86.

PMID:16452803
Abstract

Regulatory sequence elements provide important clues to understanding and predicting gene expression. Although the binding sites for hundreds of transcription factors are known, there has been no systematic attempt to incorporate this information in the annotation of the human genome. Cross-species sequence comparisons are critical to a meaningful annotation of regulatory elements since they generally reside in conserved non-coding regions. To take advantage of the recently completed drafts of the mouse and human genomes for annotating transcription factor binding sites, we developed SMASH, a computational pipeline that identifies thousands of orthologous human/mouse proteins, maps them to genomic sequences, extracts and compares upstream regions and annotates putative regulatory elements in conserved, non-coding, upstream regions. Our current dataset consists of approximately 2,500 human/mouse gene pairs. Transcription start sites were estimated by mapping quasi-full-length cDNA sequences. SMASH uses a novel probabilistic method to identify putative conserved binding sites that takes into account the competition between transcription factors for binding DNA. SMASH presents the results via a genome browser web interface which displays the predicted regulatory information together with the current annotations for the human genome. Our results are validated by comparison to previously published experimental data. SMASH results compare favorably to other existing computational approaches.
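
The final step of the pipeline described in the abstract, annotating putative binding sites in conserved upstream regions, can be illustrated with a minimal sketch. It is not the SMASH method: it scores each window of two pre-aligned, equal-length upstream sequences against a toy position weight matrix and keeps positions where both species exceed a threshold. It does not model the competition between transcription factors that the abstract mentions. The motif, the uniform background, the sequences, the threshold, and the function names are all hypothetical and chosen only for illustration.

```python
import math

# Hypothetical toy motif (assumption): per-position base probabilities for a 4-bp site.
TOY_MOTIF = [
    {"A": 0.85, "C": 0.05, "G": 0.05, "T": 0.05},
    {"A": 0.05, "C": 0.85, "G": 0.05, "T": 0.05},
    {"A": 0.05, "C": 0.05, "G": 0.85, "T": 0.05},
    {"A": 0.05, "C": 0.05, "G": 0.05, "T": 0.85},
]
BACKGROUND = 0.25  # assumed uniform background base frequency


def log_odds(window, motif):
    """Log2-odds score of a window under the motif versus the background."""
    total = 0.0
    for base, column in zip(window, motif):
        if base not in column:  # gap or ambiguous base: window cannot be a site
            return float("-inf")
        total += math.log2(column[base] / BACKGROUND)
    return total


def conserved_sites(human, mouse, motif, threshold):
    """Return (start, human_score, mouse_score) for windows scoring above
    the threshold in BOTH aligned sequences."""
    if len(human) != len(mouse):
        raise ValueError("aligned sequences must have equal length")
    width = len(motif)
    hits = []
    for start in range(len(human) - width + 1):
        h_score = log_odds(human[start:start + width], motif)
        m_score = log_odds(mouse[start:start + width], motif)
        if h_score >= threshold and m_score >= threshold:
            hits.append((start, h_score, m_score))
    return hits


# Hypothetical aligned upstream fragments; the second ACGT site is mutated in mouse.
human_upstream = "TTACGTGGCCACGTAA"
mouse_upstream = "TTACGTGGCCACCTAA"
hits = conserved_sites(human_upstream, mouse_upstream, TOY_MOTIF, threshold=4.0)
for start, h_score, m_score in hits:
    print(start, round(h_score, 2), round(m_score, 2))
```

In this toy input the human sequence contains two matches to the motif, but only the one that is also intact in the mouse sequence is reported, which is the intuition behind restricting annotation to conserved regions.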

Similar articles

1. SMASHing regulatory sites in DNA by human-mouse sequence comparisons.
   Proc IEEE Comput Soc Bioinform Conf. 2003;2:277-86.
2. TFBScluster: a resource for the characterization of transcriptional regulatory networks.
   Bioinformatics. 2005 Jul 1;21(13):3058-9. doi: 10.1093/bioinformatics/bti461. Epub 2005 Apr 26.
3. Annotating regulatory DNA based on man-mouse genomic comparison.
   Bioinformatics. 2002;18 Suppl 2:S84-90. doi: 10.1093/bioinformatics/18.suppl_2.s84.
4. WeederH: an algorithm for finding conserved regulatory motifs and regions in homologous sequences.
   BMC Bioinformatics. 2007 Feb 7;8:46. doi: 10.1186/1471-2105-8-46.
5. OMGProm: a database of orthologous mammalian gene promoters.
   Bioinformatics. 2005 Mar;21(6):835-6. doi: 10.1093/bioinformatics/bti119. Epub 2004 Nov 5.
6. Using PhyloCon to identify conserved regulatory motifs.
   Curr Protoc Bioinformatics. 2007 Sep;Chapter 2:Unit 2.12. doi: 10.1002/0471250953.bi0212s19.
7. GenoMiner: a tool for genome-wide search of coding and non-coding conserved sequence tags.
   Bioinformatics. 2006 Feb 15;22(4):497-9. doi: 10.1093/bioinformatics/bti754. Epub 2005 Nov 2.
8. Comparative promoter region analysis powered by CORG.
   BMC Genomics. 2005 Feb 21;6:24. doi: 10.1186/1471-2164-6-24.
9. Discovery of regulatory elements in vertebrates through comparative genomics.
   Nat Biotechnol. 2005 Oct;23(10):1249-56. doi: 10.1038/nbt1140.
10. Deltarho-web, an online tool to assess composition similarity of individual nucleic acid sequences.
    Bioinformatics. 2005 Jul 1;21(13):3053-5. doi: 10.1093/bioinformatics/bti460. Epub 2005 Apr 26.

Cited by

1. Modelling gene regulation networks via multivariate adaptive splines.
   Cancer Genomics Proteomics. 2008 Jan-Feb;5(1):55-62.
2. Finding regulatory elements and regulatory motifs: a general probabilistic framework.
   BMC Bioinformatics. 2007 Sep 27;8 Suppl 6(Suppl 6):S4. doi: 10.1186/1471-2105-8-S6-S4.
3. PhyloGibbs: a Gibbs sampling motif finder that incorporates phylogeny.
   PLoS Comput Biol. 2005 Dec;1(7):e67. doi: 10.1371/journal.pcbi.0010067. Epub 2005 Dec 9.
4. Cross-species comparison of Drosophila male accessory gland protein genes.
   Genetics. 2005 Sep;171(1):131-43. doi: 10.1534/genetics.105.043844. Epub 2005 Jun 8.
5. Cross-species comparison significantly improves genome-wide prediction of cis-regulatory modules in Drosophila.
   BMC Bioinformatics. 2004 Sep 9;5:129. doi: 10.1186/1471-2105-5-129.
6. Conservation of regulatory elements between two species of Drosophila.
   BMC Bioinformatics. 2003 Nov 20;4:57. doi: 10.1186/1471-2105-4-57.