• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

保守非编码基因组序列中调控信号的富集

Enrichment of regulatory signals in conserved non-coding genomic sequence.

作者信息

Levy S, Hannenhalli S, Workman C

机构信息

Informatics Research, Celera Genomics Corporation, 45 West Gude Drive, Rockville, MD 20850, USA.

出版信息

Bioinformatics. 2001 Oct;17(10):871-7. doi: 10.1093/bioinformatics/17.10.871.

DOI:10.1093/bioinformatics/17.10.871
PMID:11673231
Abstract

MOTIVATION

Whole genome shotgun sequencing strategies generate sequence data prior to the application of assembly methodologies that result in contiguous sequence. Sequence reads can be employed to indicate regions of conservation between closely related species for which only one genome has been assembled. Consequently, by using pairwise sequence alignments methods it is possible to identify novel, non-repetitive, conserved segments in non-coding sequence that exist between the assembled human genome and mouse whole genome shotgun sequencing fragments. Conserved non-coding regions identify potentially functional DNA that could be involved in transcriptional regulation.

RESULTS

Local sequence alignment methods were applied employing mouse fragments and the assembled human genome. In addition, transcription factor binding sites were detected by aligning their corresponding positional weight matrices to the sequence regions. These methods were applied to a set of transcripts corresponding to 502 genes associated with a variety of different human diseases taken from the Online Mendelian Inheritance in Man database. Using statistical arguments we have shown that conserved non-coding segments contain an enrichment of transcription factor binding sites when compared to the sequence background in which the conserved segments are located. This enrichment of binding sites was not observed in coding sequence. Conserved non-coding segments are not extensively repeated in the genome and therefore their identification provides a rapid means of finding genes with related conserved regions, and consequently potentially related regulatory mechanism. Conserved segments in upstream regions are found to contain binding sites that are co-localized in a manner consistent with experimentally known transcription factor pairwise co-occurrences and afford the identification of novel co-occurring Transcription Factor (TF) pairs. This study provides a methodology and more evidence to suggest that conserved non-coding regions are biologically significant since they contain a statistical enrichment of regulatory signals and pairs of signals that enable the construction of regulatory models for human genes.

CONTACT

samuel.levy@celera.com.

摘要

动机

全基因组鸟枪法测序策略在应用导致连续序列的组装方法之前生成序列数据。序列读数可用于指示仅组装了一个基因组的密切相关物种之间的保守区域。因此,通过使用成对序列比对方法,可以在已组装的人类基因组与小鼠全基因组鸟枪法测序片段之间的非编码序列中识别新的、非重复的保守片段。保守的非编码区域可识别可能参与转录调控的潜在功能性DNA。

结果

应用局部序列比对方法,将小鼠片段与已组装的人类基因组进行比对。此外,通过将转录因子结合位点的相应位置权重矩阵与序列区域进行比对来检测转录因子结合位点。这些方法应用于一组对应于502个与多种不同人类疾病相关的基因的转录本,这些基因取自《人类孟德尔遗传在线》数据库。通过统计学论证,我们表明,与保守片段所在的序列背景相比,保守的非编码片段富含转录因子结合位点。在编码序列中未观察到这种结合位点的富集。保守的非编码片段在基因组中并非广泛重复,因此它们的识别提供了一种快速找到具有相关保守区域的基因的方法,进而可能找到相关的调控机制。发现上游区域的保守片段包含以与实验已知的转录因子成对共现一致的方式共定位的结合位点,并有助于识别新的共现转录因子(TF)对。本研究提供了一种方法,并提供了更多证据表明保守的非编码区域具有生物学意义,因为它们包含调控信号的统计学富集以及能够构建人类基因调控模型的信号对。

联系方式

samuel.levy@celera.com

相似文献

1
Enrichment of regulatory signals in conserved non-coding genomic sequence.保守非编码基因组序列中调控信号的富集
Bioinformatics. 2001 Oct;17(10):871-7. doi: 10.1093/bioinformatics/17.10.871.
2
Enrichment of transcriptional regulatory sites in non-coding genomic region.非编码基因组区域中转录调控位点的富集
Bioinformatics. 2004 Mar 1;20(4):569-75. doi: 10.1093/bioinformatics/btg450. Epub 2004 Jan 22.
3
Identification of transcription factor binding sites in the human genome sequence.人类基因组序列中转录因子结合位点的鉴定。
Mamm Genome. 2002 Sep;13(9):510-4. doi: 10.1007/s00335-002-2175-6.
4
Annotating regulatory DNA based on man-mouse genomic comparison.基于人鼠基因组比较对调控性DNA进行注释。
Bioinformatics. 2002;18 Suppl 2:S84-90. doi: 10.1093/bioinformatics/18.suppl_2.s84.
5
Identification of conserved regulatory elements by comparative genome analysis.通过比较基因组分析鉴定保守调控元件。
J Biol. 2003;2(2):13. doi: 10.1186/1475-4924-2-13. Epub 2003 May 22.
6
Whole genome human/mouse phylogenetic footprinting of potential transcription regulatory signals.潜在转录调控信号的全基因组人类/小鼠系统发育足迹分析
Pac Symp Biocomput. 2003:291-302. doi: 10.1142/9789812776303_0028.
7
Evolutionary rates and patterns for human transcription factor binding sites derived from repetitive DNA.源自重复DNA的人类转录因子结合位点的进化速率和模式。
BMC Genomics. 2008 May 17;9:226. doi: 10.1186/1471-2164-9-226.
8
Comparing vertebrate whole-genome shotgun reads to the human genome.将脊椎动物全基因组鸟枪法测序 reads 与人类基因组进行比较。
Genome Res. 2001 Nov;11(11):1807-16. doi: 10.1101/gr.203601.
9
Ab initio identification of putative human transcription factor binding sites by comparative genomics.通过比较基因组学从头鉴定假定的人类转录因子结合位点
BMC Bioinformatics. 2005 May 2;6:110. doi: 10.1186/1471-2105-6-110.
10
NemaFootPrinter: a web based software for the identification of conserved non-coding genome sequence regions between C. elegans and C. briggsae.线虫足部打印机:一种基于网络的软件,用于识别秀丽隐杆线虫和briggsae线虫之间保守的非编码基因组序列区域。
BMC Bioinformatics. 2005 Dec 1;6 Suppl 4(Suppl 4):S22. doi: 10.1186/1471-2105-6-S4-S22.

引用本文的文献

1
Enhancers in Plant Development, Adaptation and Evolution.植物发育、适应与进化中的增强子
Plant Cell Physiol. 2025 May 17;66(4):461-476. doi: 10.1093/pcp/pcae121.
2
-regulatory elements in conserved non-coding sequences of nuclear receptor genes indicate for crosstalk between endocrine systems.核受体基因保守非编码序列中的调控元件表明内分泌系统之间存在串扰。
Open Med (Wars). 2021 Apr 12;16(1):640-650. doi: 10.1515/med-2021-0264. eCollection 2021.
3
Tbx2a Modulates Switching of RH2 and LWS Opsin Gene Expression.Tbx2a 调节 RH2 和 LWS 视蛋白基因表达的转换。
Mol Biol Evol. 2020 Jul 1;37(7):2002-2014. doi: 10.1093/molbev/msaa062.
4
A Method for the Structure-Based, Genome-Wide Analysis of Bacterial Intergenic Sequences Identifies Shared Compositional and Functional Features.基于结构的全基因组分析细菌基因间序列的方法可识别共享的组成和功能特征。
Genes (Basel). 2019 Oct 22;10(10):834. doi: 10.3390/genes10100834.
5
Draft genome of provides insights into conserved regulatory elements of the brain restriction gene in planarians.[物种名称]的基因组草图为涡虫中脑限制基因的保守调控元件提供了见解。
Zoological Lett. 2018 Aug 29;4:24. doi: 10.1186/s40851-018-0102-2. eCollection 2018.
6
An Updated Functional Annotation of Protein-Coding Genes in the Cucumber Genome.黄瓜基因组中蛋白质编码基因的更新功能注释
Front Plant Sci. 2018 Mar 15;9:325. doi: 10.3389/fpls.2018.00325. eCollection 2018.
7
Conserved non-coding elements: developmental gene regulation meets genome organization.保守非编码元件:发育基因调控与基因组组织相遇
Nucleic Acids Res. 2017 Dec 15;45(22):12611-12624. doi: 10.1093/nar/gkx1074.
8
Genome-wide identification of conserved intronic non-coding sequences using a Bayesian segmentation approach.使用贝叶斯分割方法对保守内含子非编码序列进行全基因组鉴定。
BMC Genomics. 2017 Mar 27;18(1):259. doi: 10.1186/s12864-017-3645-2.
9
Purifying selection acts on coding and non-coding sequences of paralogous genes in Arabidopsis thaliana.纯化选择作用于拟南芥中旁系同源基因的编码和非编码序列。
BMC Genomics. 2016 Jun 13;17:456. doi: 10.1186/s12864-016-2803-2.
10
CNMS: The preferred genic markers for comparative genomic, molecular phylogenetic, functional genetic diversity and differential gene regulatory expression analyses in chickpea.鹰嘴豆中用于比较基因组、分子系统发育、功能遗传多样性和差异基因调控表达分析的首选基因标记。
J Biosci. 2015 Sep;40(3):579-92. doi: 10.1007/s12038-015-9545-1.