• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用表达基因标记和基因建模在人类基因组中鉴定一致性启动子。

Consensus promoter identification in the human genome utilizing expressed gene markers and gene modeling.

作者信息

Liu Rongxiang, States David J

机构信息

Bioinformatics Program and the Department of Human Genetics, University of Michigan, Ann Arbor, MI 48109, USA.

出版信息

Genome Res. 2002 Mar;12(3):462-9. doi: 10.1101/gr.198002.

DOI:10.1101/gr.198002
PMID:11875035
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC155291/
Abstract

Deciphering the human genome includes locating the promoters that initiate transcription and identifying the exons of genes. Many promoter prediction programs have been proposed, but when they are applied to extended regions of the genome, most of their predictions are false-positives. The extensive collection of gene transcript sequences is an important new source of information, which has not been used previously in promoter predictions. Our approach is to enhance the specificity of predictions by restricting the genomic regions that are searched using gene transcript alignments as anchors in the genome for gene modeling. We developed a consensus promoter prediction method combining previously developed algorithms with the GENSCAN gene modeling program. Our method, CONPRO (CONsensus PROmoter), identifies promoters with very high confidence, and the predicted promoters are guaranteed to be associated with genes. On our test data set, the method correctly detects promoters for approximately half of all human genes (37%-71%), and most predictions are true promoters (85%-90%). Applying our method to the human genome and human genes from the Unigene data set, we find the promoters for 13,744 genes. Of these, 6440 are genes with a functionally cloned mRNA, and 7304 are novel genes for which only expressed sequence tags (ESTs) are available. Candidate promoters for many novel genes will be a useful resource in elucidating complex biological response mechanisms.

摘要

解读人类基因组包括定位启动转录的启动子以及识别基因的外显子。已经提出了许多启动子预测程序,但当将它们应用于基因组的扩展区域时,其大多数预测都是假阳性。基因转录本序列的广泛收集是一个重要的新信息来源,以前在启动子预测中尚未使用过。我们的方法是通过使用基因转录本比对作为基因组中基因建模的锚点来限制搜索的基因组区域,从而提高预测的特异性。我们开发了一种将先前开发的算法与GENSCAN基因建模程序相结合的一致性启动子预测方法。我们的方法CONPRO(一致性启动子)能够以非常高的置信度识别启动子,并且预测的启动子保证与基因相关。在我们的测试数据集上,该方法能正确检测出约一半人类基因(37%-71%)的启动子,并且大多数预测都是真正的启动子(85%-90%)。将我们的方法应用于人类基因组和来自Unigene数据集的人类基因,我们找到了13744个基因的启动子。其中,6440个是具有功能克隆mRNA的基因,7304个是仅具有表达序列标签(EST)的新基因。许多新基因的候选启动子将成为阐明复杂生物反应机制的有用资源。

相似文献

1
Consensus promoter identification in the human genome utilizing expressed gene markers and gene modeling.利用表达基因标记和基因建模在人类基因组中鉴定一致性启动子。
Genome Res. 2002 Mar;12(3):462-9. doi: 10.1101/gr.198002.
2
Promoter-sharing by different genes in human genome--CPNE1 and RBM12 gene pair as an example.人类基因组中不同基因的启动子共享——以CPNE1和RBM12基因对为例。
BMC Genomics. 2008 Oct 3;9:456. doi: 10.1186/1471-2164-9-456.
3
Genome-wide analysis of core promoter structures in Schizosaccharomyces pombe with DeepCAGE.利用深度CAGE对粟酒裂殖酵母核心启动子结构进行全基因组分析。
RNA Biol. 2015;12(5):525-37. doi: 10.1080/15476286.2015.1022704.
4
Sequence patterns defining the 5' boundary of human genes.定义人类基因5'边界的序列模式。
Biopolymers. 2001 Oct 15;59(5):347-55. doi: 10.1002/1097-0282(20011015)59:5<347::AID-BIP1032>3.0.CO;2-6.
5
Characterization of 954 bovine full-CDS cDNA sequences.954条牛全长编码序列(CDS)cDNA序列的特征分析
BMC Genomics. 2005 Nov 23;6:166. doi: 10.1186/1471-2164-6-166.
6
Genome-wide prediction of transcriptional regulatory elements of human promoters using gene expression and promoter analysis data.利用基因表达和启动子分析数据对人类启动子的转录调控元件进行全基因组预测。
BMC Bioinformatics. 2006 Jul 4;7:330. doi: 10.1186/1471-2105-7-330.
7
Identification of promoter regions in the human genome by using a retroviral plasmid library-based functional reporter gene assay.利用基于逆转录病毒质粒文库的功能性报告基因分析鉴定人类基因组中的启动子区域。
Genome Res. 2003 Jul;13(7):1765-74. doi: 10.1101/gr.529803. Epub 2003 Jun 12.
8
Retroviral promoters in the human genome.人类基因组中的逆转录病毒启动子。
Bioinformatics. 2008 Jul 15;24(14):1563-7. doi: 10.1093/bioinformatics/btn243. Epub 2008 Jun 5.
9
Genomic organization of human arylamine N-acetyltransferase Type I reveals alternative promoters that generate different 5'-UTR splice variants with altered translational activities.人类I型芳基胺N-乙酰基转移酶的基因组组织揭示了替代启动子,这些启动子产生具有改变的翻译活性的不同5'-UTR剪接变体。
Biochem J. 2005 Apr 1;387(Pt 1):119-27. doi: 10.1042/BJ20040903.
10
Cluster analysis and promoter modelling as bioinformatics tools for the identification of target genes from expression array data.聚类分析和启动子建模作为从表达阵列数据中识别靶基因的生物信息学工具。
Pharmacogenomics. 2001 Feb;2(1):25-36. doi: 10.1517/14622416.2.1.25.

引用本文的文献

1
Flnc: Machine Learning Improves the Identification of Novel Long Noncoding RNAs from Stand-Alone RNA-Seq Data.Flnc:机器学习助力从独立RNA测序数据中鉴定新型长链非编码RNA
Noncoding RNA. 2022 Oct 13;8(5):70. doi: 10.3390/ncrna8050070.
2
Critical assessment of computational tools for prokaryotic and eukaryotic promoter prediction.原核生物和真核生物启动子预测的计算工具的批判性评估。
Brief Bioinform. 2022 Mar 10;23(2). doi: 10.1093/bib/bbab551.
3
Role of Non-Coding Regulatory Elements in the Control of GR-Dependent Gene Expression.非编码调控元件在 GR 依赖性基因表达调控中的作用。
Int J Mol Sci. 2021 Apr 20;22(8):4258. doi: 10.3390/ijms22084258.
4
Comparison of machine learning and deep learning techniques in promoter prediction across diverse species.跨物种启动子预测中机器学习与深度学习技术的比较
PeerJ Comput Sci. 2021 Feb 9;7:e365. doi: 10.7717/peerj-cs.365. eCollection 2021.
5
First insights into the molecular basis association between promoter polymorphisms of the IL1B gene and Helicobacter pylori infection in the Sudanese population: computational approach.苏丹人群白细胞介素 1B 基因启动子多态性与幽门螺杆菌感染相关性的分子基础初探:计算方法。
BMC Microbiol. 2021 Jan 7;21(1):16. doi: 10.1186/s12866-020-02072-3.
6
A composite method based on formal grammar and DNA structural features in detecting human polymerase II promoter region.基于形式语法和 DNA 结构特征的复合方法检测人类聚合酶 II 启动子区域。
PLoS One. 2013;8(2):e54843. doi: 10.1371/journal.pone.0054843. Epub 2013 Feb 20.
7
Fine tuning the transcription of ldhA for D-lactate production.优化 ldhA 的转录以生产 D-乳酸。
J Ind Microbiol Biotechnol. 2012 Aug;39(8):1209-17. doi: 10.1007/s10295-012-1116-y. Epub 2012 Mar 20.
8
Identification of the transcriptional promoters in the proximal regions of human microRNA genes.鉴定人类 microRNA 基因近端区域的转录启动子。
Mol Biol Rep. 2011 Aug;38(6):4153-7. doi: 10.1007/s11033-010-0535-y. Epub 2010 Nov 24.
9
Recent computational approaches to understand gene regulation: mining gene regulation in silico.最近用于理解基因调控的计算方法:计算机挖掘基因调控。
Curr Genomics. 2007 Apr;8(2):79-91. doi: 10.2174/138920207780368150.
10
Computational methods to dissect cis-regulatory transcriptional networks.剖析顺式调控转录网络的计算方法。
J Biosci. 2007 Dec;32(7):1325-30. doi: 10.1007/s12038-007-0142-9.

本文引用的文献

1
Conformational model for binding site recognition by the E.coli MetJ transcription factor.大肠杆菌MetJ转录因子识别结合位点的构象模型。
Bioinformatics. 2001 Jul;17(7):622-33. doi: 10.1093/bioinformatics/17.7.622.
2
Initial sequencing and analysis of the human genome.人类基因组的初步测序与分析。
Nature. 2001 Feb 15;409(6822):860-921. doi: 10.1038/35057062.
3
First pass annotation of promoters on human chromosome 22.人类22号染色体上启动子的首过注释
Genome Res. 2001 Mar;11(3):333-40. doi: 10.1101/gr.154601.
4
Comparative evaluation of 5'-end-sequence quality of clones in CAP trapper and other full-length-cDNA libraries.CAP 捕获法及其他全长 cDNA 文库中克隆的 5' 端序列质量的比较评估
Gene. 2001 Jan 24;263(1-2):93-102. doi: 10.1016/s0378-1119(00)00557-6.
5
The sequence of the human genome.人类基因组序列。
Science. 2001 Feb 16;291(5507):1304-51. doi: 10.1126/science.1058040.
6
UTR reconstruction and analysis using genomically aligned EST sequences.利用基因组比对的EST序列进行UTR重建与分析
Proc Int Conf Intell Syst Mol Biol. 2000;8:218-27.
7
Statistical analysis of the 5' untranslated region of human mRNA using "Oligo-Capped" cDNA libraries.使用“寡聚帽”cDNA文库对人类mRNA的5'非翻译区进行统计分析。
Genomics. 2000 Mar 15;64(3):286-97. doi: 10.1006/geno.2000.6076.
8
Highly specific localization of promoter regions in large genomic sequences by PromoterInspector: a novel context analysis approach.通过启动子检测工具在大型基因组序列中对启动子区域进行高度特异性定位:一种新型的上下文分析方法。
J Mol Biol. 2000 Mar 31;297(3):599-606. doi: 10.1006/jmbi.2000.3589.
9
Promoter2.0: for the recognition of PolII promoter sequences.启动子2.0:用于识别聚合酶II启动子序列。
Bioinformatics. 1999 May;15(5):356-61. doi: 10.1093/bioinformatics/15.5.356.
10
A computer program for aligning a cDNA sequence with a genomic DNA sequence.一种用于将互补DNA(cDNA)序列与基因组DNA序列进行比对的计算机程序。
Genome Res. 1998 Sep;8(9):967-74. doi: 10.1101/gr.8.9.967.