• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从开头开始:预测带有5'非翻译区的基因。

Begin at the beginning: predicting genes with 5' UTRs.

作者信息

Brown Randall H, Gross Samuel S, Brent Michael R

机构信息

Laboratory for Computational Genomics, Washington University, St. Louis, MO 63130, USA.

出版信息

Genome Res. 2005 May;15(5):742-7. doi: 10.1101/gr.3696205.

DOI:10.1101/gr.3696205
PMID:15867435
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC1088303/
Abstract

The retrainable, comparative gene predictor N-SCAN integrates multigenome modeling and 5' untranslated region (5' UTR) modeling. In this article, we evaluate N-SCAN's transcription-start site (TSS) and first exon predictions both computationally and experimentally. The computational results indicate that N-SCAN is more accurate than any of the other tools we tested at predicting the TSS and the complete first exon. It is the only one of these tools that can predict complete gene structures together with 5' UTRs. Experimental evaluation shows that N-SCAN can be used to validate novel UTR introns in human gene predictions that do not overlap any RefSeq gene and even to correct RefSeq mRNAs by adding validated UTR exons that are missing from RefSeq.

摘要

可重新训练的比较基因预测器N-SCAN整合了多基因组建模和5'非翻译区(5'UTR)建模。在本文中,我们通过计算和实验评估了N-SCAN对转录起始位点(TSS)和首个外显子的预测。计算结果表明,在预测TSS和完整的首个外显子时,N-SCAN比我们测试的任何其他工具都更准确。它是这些工具中唯一能够预测完整基因结构以及5'UTR的工具。实验评估表明,N-SCAN可用于验证人类基因预测中不与任何RefSeq基因重叠的新型UTR内含子,甚至通过添加RefSeq中缺失的经过验证的UTR外显子来校正RefSeq mRNA。

相似文献

1
Begin at the beginning: predicting genes with 5' UTRs.从开头开始:预测带有5'非翻译区的基因。
Genome Res. 2005 May;15(5):742-7. doi: 10.1101/gr.3696205.
2
Bioinformatic analyses of mammalian 5'-UTR sequence properties of mRNAs predicts alternative translation initiation sites.对哺乳动物mRNA的5'-UTR序列特性进行生物信息学分析可预测替代性翻译起始位点。
BMC Bioinformatics. 2008 May 8;9:232. doi: 10.1186/1471-2105-9-232.
3
Multiple tissue-specific promoters control expression of the murine tartrate-resistant acid phosphatase gene.多个组织特异性启动子控制小鼠抗酒石酸酸性磷酸酶基因的表达。
Gene. 2003 Mar 27;307:111-23. doi: 10.1016/s0378-1119(03)00449-9.
4
Gene structure and organization of the human beta-secretase (BACE) promoter.人类β-分泌酶(BACE)启动子的基因结构与组织
FASEB J. 2004 Jun;18(9):1034-6. doi: 10.1096/fj.03-1378fje. Epub 2004 Apr 1.
5
The 5' leader of plant PgiC has an intron: the leader shows both the loss and maintenance of constraints compared with introns and exons in the coding region.植物磷酸葡萄糖异构酶(PgiC)的5'前导序列含有一个内含子:与编码区的内含子和外显子相比,该前导序列在限制因素上既表现出丧失又表现出保留。
Mol Biol Evol. 2002 Sep;19(9):1613-23. doi: 10.1093/oxfordjournals.molbev.a004223.
6
Molecular characterization of the 5'-UTR of retinal dystrophin reveals a cryptic intron that regulates translational activity.视网膜肌营养不良蛋白5'-UTR的分子特征揭示了一个调控翻译活性的隐蔽内含子。
Mol Vis. 2010 Dec 7;16:2590-7.
7
Structural organization of the human complexin 2 gene (CPLX2) and aspects of its functional activity.人类复合体蛋白2基因(CPLX2)的结构组织及其功能活性方面
Gene. 2005 Oct 10;359:127-37. doi: 10.1016/j.gene.2005.07.005.
8
Genomic structure of human GM3 synthase gene (hST3Gal V) and identification of mRNA isoforms in the 5'-untranslated region.人类GM3合酶基因(hST3Gal V)的基因组结构及5'-非翻译区mRNA亚型的鉴定
Gene. 2001 Aug 8;273(2):163-71. doi: 10.1016/s0378-1119(01)00595-9.
9
5' genomic structure of human Sp3.人类Sp3的5'基因组结构。
Mol Biol Evol. 2002 Nov;19(11):2026-9. doi: 10.1093/oxfordjournals.molbev.a004026.
10
5' diversity of human hepatic PXR (NR1I2) transcripts and identification of the major transcription initiation site.人类肝脏孕烷X受体(NR1I2)转录本的5'端多样性及主要转录起始位点的鉴定。
Mol Cell Biochem. 2005 May;273(1-2):79-85. doi: 10.1007/s11010-005-7757-7.

引用本文的文献

1
FINDER: an automated software package to annotate eukaryotic genes from RNA-Seq data and associated protein sequences.FINDER:一个自动化软件包,用于从 RNA-Seq 数据和相关蛋白质序列中注释真核基因。
BMC Bioinformatics. 2021 Apr 20;22(1):205. doi: 10.1186/s12859-021-04120-9.
2
Known and novel post-transcriptional regulatory sequences are conserved across plant families.已知和新的转录后调控序列在植物科之间是保守的。
RNA. 2012 Mar;18(3):368-84. doi: 10.1261/rna.031179.111. Epub 2012 Jan 11.
3
mGene: accurate SVM-based gene finding with an application to nematode genomes.mGene:基于 SVM 的精确基因预测方法及其在线虫基因组中的应用。
Genome Res. 2009 Nov;19(11):2133-43. doi: 10.1101/gr.090597.108. Epub 2009 Jun 29.
4
Finding genes in Schistosoma japonicum: annotating novel genomes with help of extrinsic evidence.日本血吸虫基因的发现:借助外部证据注释新基因组
Nucleic Acids Res. 2009 Apr;37(7):e52. doi: 10.1093/nar/gkp052. Epub 2009 Mar 5.
5
POEM, A 3-dimensional exon taxonomy and patterns in untranslated exons.POEM,一种三维外显子分类法及非翻译外显子中的模式。
BMC Genomics. 2008 Sep 20;9:428. doi: 10.1186/1471-2164-9-428.
6
Towards a molecular dynamics consensus view of B-DNA flexibility.迈向B型DNA柔韧性的分子动力学共识观点。
Nucleic Acids Res. 2008 Apr;36(7):2379-94. doi: 10.1093/nar/gkn082. Epub 2008 Feb 24.
7
Determining promoter location based on DNA structure first-principles calculations.基于DNA结构第一性原理计算确定启动子位置。
Genome Biol. 2007;8(12):R263. doi: 10.1186/gb-2007-8-12-r263.
8
Conrad: gene prediction using conditional random fields.康拉德:使用条件随机场进行基因预测。
Genome Res. 2007 Sep;17(9):1389-98. doi: 10.1101/gr.6558107. Epub 2007 Aug 9.
9
Identification of core promoter modules in Drosophila and their application in accurate transcription start site prediction.果蝇核心启动子模块的鉴定及其在准确转录起始位点预测中的应用。
Nucleic Acids Res. 2006;34(20):5943-50. doi: 10.1093/nar/gkl608. Epub 2006 Oct 26.
10
Vertebrate gene finding from multiple-species alignments using a two-level strategy.使用两级策略从多物种比对中寻找脊椎动物基因。
Genome Biol. 2006;7 Suppl 1(Suppl 1):S6.1-12. doi: 10.1186/gb-2006-7-s1-s6. Epub 2006 Aug 7.

本文引用的文献

1
Using multiple alignments to improve gene prediction.使用多重比对来改进基因预测。
J Comput Biol. 2006 Mar;13(2):379-93. doi: 10.1089/cmb.2006.13.379.
2
A computational and experimental approach to validating annotations and gene predictions in the Drosophila melanogaster genome.一种用于验证黑腹果蝇基因组注释和基因预测的计算与实验方法。
Proc Natl Acad Sci U S A. 2005 Feb 1;102(5):1566-71. doi: 10.1073/pnas.0409421102. Epub 2005 Jan 24.
3
A probabilistic model of 3' end formation in Caenorhabditis elegans.秀丽隐杆线虫3' 端形成的概率模型。
Nucleic Acids Res. 2004 Jun 24;32(11):3392-9. doi: 10.1093/nar/gkh656. Print 2004.
4
Recent advances in gene structure prediction.基因结构预测的最新进展。
Curr Opin Struct Biol. 2004 Jun;14(3):264-72. doi: 10.1016/j.sbi.2004.05.007.
5
Comprehensive analysis of the base composition around the transcription start site in Metazoa.后生动物转录起始位点周围碱基组成的综合分析。
BMC Genomics. 2004 Jun 1;5(1):34. doi: 10.1186/1471-2164-5-34.
6
Aligning multiple genomic sequences with the threaded blockset aligner.使用线程化块集比对器对多个基因组序列进行比对。
Genome Res. 2004 Apr;14(4):708-15. doi: 10.1101/gr.1933104.
7
Multiple-sequence functional annotation and the generalized hidden Markov phylogeny.多序列功能注释与广义隐马尔可夫系统发育
Bioinformatics. 2004 Aug 12;20(12):1850-60. doi: 10.1093/bioinformatics/bth153. Epub 2004 Feb 26.
8
Phylogenetic estimation of context-dependent substitution rates by maximum likelihood.通过最大似然法对上下文依赖替换率进行系统发育估计。
Mol Biol Evol. 2004 Mar;21(3):468-88. doi: 10.1093/molbev/msh039. Epub 2003 Dec 5.
9
Dragon Gene Start Finder identifies approximate locations of the 5' ends of genes.龙基因起始位点查找器可识别基因5'端的大致位置。
Nucleic Acids Res. 2003 Jul 1;31(13):3560-3. doi: 10.1093/nar/gkg570.
10
The methyl-CpG binding domain and the evolving role of DNA methylation in animals.甲基-CpG结合结构域与DNA甲基化在动物中的演变作用。
Trends Genet. 2003 May;19(5):269-77. doi: 10.1016/S0168-9525(03)00080-5.