• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

自动基因组注释准确性方面的稳步进展和近期突破。

Steady progress and recent breakthroughs in the accuracy of automated genome annotation.

作者信息

Brent Michael R

机构信息

Center for Genome Sciences, Campus BOX 8510, Washington University, 4444 Forest Park Blvd, Saint Louis, Missouri 63108, USA.

出版信息

Nat Rev Genet. 2008 Jan;9(1):62-73. doi: 10.1038/nrg2220.

DOI:10.1038/nrg2220
PMID:18087260
Abstract

The sequencing of large, complex genomes has become routine, but understanding how sequences relate to biological function is less straightforward. Although much attention is focused on how to annotate genomic features such as developmental enhancers and non-coding RNAs, there is still no higher eukaryote for which we know the correct exon-intron structure of at least one ORF for each gene. Despite this uncomfortable truth, genome annotation has made remarkable progress since the first drafts of the human genome were analysed. By combining several computational and experimental methods, we are now closer to producing complete and accurate gene catalogues than ever before.

摘要

对大型复杂基因组进行测序已成为常规操作,但理解序列与生物学功能之间的关系却没那么简单直接。尽管目前很多注意力都集中在如何注释诸如发育增强子和非编码RNA等基因组特征上,但对于任何一种高等真核生物,我们仍无法知晓每个基因至少一个开放阅读框(ORF)的正确外显子-内含子结构。尽管存在这一令人不安的事实,但自人类基因组初稿被分析以来,基因组注释已经取得了显著进展。通过结合多种计算方法和实验方法,我们如今比以往任何时候都更接近生成完整且准确的基因目录。

相似文献

1
Steady progress and recent breakthroughs in the accuracy of automated genome annotation.自动基因组注释准确性方面的稳步进展和近期突破。
Nat Rev Genet. 2008 Jan;9(1):62-73. doi: 10.1038/nrg2220.
2
Genome annotation past, present, and future: how to define an ORF at each locus.基因组注释的过去、现在与未来:如何在每个基因座定义一个开放阅读框。
Genome Res. 2005 Dec;15(12):1777-86. doi: 10.1101/gr.3866105.
3
The use of covariance models to annotate RNAs in whole genomes.使用协方差模型对全基因组中的RNA进行注释。
Brief Funct Genomic Proteomic. 2009 Nov;8(6):444-50. doi: 10.1093/bfgp/elp042.
4
Strategies for whole microbial genome sequencing and analysis.全微生物基因组测序与分析策略。
Electrophoresis. 1997 Aug;18(8):1207-16. doi: 10.1002/elps.1150180803.
5
Origination of the split structure of spliceosomal genes from random genetic sequences.剪接体基因的分裂结构源于随机遗传序列。
PLoS One. 2008;3(10):e3456. doi: 10.1371/journal.pone.0003456. Epub 2008 Oct 20.
6
Genome-wide validation of Magnaporthe grisea gene structures based on transcription evidence.基于转录证据对稻瘟病菌基因结构进行全基因组验证。
FEBS Lett. 2009 Feb 18;583(4):797-800. doi: 10.1016/j.febslet.2009.01.041. Epub 2009 Jan 30.
7
CGAS: a comparative genome annotation system.CGAS:一种比较基因组注释系统。
Methods Mol Biol. 2007;395:133-46.
8
Constructing the landscape of the mammalian transcriptome.构建哺乳动物转录组图谱。
J Exp Biol. 2007 May;210(Pt 9):1497-506. doi: 10.1242/jeb.000406.
9
First exons and introns--a survey of GC content and gene structure in the human genome.首个外显子与内含子——人类基因组中GC含量及基因结构的一项调查
In Silico Biol. 2006;6(3):237-42.
10
Prediction of protein coding regions by the 3-base periodicity analysis of a DNA sequence.通过DNA序列的三碱基周期性分析预测蛋白质编码区域。
J Theor Biol. 2007 Aug 21;247(4):687-94. doi: 10.1016/j.jtbi.2007.03.038. Epub 2007 Apr 10.

引用本文的文献

1
Identification and characterisation of two functional antibiotic MATE efflux pumps in the archaeon Halorubrum amylolyticum.嗜盐解淀粉嗜盐碱杆菌中两个功能性抗生素MATE外排泵的鉴定与表征
NPJ Antimicrob Resist. 2024 Aug 2;2(1):21. doi: 10.1038/s44259-024-00036-5.
2
Prediction of transcript isoforms and identification of tissue-specific genes in cucumber.黄瓜转录本异构体的预测及组织特异性基因的鉴定。
BMC Genomics. 2025 Jan 10;26(1):25. doi: 10.1186/s12864-025-11212-w.
3
Comparative Genome Annotation.比较基因组注释。
Methods Mol Biol. 2024;2802:165-187. doi: 10.1007/978-1-0716-3838-5_7.
4
Chromosome-level assembly and analysis of Camelina neglecta: a novel diploid model for Camelina biotechnology research.亚麻荠(Camelina neglecta)的染色体水平组装与分析:亚麻荠生物技术研究的新型二倍体模型
Biotechnol Biofuels Bioprod. 2024 Jan 31;17(1):17. doi: 10.1186/s13068-024-02466-9.
5
Single-cell analysis technologies for cancer research: from tumor-specific single cell discovery to cancer therapy.用于癌症研究的单细胞分析技术:从肿瘤特异性单细胞发现到癌症治疗
Front Genet. 2023 Oct 12;14:1276959. doi: 10.3389/fgene.2023.1276959. eCollection 2023.
6
In-Depth Annotation of the Reveals the Presence of Several Alternative ORFs That Could Encode for Motif-Rich Peptides.深入注释揭示了存在几个可能编码富含基序肽的替代 ORF。
Cells. 2021 Nov 2;10(11):2983. doi: 10.3390/cells10112983.
7
ceRNA Network Regulation of TGF-β, WNT, FOXO, Hedgehog Pathways in the Pharynx of .TGF-β、WNT、FOXO、Hedgehog 通路在咽中的 ceRNA 网络调控
Int J Mol Sci. 2021 Mar 28;22(7):3497. doi: 10.3390/ijms22073497.
8
Comprehensive analysis of pseudogene expression and potential pathogenesis in ovarian serous cystadenocarcinoma.卵巢浆液性囊腺癌中假基因表达及潜在发病机制的综合分析
Cancer Cell Int. 2020 Jun 10;20:229. doi: 10.1186/s12935-020-01324-6. eCollection 2020.
9
Re-recognition of pseudogenes: From molecular to clinical applications.假基因的再识别:从分子到临床应用。
Theranostics. 2020 Jan 1;10(4):1479-1499. doi: 10.7150/thno.40659. eCollection 2020.
10
Repertoire-wide gene structure analyses: a case study comparing automatically predicted and manually annotated gene models.全面的基因结构分析:自动预测和手动注释基因模型的比较案例研究。
BMC Genomics. 2019 Oct 17;20(1):753. doi: 10.1186/s12864-019-6064-8.