高等真核生物基因注释的进展情况。

The state of play in higher eukaryote gene annotation.

作者信息

Mudge Jonathan M, Harrow Jennifer

机构信息

Department of Computational Genomics, Wellcome Trust Sanger Institute, Hinxton CB10 1SA, UK.

Illumina Cambridge Ltd, Chesterford Research Park, Little Chesterford, Saffron Walden CB10 1 XL, UK.

出版信息

Nat Rev Genet. 2016 Dec;17(12):758-772. doi: 10.1038/nrg.2016.119. Epub 2016 Oct 24.

DOI:10.1038/nrg.2016.119

PMID:27773922

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5876476/

Abstract

A genome sequence is worthless if it cannot be deciphered; therefore, efforts to describe - or 'annotate' - genes began as soon as DNA sequences became available. Whereas early work focused on individual protein-coding genes, the modern genomic ocean is a complex maelstrom of alternative splicing, non-coding transcription and pseudogenes. Scientists - from clinicians to evolutionary biologists - need to navigate these waters, and this has led to the design of high-throughput, computationally driven annotation projects. The catalogues that are being produced are key resources for genome exploration, especially as they become integrated with expression, epigenomic and variation data sets. Their creation, however, remains challenging.

摘要

如果一个基因组序列无法被解读，那它就是毫无价值的；因此，早在DNA序列可用之时，人们就开始了描述——或者说“注释”——基因的工作。早期的工作聚焦于单个蛋白质编码基因，而现代基因组领域则是一个由可变剪接、非编码转录和假基因构成的复杂漩涡。从临床医生到进化生物学家，科学家们都需要在这片领域中探索前行，这也促使了高通量、计算驱动的注释项目的设计。正在生成的目录是基因组探索的关键资源，尤其是当它们与表达、表观基因组和变异数据集整合在一起时。然而，创建这些目录仍然具有挑战性。

相似文献

The state of play in higher eukaryote gene annotation.

Nat Rev Genet. 2016 Dec;17(12):758-772. doi: 10.1038/nrg.2016.119. Epub 2016 Oct 24.

Roadmap for annotating transposable elements in eukaryote genomes.

Methods Mol Biol. 2012;859:53-68. doi: 10.1007/978-1-61779-603-6_3.

GENCODE Pseudogenes.

Methods Mol Biol. 2021;2324:67-82. doi: 10.1007/978-1-0716-1503-4_5.

Computational Methods for Pseudogene Annotation Based on Sequence Homology.

Methods Mol Biol. 2021;2324:35-48. doi: 10.1007/978-1-0716-1503-4_3.

Segway 2.0: Gaussian mixture models and minibatch training.

Bioinformatics. 2018 Feb 15;34(4):669-671. doi: 10.1093/bioinformatics/btx603.

A beginner's guide to eukaryotic genome annotation.

Nat Rev Genet. 2012 Apr 18;13(5):329-42. doi: 10.1038/nrg3174.

An investigation of causes of false positive single nucleotide polymorphisms using simulated reads from a small eukaryote genome.

BMC Bioinformatics. 2015 Nov 11;16:382. doi: 10.1186/s12859-015-0801-z.

AGORA: organellar genome annotation from the amino acid and nucleotide references.

Bioinformatics. 2018 Aug 1;34(15):2661-2663. doi: 10.1093/bioinformatics/bty196.

Genome annotation for clinical genomic diagnostics: strengths and weaknesses.

Genome Med. 2017 May 30;9(1):49. doi: 10.1186/s13073-017-0441-1.

Coding Exon-Structure Aware Realigner (CESAR): Utilizing Genome Alignments for Comparative Gene Annotation.

Methods Mol Biol. 2019;1962:179-191. doi: 10.1007/978-1-4939-9173-0_10.

引用本文的文献

Gene model for the ortholog of in .

bioRxiv. 2025 Sep 6:2025.08.25.672213. doi: 10.1101/2025.08.25.672213.

Gene model for the ortholog of in .

MicroPubl Biol. 2025 Aug 18;2025. doi: 10.17912/micropub.biology.001096. eCollection 2025.

Gene model for the ortholog of in .

MicroPubl Biol. 2025 Aug 16;2025. doi: 10.17912/micropub.biology.001001. eCollection 2025.

Gene model for the ortholog of in .

bioRxiv. 2025 Aug 20:2025.08.16.670678. doi: 10.1101/2025.08.16.670678.

Gene model for the ortholog of in .

MicroPubl Biol. 2025 Aug 12;2025. doi: 10.17912/micropub.biology.000901. eCollection 2025.

Gene model for the ortholog of in .

MicroPubl Biol. 2025 Aug 6;2025. doi: 10.17912/micropub.biology.001094. eCollection 2025.

Gene model for the ortholog of in .

MicroPubl Biol. 2025 Aug 4;2025. doi: 10.17912/micropub.biology.001027. eCollection 2025.

Gene model for the ortholog of in .

bioRxiv. 2025 Aug 12:2025.08.06.668967. doi: 10.1101/2025.08.06.668967.

Gene model for the ortholog of in .

bioRxiv. 2025 Aug 12:2025.08.06.668985. doi: 10.1101/2025.08.06.668985.

Gene model for the ortholog of in .

bioRxiv. 2025 Aug 6:2025.08.04.668519. doi: 10.1101/2025.08.04.668519.

本文引用的文献

CHiCAGO: robust detection of DNA looping interactions in Capture Hi-C data.

Genome Biol. 2016 Jun 15;17(1):127. doi: 10.1186/s13059-016-0992-2.

The Ensembl Variant Effect Predictor.

Genome Biol. 2016 Jun 6;17(1):122. doi: 10.1186/s13059-016-0974-4.

Improving GENCODE reference gene annotation using a high-stringency proteogenomics workflow.

Nat Commun. 2016 Jun 2;7:11778. doi: 10.1038/ncomms11778.

Genome-culture coevolution promotes rapid divergence of killer whale ecotypes.

Nat Commun. 2016 May 31;7:11693. doi: 10.1038/ncomms11693.

Thousands of novel translated open reading frames in humans inferred by ribosome footprint profiling.

Elife. 2016 May 27;5:e13328. doi: 10.7554/eLife.13328.

spongeScan: A web for detecting microRNA binding elements in lncRNA sequences.

Nucleic Acids Res. 2016 Jul 8;44(W1):W176-80. doi: 10.1093/nar/gkw443. Epub 2016 May 19.

Ribosome Footprint Profiling of Translation throughout the Genome.

Cell. 2016 Mar 24;165(1):22-33. doi: 10.1016/j.cell.2016.02.066.

Alternative Polyadenylation of mRNAs: 3'-Untranslated Region Matters in Gene Expression.

Mol Cells. 2016 Apr 30;39(4):281-5. doi: 10.14348/molcells.2016.0035. Epub 2016 Feb 25.

Ensembl regulation resources.

Database (Oxford). 2016 Feb 17;2016. doi: 10.1093/database/bav119. Print 2016.

Widespread Expansion of Protein Interaction Capabilities by Alternative Splicing.

Cell. 2016 Feb 11;164(4):805-17. doi: 10.1016/j.cell.2016.01.029.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

高等真核生物基因注释的进展情况。

The state of play in higher eukaryote gene annotation.

作者信息

Mudge Jonathan M, Harrow Jennifer

机构信息

Department of Computational Genomics, Wellcome Trust Sanger Institute, Hinxton CB10 1SA, UK.

Illumina Cambridge Ltd, Chesterford Research Park, Little Chesterford, Saffron Walden CB10 1 XL, UK.

出版信息

Nat Rev Genet. 2016 Dec;17(12):758-772. doi: 10.1038/nrg.2016.119. Epub 2016 Oct 24.

DOI:10.1038/nrg.2016.119

PMID:27773922

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5876476/

Abstract

摘要

高等真核生物基因注释的进展情况。

The state of play in higher eukaryote gene annotation.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

高等真核生物基因注释的进展情况。

The state of play in higher eukaryote gene annotation.

作者信息

机构信息

出版信息