• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

后 ENCODE 时代的功能转录组学。

Functional transcriptomics in the post-ENCODE era.

机构信息

Department of Informatics, Wellcome Trust Sanger Institute, Hinxton CB10 1SA, United Kingdom.

出版信息

Genome Res. 2013 Dec;23(12):1961-73. doi: 10.1101/gr.161315.113. Epub 2013 Oct 30.

DOI:10.1101/gr.161315.113
PMID:24172201
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3847767/
Abstract

The last decade has seen tremendous effort committed to the annotation of the human genome sequence, most notably perhaps in the form of the ENCODE project. One of the major findings of ENCODE, and other genome analysis projects, is that the human transcriptome is far larger and more complex than previously thought. This complexity manifests, for example, as alternative splicing within protein-coding genes, as well as in the discovery of thousands of long noncoding RNAs. It is also possible that significant numbers of human transcripts have not yet been described by annotation projects, while existing transcript models are frequently incomplete. The question as to what proportion of this complexity is truly functional remains open, however, and this ambiguity presents a serious challenge to genome scientists. In this article, we will discuss the current state of human transcriptome annotation, drawing on our experience gained in generating the GENCODE gene annotation set. We highlight the gaps in our knowledge of transcript functionality that remain, and consider the potential computational and experimental strategies that can be used to help close them. We propose that an understanding of the true overlap between transcriptional complexity and functionality will not be gained in the short term. However, significant steps toward obtaining this knowledge can now be taken by using an integrated strategy, combining all of the experimental resources at our disposal.

摘要

过去十年,人们投入了大量精力对人类基因组序列进行注释,其中最著名的或许是 ENCODE 项目。ENCODE 和其他基因组分析项目的主要发现之一是,人类转录组远比以前认为的更大、更复杂。这种复杂性体现在蛋白质编码基因的选择性剪接中,也体现在发现了数千种长非编码 RNA 上。也有可能大量的人类转录本尚未被注释项目所描述,而现有的转录本模型也经常不完整。然而,目前仍不清楚这种复杂性有多大比例是真正有功能的,这种不确定性给基因组科学家带来了严重的挑战。在本文中,我们将根据我们在生成 GENCODE 基因注释集方面的经验,讨论人类转录组注释的现状。我们强调了我们对转录本功能的了解仍然存在的差距,并考虑了可以用来帮助弥补这些差距的潜在计算和实验策略。我们认为,在短期内不太可能了解转录复杂性和功能之间的真正重叠。然而,现在可以通过采用综合策略,结合我们所能利用的所有实验资源,朝着获得这些知识迈出重要的一步。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79c5/3847767/8a4303feb2b1/1961fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79c5/3847767/ae466bf0415a/1961fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79c5/3847767/cbbf540443e5/1961fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79c5/3847767/44203e20cfd1/1961fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79c5/3847767/36a0f463a489/1961fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79c5/3847767/f6ee68bf5343/1961fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79c5/3847767/1abb97aebca4/1961fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79c5/3847767/8a4303feb2b1/1961fig7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79c5/3847767/ae466bf0415a/1961fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79c5/3847767/cbbf540443e5/1961fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79c5/3847767/44203e20cfd1/1961fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79c5/3847767/36a0f463a489/1961fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79c5/3847767/f6ee68bf5343/1961fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79c5/3847767/1abb97aebca4/1961fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79c5/3847767/8a4303feb2b1/1961fig7.jpg

相似文献

1
Functional transcriptomics in the post-ENCODE era.后 ENCODE 时代的功能转录组学。
Genome Res. 2013 Dec;23(12):1961-73. doi: 10.1101/gr.161315.113. Epub 2013 Oct 30.
2
GENCODE: the reference human genome annotation for The ENCODE Project.GENCODE:ENCODE 项目的人类参考基因组注释。
Genome Res. 2012 Sep;22(9):1760-74. doi: 10.1101/gr.135350.111.
3
Comparison of GENCODE and RefSeq gene annotation and the impact of reference geneset on variant effect prediction.GENCODE与RefSeq基因注释的比较以及参考基因集对变异效应预测的影响。
BMC Genomics. 2015;16 Suppl 8(Suppl 8):S2. doi: 10.1186/1471-2164-16-S8-S2. Epub 2015 Jun 18.
4
The GENCODE v7 catalog of human long noncoding RNAs: analysis of their gene structure, evolution, and expression.GENCODE v7 人类长非编码 RNA 目录:基因结构、进化和表达分析。
Genome Res. 2012 Sep;22(9):1775-89. doi: 10.1101/gr.132159.111.
5
Pinstripe: a suite of programs for integrating transcriptomic and proteomic datasets identifies novel proteins and improves differentiation of protein-coding and non-coding genes.Pinstripe:一套用于整合转录组和蛋白质组数据集的程序,可识别新的蛋白质,并提高蛋白质编码和非编码基因的区分能力。
Bioinformatics. 2012 Dec 1;28(23):3042-50. doi: 10.1093/bioinformatics/bts582. Epub 2012 Oct 7.
6
GENCODE 2021.GENCODE 2021.
Nucleic Acids Res. 2021 Jan 8;49(D1):D916-D923. doi: 10.1093/nar/gkaa1087.
7
EGASP: the human ENCODE Genome Annotation Assessment Project.EGASP:人类ENCODE基因组注释评估项目。
Genome Biol. 2006;7 Suppl 1(Suppl 1):S2.1-31. doi: 10.1186/gb-2006-7-s1-s2. Epub 2006 Aug 7.
8
Multi-omics annotation of human long non-coding RNAs.人类长非编码 RNA 的多组学注释。
Biochem Soc Trans. 2020 Aug 28;48(4):1545-1556. doi: 10.1042/BST20191063.
9
De novo reconstruction of the Toxoplasma gondii transcriptome improves on the current genome annotation and reveals alternatively spliced transcripts and putative long non-coding RNAs.新生重建弓形虫转录组提高了目前的基因组注释,并揭示了选择性剪接的转录本和潜在的长非编码 RNA。
BMC Genomics. 2012 Dec 12;13:696. doi: 10.1186/1471-2164-13-696.
10
Comparative proteomics reveals a significant bias toward alternative protein isoforms with conserved structure and function.比较蛋白质组学揭示了一种明显的偏向,即偏向于具有保守结构和功能的替代蛋白质异构体。
Mol Biol Evol. 2012 Sep;29(9):2265-83. doi: 10.1093/molbev/mss100. Epub 2012 Mar 22.

引用本文的文献

1
Housekeeping protein-coding genes interrogated with tissue and individual variations.用组织和个体差异来研究管家蛋白编码基因。
Sci Rep. 2024 May 30;14(1):12454. doi: 10.1038/s41598-024-63269-4.
2
Tissue-specific regulation of gene expression via unproductive splicing.通过无效剪接实现组织特异性基因表达调控。
Nucleic Acids Res. 2023 Apr 24;51(7):3055-3066. doi: 10.1093/nar/gkad161.
3
Methods to Analyze the Non-Coding RNA Interactome-Recent Advances and Challenges.分析非编码RNA相互作用组的方法——最新进展与挑战

本文引用的文献

1
VlincRNAs controlled by retroviral elements are a hallmark of pluripotency and cancer.受逆转录元件控制的 vlincRNAs 是多能性和癌症的标志。
Genome Biol. 2013 Jul 22;14(7):R73. doi: 10.1186/gb-2013-14-7-r73.
2
Pervasive transcription of the human genome produces thousands of previously unidentified long intergenic noncoding RNAs.人类基因组的普遍转录产生了数千个以前未被识别的长基因间非编码 RNA。
PLoS Genet. 2013 Jun;9(6):e1003569. doi: 10.1371/journal.pgen.1003569. Epub 2013 Jun 20.
3
Understanding the regulatory and transcriptional complexity of the genome through structure.
Front Genet. 2022 Mar 17;13:857759. doi: 10.3389/fgene.2022.857759. eCollection 2022.
4
Long Non-Coding RNA Neighbor of BRCA1 Gene 2: A Crucial Regulator in Cancer Biology.BRCA1基因2的长链非编码RNA邻居:癌症生物学中的关键调节因子
Front Oncol. 2021 Dec 2;11:783526. doi: 10.3389/fonc.2021.783526. eCollection 2021.
5
Nanopore sequencing reveals endogenous NMD-targeted isoforms in human cells.纳米孔测序揭示了人类细胞中内源性 NMD 靶向异构体。
Genome Biol. 2021 Aug 13;22(1):223. doi: 10.1186/s13059-021-02439-3.
6
LncRNA HOTAIR: A Potential Prognostic Factor and Therapeutic Target in Human Cancers.长链非编码RNA HOTAIR:人类癌症中的一种潜在预后因素和治疗靶点。
Front Oncol. 2021 Jul 22;11:679244. doi: 10.3389/fonc.2021.679244. eCollection 2021.
7
FuncPEP: A Database of Functional Peptides Encoded by Non-Coding RNAs.FuncPEP:一个由非编码RNA编码的功能性肽数据库。
Noncoding RNA. 2020 Sep 23;6(4):41. doi: 10.3390/ncrna6040041.
8
Machine Learning Classifiers for Endometriosis Using Transcriptomics and Methylomics Data.利用转录组学和甲基组学数据的子宫内膜异位症机器学习分类器
Front Genet. 2019 Sep 4;10:766. doi: 10.3389/fgene.2019.00766. eCollection 2019.
9
Discovery of high-confidence human protein-coding genes and exons by whole-genome PhyloCSF helps elucidate 118 GWAS loci.全基因组 PhyloCSF 发现高可信度的人类蛋白编码基因和外显子,有助于阐明 118 个 GWAS 基因座。
Genome Res. 2019 Dec;29(12):2073-2087. doi: 10.1101/gr.246462.118. Epub 2019 Sep 19.
10
Integrative transcriptomic analysis suggests new autoregulatory splicing events coupled with nonsense-mediated mRNA decay.整合转录组分析表明新的自动调控剪接事件与无义介导的 mRNA 衰变有关。
Nucleic Acids Res. 2019 Jun 4;47(10):5293-5306. doi: 10.1093/nar/gkz193.
通过结构来理解基因组的调控和转录复杂性。
Genome Res. 2013 Jul;23(7):1081-8. doi: 10.1101/gr.156612.113.
4
Transcriptome analysis of human tissues and cell lines reveals one dominant transcript per gene.人类组织和细胞系的转录组分析显示每个基因都有一个主要转录本。
Genome Biol. 2013 Jul 1;14(7):R70. doi: 10.1186/gb-2013-14-7-r70.
5
Gene regulation by the act of long non-coding RNA transcription.长非编码 RNA 转录对基因的调控。
BMC Biol. 2013 May 30;11:59. doi: 10.1186/1741-7007-11-59.
6
Non-polyadenylated transcription in embryonic stem cells reveals novel non-coding RNA related to pluripotency and differentiation.胚胎干细胞中的非多聚腺苷酸化转录揭示了与多能性和分化相关的新型非编码 RNA。
Nucleic Acids Res. 2013 Jul;41(12):6300-15. doi: 10.1093/nar/gkt316. Epub 2013 Apr 29.
7
A perspective on mammalian upstream open reading frame function.哺乳动物上游开放阅读框功能的研究视角。
Int J Biochem Cell Biol. 2013 Aug;45(8):1690-700. doi: 10.1016/j.biocel.2013.04.020. Epub 2013 Apr 25.
8
DNA: Celebrate the unknowns.DNA:颂扬未知之物。
Nature. 2013 Apr 25;496(7446):419-20. doi: 10.1038/496419a.
9
Widespread and extensive lengthening of 3' UTRs in the mammalian brain.哺乳动物大脑中 3'UTR 的广泛和广泛延长。
Genome Res. 2013 May;23(5):812-25. doi: 10.1101/gr.146886.112. Epub 2013 Mar 21.
10
Is junk DNA bunk? A critique of ENCODE.垃圾 DNA 是无稽之谈吗?对 ENCODE 的批判。
Proc Natl Acad Sci U S A. 2013 Apr 2;110(14):5294-300. doi: 10.1073/pnas.1221376110. Epub 2013 Mar 11.