Human and Vertebrate Analysis and Annotation Team, Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK.
Database (Oxford). 2012 Mar 20;2012:bas014. doi: 10.1093/database/bas014. Print 2012.
While alternative splicing (AS) can potentially expand the functional repertoire of vertebrate genomes, relatively few AS transcripts have been experimentally characterized. We describe our detailed manual annotation of vertebrate genomes, which is generating a publicly available geneset rich in AS. In order to achieve this we have adopted a highly sensitive approach to annotating gene models supported by correctly mapped, canonically spliced transcriptional evidence combined with a highly cautious approach to adding unsupported extensions to models and making decisions on their functional potential. We use information about the predicted functional potential and structural properties of every AS transcript annotated at a protein-coding or non-coding locus to place them into one of eleven subclasses. We describe the incorporation of new sequencing and proteomics technologies into our annotation pipelines, which are used to identify and validate AS. Combining all data sources has led to the production of a rich geneset containing an average of 6.3 AS transcripts for every human multi-exon protein-coding gene. The datasets produced have proved very useful in providing context to studies investigating the functional potential of genes and the effect of variation may have on gene structure and function. DATABASE URL: http://www.ensembl.org/index.html, http://vega.sanger.ac.uk/index.html.
虽然选择性剪接 (AS) 有可能扩展脊椎动物基因组的功能范围,但相对较少的 AS 转录本已被实验表征。我们描述了我们对脊椎动物基因组的详细手动注释,这产生了一个富含 AS 的公开可用基因集。为了实现这一目标,我们采用了一种高度敏感的方法来注释基因模型,该方法得到了正确映射的、规范剪接的转录证据的支持,同时还采用了一种非常谨慎的方法来向模型添加不受支持的扩展,并对其功能潜力做出决策。我们使用每个在蛋白质编码或非编码基因座上注释的 AS 转录本的预测功能潜力和结构特性的信息,将其归入十一个亚类之一。我们描述了将新的测序和蛋白质组学技术纳入我们的注释管道中,这些技术用于识别和验证 AS。结合所有数据源,产生了一个丰富的基因集,其中包含每个人类多外显子蛋白质编码基因的平均 6.3 个 AS 转录本。所产生的数据集在为研究基因的功能潜力以及变异可能对基因结构和功能的影响提供背景方面非常有用。数据库 URL:http://www.ensembl.org/index.html,http://vega.sanger.ac.uk/index.html。