Cardiovascular Proteomics Laboratory, Centro Nacional de Investigaciones Cardiovasculares Carlos III (CNIC), 28029 Madrid, Spain.
Bioinformatics Institute, Spanish National Cancer Research Centre (CNIO), Madrid, 28029, Spain.
Nucleic Acids Res. 2022 Jan 7;50(D1):D54-D59. doi: 10.1093/nar/gkab1058.
APPRIS (https://appris.bioinfo.cnio.es) is a well-established database housing annotations for protein isoforms for a range of species. APPRIS selects principal isoforms based on protein structure and function features and on cross-species conservation. Most coding genes produce a single main protein isoform and the principal isoforms chosen by the APPRIS database best represent this main cellular isoform. Human genetic data, experimental protein evidence and the distribution of clinical variants all support the relevance of APPRIS principal isoforms. APPRIS annotations and principal isoforms have now been expanded to 10 model organisms. In this paper we highlight the most recent updates to the database. APPRIS annotations have been generated for two new species, cow and chicken, the protein structural information has been augmented with reliable models from the EMBL-EBI AlphaFold database, and we have substantially expanded the confirmatory proteomics evidence available for the human genome. The most significant change in APPRIS has been the implementation of TRIFID functional isoform scores. TRIFID functional scores are assigned to all splice isoforms, and APPRIS uses the TRIFID functional scores and proteomics evidence to determine principal isoforms when core methods cannot.
APPRIS(https://appris.bioinfo.cnio.es)是一个成熟的数据库,为多种物种的蛋白质异构体提供注释。APPRIS 根据蛋白质结构和功能特征以及跨物种保守性选择主要异构体。大多数编码基因产生单个主要蛋白质异构体,而 APPRIS 数据库选择的主要异构体最能代表这种主要细胞异构体。人类遗传数据、实验蛋白质证据和临床变异体的分布都支持 APPRIS 主要异构体的相关性。APPRIS 注释和主要异构体现在已扩展到 10 个模式生物。本文重点介绍数据库的最新更新。为两个新物种——牛和鸡生成了 APPRIS 注释,用 EMBL-EBI AlphaFold 数据库的可靠模型扩充了蛋白质结构信息,并大大扩展了人类基因组的确认蛋白质组学证据。APPRIS 的最大变化是实施了 TRIFID 功能异构体分数。TRIFID 功能分数被分配给所有剪接异构体,当核心方法无法确定时,APPRIS 使用 TRIFID 功能分数和蛋白质组学证据来确定主要异构体。