• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

PPLine:蛋白质基因组学背景下用于单核苷酸多态性、单氨基酸多态性和剪接变体检测的自动化流程

PPLine: An Automated Pipeline for SNP, SAP, and Splice Variant Detection in the Context of Proteogenomics.

作者信息

Krasnov George Sergeevich, Dmitriev Alexey Alexandrovich, Kudryavtseva Anna Viktorovna, Shargunov Alexander Valerievich, Karpov Dmitry Sergeevich, Uroshlev Leonid Andreevich, Melnikova Natalya Vladimirovna, Blinov Vladimir Mikhailovich, Poverennaya Ekaterina Vladimirovna, Archakov Alexander Ivanovich, Lisitsa Andrey Valerievich, Ponomarenko Elena Alexandrovna

机构信息

Engelhardt Institute of Molecular Biology, Russian Academy of Sciences , Moscow, 111991 Russia.

Orekhovich Institute of Biomedical Chemistry, Russian Academy of Medical Sciences , Moscow, 119121 Russia.

出版信息

J Proteome Res. 2015 Sep 4;14(9):3729-37. doi: 10.1021/acs.jproteome.5b00490. Epub 2015 Aug 3.

DOI:10.1021/acs.jproteome.5b00490
PMID:26147802
Abstract

The fundamental mission of the Chromosome-Centric Human Proteome Project (C-HPP) is the research of human proteome diversity, including rare variants. Liver tissues, HepG2 cells, and plasma were selected as one of the major objects for C-HPP studies. The proteogenomic approach, a recently introduced technique, is a powerful method for predicting and validating proteoforms coming from alternative splicing, mutations, and transcript editing. We developed PPLine, a Python-based proteogenomic pipeline providing automated single-amino-acid polymorphism (SAP), indel, and alternative-spliced-variants discovery based on raw transcriptome and exome sequence data, single-nucleotide polymorphism (SNP) annotation and filtration, and the prediction of proteotypic peptides (available at https://sourceforge.net/projects/ppline). In this work, we performed deep transcriptome sequencing of HepG2 cells and liver tissues using two platforms: Illumina HiSeq and Applied Biosystems SOLiD. Using PPLine, we revealed 7756 SAP and indels for HepG2 cells and liver (including 659 variants nonannotated in dbSNP). We found 17 indels in transcripts associated with the translation of alternate reading frames (ARF) longer than 300 bp. The ARF products of two genes, SLMO1 and TMEM8A, demonstrate signatures of caspase-binding domain and Gcn5-related N-acetyltransferase. Alternative splicing analysis predicted novel proteoforms encoded by 203 (liver) and 475 (HepG2) genes according to both Illumina and SOLiD data. The results of the present work represent a basis for subsequent proteomic studies by the C-HPP consortium.

摘要

以染色体为中心的人类蛋白质组计划(C-HPP)的基本任务是研究人类蛋白质组多样性,包括罕见变异体。肝脏组织、HepG2细胞和血浆被选为C-HPP研究的主要对象之一。蛋白质基因组学方法是一种最近引入的技术,是预测和验证来自可变剪接、突变和转录本编辑的蛋白质异构体的有力方法。我们开发了PPLine,这是一个基于Python的蛋白质基因组学流程,可基于原始转录组和外显子组序列数据自动发现单氨基酸多态性(SAP)、插入缺失以及可变剪接变体,进行单核苷酸多态性(SNP)注释和过滤,并预测蛋白质型肽段(可从https://sourceforge.net/projects/ppline获取)。在这项工作中,我们使用Illumina HiSeq和Applied Biosystems SOLiD两个平台对HepG2细胞和肝脏组织进行了深度转录组测序。使用PPLine,我们在HepG2细胞和肝脏中发现了7756个SAP和插入缺失(包括659个在dbSNP中未注释的变异体)。我们在与长度超过300 bp的交替阅读框(ARF)翻译相关的转录本中发现了17个插入缺失。两个基因SLMO1和TMEM8A的ARF产物显示出胱天蛋白酶结合域和Gcn5相关N-乙酰转移酶的特征。根据Illumina和SOLiD数据,可变剪接分析预测了由203个(肝脏)和475个(HepG2)基因编码的新型蛋白质异构体。本研究结果为C-HPP联盟后续的蛋白质组学研究奠定了基础。

相似文献

1
PPLine: An Automated Pipeline for SNP, SAP, and Splice Variant Detection in the Context of Proteogenomics.PPLine:蛋白质基因组学背景下用于单核苷酸多态性、单氨基酸多态性和剪接变体检测的自动化流程
J Proteome Res. 2015 Sep 4;14(9):3729-37. doi: 10.1021/acs.jproteome.5b00490. Epub 2015 Aug 3.
2
Chromosome-Based Proteomic Study for Identifying Novel Protein Variants from Human Hippocampal Tissue Using Customized neXtProt and GENCODE Databases.基于染色体的蛋白质组学研究,利用定制的neXtProt和GENCODE数据库从人类海马组织中鉴定新型蛋白质变体。
J Proteome Res. 2015 Dec 4;14(12):5028-37. doi: 10.1021/acs.jproteome.5b00472. Epub 2015 Nov 16.
3
Tissue-specific alternative splicing analysis reveals the diversity of chromosome 18 transcriptome.组织特异性可变剪接分析揭示了18号染色体转录组的多样性。
J Proteome Res. 2014 Jan 3;13(1):173-82. doi: 10.1021/pr400808u. Epub 2013 Dec 9.
4
Integrated Proteomic Pipeline Using Multiple Search Engines for a Proteogenomic Study with a Controlled Protein False Discovery Rate.使用多种搜索引擎的集成蛋白质组学流程用于蛋白质基因组学研究并控制蛋白质错误发现率
J Proteome Res. 2016 Nov 4;15(11):4082-4090. doi: 10.1021/acs.jproteome.6b00376. Epub 2016 Aug 30.
5
Identification of Differentially Expressed Splice Variants by the Proteogenomic Pipeline Splicify.通过 Proteogenomic 管道 Splicify 鉴定差异表达的剪接变体。
Mol Cell Proteomics. 2017 Oct;16(10):1850-1863. doi: 10.1074/mcp.TIR117.000056. Epub 2017 Jul 26.
6
Proteogenomic Study beyond Chromosome 9: New Insight into Expressed Variant Proteome and Transcriptome in Human Lung Adenocarcinoma Tissues.9号染色体以外的蛋白质基因组学研究:对人肺腺癌组织中表达的变异蛋白质组和转录组的新见解。
J Proteome Res. 2015 Dec 4;14(12):5007-16. doi: 10.1021/acs.jproteome.5b00544. Epub 2015 Nov 19.
7
Next Generation Proteomic Pipeline for Chromosome-Based Proteomic Research Using NeXtProt and GENCODE Databases.基于 NeXtProt 和 GENCODE 数据库的染色体蛋白质组学研究下一代蛋白质组学管道。
J Proteome Res. 2017 Dec 1;16(12):4425-4434. doi: 10.1021/acs.jproteome.7b00223. Epub 2017 Oct 13.
8
Bridging the Chromosome-centric and Biology/Disease-driven Human Proteome Projects: Accessible and Automated Tools for Interpreting the Biological and Pathological Impact of Protein Sequence Variants Detected via Proteogenomics.桥接以染色体为中心和以生物学/疾病为驱动的人类蛋白质组计划:用于解释通过蛋白质基因组学检测到的蛋白质序列变异的生物学和病理学影响的可及和自动化工具。
J Proteome Res. 2018 Dec 7;17(12):4329-4336. doi: 10.1021/acs.jproteome.8b00404. Epub 2018 Sep 5.
9
PGTools: A Software Suite for Proteogenomic Data Analysis and Visualization.PGTools:用于蛋白质基因组数据分析与可视化的软件套件。
J Proteome Res. 2015 May 1;14(5):2255-66. doi: 10.1021/acs.jproteome.5b00029. Epub 2015 Apr 17.
10
Protannotator: a semiautomated pipeline for chromosome-wise functional annotation of the "missing" human proteome.Protannotator:一种用于对“缺失”的人类蛋白质组进行染色体水平功能注释的半自动流程。
J Proteome Res. 2014 Jan 3;13(1):76-83. doi: 10.1021/pr400794x. Epub 2013 Dec 13.

引用本文的文献

1
The Biological Consequences of the Knockout of Genes Involved in the Synthesis and Metabolism of HS in .HS合成与代谢相关基因敲除的生物学后果
Antioxidants (Basel). 2025 Jun 6;14(6):693. doi: 10.3390/antiox14060693.
2
Similar Normalizing Effect of HSP70 and YB-1 Stress Proteins on the Brain Transcription of a Mouse Model of Alzheimer's Disease.热休克蛋白70(HSP70)和YB-1应激蛋白对阿尔茨海默病小鼠模型大脑转录的类似归一化作用。
Mol Neurobiol. 2025 Jun 24. doi: 10.1007/s12035-025-05135-6.
3
Transcriptome map and genome annotation of flax line 3896.
亚麻品系3896的转录组图谱与基因组注释
Front Plant Sci. 2025 May 16;16:1520832. doi: 10.3389/fpls.2025.1520832. eCollection 2025.
4
Genetic diversity of varieties with different fruit characteristics based on whole-genome sequencing.基于全基因组测序的不同果实特征品种的遗传多样性
Front Plant Sci. 2025 Mar 4;16:1542552. doi: 10.3389/fpls.2025.1542552. eCollection 2025.
5
Identification and Analysis of , , , and Gene Families in .中对、、和基因家族的鉴定与分析。
Plants (Basel). 2024 Dec 13;13(24):3486. doi: 10.3390/plants13243486.
6
Nanopore Data-Driven Chromosome-Level Assembly of Flax Genome.基于纳米孔数据驱动的亚麻基因组染色体水平组装
Plants (Basel). 2024 Dec 11;13(24):3465. doi: 10.3390/plants13243465.
7
Proteogenomic Approaches for Diseasome Studies.基于蛋白质基因组学的疾病基因组研究方法
Methods Mol Biol. 2025;2859:253-264. doi: 10.1007/978-1-0716-4152-1_14.
8
Bioinformatics in Russia: history and present-day landscape.俄罗斯的生物信息学:历史与现状
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae513.
9
Complete Annotated Genome Assembly of Flax Pathogen .亚麻病原体的完整注释基因组组装
J Fungi (Basel). 2024 Aug 26;10(9):605. doi: 10.3390/jof10090605.
10
Transforming Clinical Research: The Power of High-Throughput Omics Integration.变革临床研究:高通量组学整合的力量
Proteomes. 2024 Sep 6;12(3):25. doi: 10.3390/proteomes12030025.