Suppr超能文献

NGPINT:下一代蛋白质-蛋白质相互作用软件。

NGPINT: a next-generation protein-protein interaction software.

机构信息

Program in Bioinformatics & Computational Biology, Iowa State University, Ames, IA, 50011, USA.

Department of Statistics, Iowa State University, Ames, IA, 50011, USA.

出版信息

Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa351.

Abstract

Mapping protein-protein interactions at a proteome scale is critical to understanding how cellular signaling networks respond to stimuli. Since eukaryotic genomes encode thousands of proteins, testing their interactions one-by-one is a challenging prospect. High-throughput yeast-two hybrid (Y2H) assays that employ next-generation sequencing to interrogate complementary DNA (cDNA) libraries represent an alternative approach that optimizes scale, cost and effort. We present NGPINT, a robust and scalable software to identify all putative interactors of a protein using Y2H in batch culture. NGPINT combines diverse tools to align sequence reads to target genomes, reconstruct prey fragments and compute gene enrichment under reporter selection. Central to this pipeline is the identification of fusion reads containing sequences derived from both the Y2H expression plasmid and the cDNA of interest. To reduce false positives, these fusion reads are evaluated as to whether the cDNA fragment forms an in-frame translational fusion with the Y2H transcription factor. NGPINT successfully recognized 95% of interactions in simulated test runs. As proof of concept, NGPINT was tested using published data sets and it recognized all validated interactions. NGPINT can process interaction data from any biosystem with an available genome or transcriptome reference, thus facilitating the discovery of protein-protein interactions in model and non-model organisms.

摘要

在蛋白质组学水平上绘制蛋白质-蛋白质相互作用图谱对于理解细胞信号网络如何对刺激做出反应至关重要。由于真核基因组编码了数千种蛋白质,逐个测试它们的相互作用是一项具有挑战性的任务。高通量酵母双杂交(Y2H)测定法采用下一代测序技术来检测 cDNA 文库,代表了一种优化规模、成本和效率的替代方法。我们提出了 NGPINT,这是一种使用 Y2H 在批量培养中鉴定蛋白质所有假定相互作用者的强大且可扩展的软件。NGPINT 结合了多种工具,将序列读取与目标基因组对齐,重建猎物片段,并根据报告基因选择计算基因富集。该流水线的核心是识别包含来自 Y2H 表达质粒和感兴趣的 cDNA 的序列的融合读取。为了减少假阳性,这些融合读取会根据 cDNA 片段是否与 Y2H 转录因子形成框内翻译融合来进行评估。NGPINT 在模拟测试运行中成功识别了 95%的相互作用。作为概念验证,我们使用已发表的数据集对 NGPINT 进行了测试,它识别了所有经过验证的相互作用。NGPINT 可以处理任何具有可用基因组或转录组参考的生物系统的相互作用数据,从而促进模型和非模型生物中蛋白质-蛋白质相互作用的发现。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验