Proteomics and metabolomics unit, Basic research department, Children's Cancer Hospital, 57357 Cairo, (CCHE-57357), Egypt.
Proteomics and metabolomics unit, Basic research department, Children's Cancer Hospital, 57357 Cairo, (CCHE-57357), Egypt; Department of Pharmacology, Faculty of Veterinary Medicine, Suez Canal University, 41522 Ismailia, Egypt.
J Proteomics. 2020 Feb 20;213:103613. doi: 10.1016/j.jprot.2019.103613. Epub 2019 Dec 14.
UniprotR is a software package designed to easily retrieve, cluster and visualize protein data from UniProt knowledgebase (UniProtKB) using R language. The package is implemented mainly to process, parse and illustrate proteomics data in a handy and time-saving approach allowing researchers to summarize all required protein information available at UniProtKB in a readable data frame, Excel CSV file, and/or graphical output. UniprotR generates a set of graphics including gene ontology, chromosomal location, protein scoring and status, protein networking, sequence phylogenetic tree, and physicochemical properties. In addition, the package supports clustering of proteins based on primary gene name or chromosomal location, facilitating additional downstream analysis. SIGNIFICANCE: In this work, we implemented a robust package for retrieving and visualizing information from multiple sources such UniProtKB, SWISS-MODEL, and STRING. UniprotR Contains functions that enable retrieving and cluster data in a handy way and visualize data in publishable graphs to facilitate researcher's work and fulfill their needs. UniprotR will aid in saving time for downstream data analysis instead of manual time consuming data analysis. AVAILABILITY AND IMPLEMENTATION: UniprotR released as free open source code under the license of GPLv3, and available in CRAN (The Comprehensive R Archive Network) and GitHub. (https://cran.r-project.org/web/packages/UniprotR/index.html). (https://github.com/Proteomicslab57357/UniprotR).
UniprotR 是一个软件包,旨在使用 R 语言轻松地从 UniProt 知识库(UniProtKB)中检索、聚类和可视化蛋白质数据。该软件包主要用于以一种方便且节省时间的方式处理、解析和说明蛋白质组学数据,使研究人员能够在一个可读的数据框、Excel CSV 文件和/或图形输出中总结 UniProtKB 中提供的所有所需蛋白质信息。UniprotR 生成了一组图形,包括基因本体、染色体位置、蛋白质评分和状态、蛋白质网络、序列系统发育树和理化性质。此外,该软件包还支持基于主要基因名称或染色体位置对蛋白质进行聚类,便于进行额外的下游分析。意义:在这项工作中,我们实现了一个强大的软件包,用于从多个来源(如 UniProtKB、SWISS-MODEL 和 STRING)检索和可视化信息。UniprotR 包含了一些功能,可以方便地检索和聚类数据,并以可发表的图形形式可视化数据,以方便研究人员的工作并满足他们的需求。UniprotR 将有助于节省下游数据分析的时间,而不是手动进行耗时的数据分析。可用性和实现:UniprotR 作为免费的开源代码,在 GPLv3 许可证下发布,可在 CRAN(综合 R 存档网络)和 GitHub 上获得。(https://cran.r-project.org/web/packages/UniprotR/index.html)。(https://github.com/Proteomicslab57357/UniprotR)。