drawProteins: a Bioconductor/R package for reproducible and programmatic generation of protein schematics.

Author information

Brennan Paul

Affiliation

Centre for Medical Education, School of Medicine, Cardiff University, Cardiff, Wales, UK.

Publication information

F1000Res. 2018 Jul 18;7:1105. doi: 10.12688/f1000research.14541.1. eCollection 2018.

DOI: 10.12688/f1000research.14541.1
PMID: 30210791
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC6107989/

Abstract

Protein schematics are valuable for research, teaching and knowledge communication. However, the tools used to automate the process are challenging. The purpose of the drawProteins package is to enable the generation of schematics of proteins in an automated fashion that can integrate with the Bioconductor/R suite of tools for bioinformatics and statistical analysis. Using UniProt accession numbers, the package uses the UniProt API to get the features of the protein from the UniProt database. The features are assembled into a data frame and visualized using adaptations of the ggplot2 package. Visualizations can be customised in many ways including adding additional protein features information from other data frames, altering colors and protein names and adding extra layers using other ggplot2 functions. This can be completed within a script that makes the workflow reproducible and sharable.
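
The data flow described in the abstract (feature records fetched per accession number, flattened into a table, then drawn) can be illustrated with a minimal sketch. This is not the drawProteins implementation, which is written in R and draws with ggplot2. The record layout (`accession`, `features`, `type`, `begin`, `end`), the function names, the placeholder accession `X00001` and all sample values are assumptions made up for illustration, and a plain-text bar stands in for the plot.

```python
# Illustrative sketch only: the record layout and sample values are
# assumptions, not the drawProteins code or real UniProt data.


def features_to_rows(records):
    """Flatten per-protein feature records into one row per feature."""
    rows = []
    for order, record in enumerate(records, start=1):
        for feature in record["features"]:
            rows.append({
                "accession": record["accession"],
                "type": feature["type"],
                "description": feature.get("description", ""),
                "begin": int(feature["begin"]),
                "end": int(feature["end"]),
                "order": order,  # which protein (1-based) the row belongs to
            })
    return rows


def text_schematic(rows, accession, width=40):
    """Render one protein as a text bar: '-' for the chain, '#' for domains."""
    own = [r for r in rows if r["accession"] == accession]
    chain = next(r for r in own if r["type"] == "CHAIN")
    length = chain["end"]
    bar = ["-"] * width
    for r in own:
        if r["type"] == "DOMAIN":
            # Scale residue positions onto the fixed-width bar.
            start = (r["begin"] - 1) * width // length
            stop = max(start + 1, r["end"] * width // length)
            for i in range(start, min(stop, width)):
                bar[i] = "#"
    return "".join(bar)


# Hypothetical input: one made-up protein with a chain and a single domain.
sample = [{
    "accession": "X00001",
    "features": [
        {"type": "CHAIN", "description": "Example protein", "begin": "1", "end": "400"},
        {"type": "DOMAIN", "description": "Example domain", "begin": "101", "end": "200"},
    ],
}]

rows = features_to_rows(sample)
print(text_schematic(rows, "X00001"))
```

Keeping one row per feature means extra annotations from another table can be appended as additional rows before drawing, which mirrors the customisation route the abstract mentions.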

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc2f/6107989/db784b32672d/f1000research-7-15828-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc2f/6107989/0d0f510088d8/f1000research-7-15828-g0000.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc2f/6107989/b10ce1186eec/f1000research-7-15828-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc2f/6107989/277aaf509089/f1000research-7-15828-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc2f/6107989/be13e7680521/f1000research-7-15828-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bc2f/6107989/4883dd4813c3/f1000research-7-15828-g0004.jpg
