Genomics program, College of Public Health, University of South Florida, Tampa, FL, USA.
Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA.
Methods Mol Biol. 2025;2867:169-184. doi: 10.1007/978-1-0716-4196-5_10.
DescribePROT is a freely available online database of structural and functional descriptors of proteins at the amino acid level. It provides access to 13 diverse descriptors that include sequence conservation, putative secondary structure, solvent accessibility, intrinsic disorder, and signal peptides, and putative annotations of residues that interact with proteins, peptides and nucleic acids. These data can be used to elucidate protein functions, to support efforts to develop therapeutics, and to develop and evaluate future predictors of protein structure and function. DescribePROT includes 7.8 billion predictions for 1.4 million proteins from 83 complete proteomes of popular model organisms. This information can be downloaded at multiple levels of scope (entire database, specific organisms, and individual proteins) and can be interacted with using a graphical interface that simultaneously displays data on multiple descriptors. We describe the contents of this resource, provide directions on how to use its interface, and offer instructions on how to obtain and interact with the underlying data. Moreover, we briefly discuss plans for a future expansion of this database. DescribePROT is available at http://biomine.cs.vcu.edu/servers/DESCRIBEPROT/ .
DescribePROT 是一个免费的在线数据库,提供蛋白质在氨基酸水平的结构和功能描述符。它提供了 13 种不同的描述符,包括序列保守性、推测的二级结构、溶剂可及性、内在无序性和信号肽,以及推测与蛋白质、肽和核酸相互作用的残基的注释。这些数据可用于阐明蛋白质的功能,支持开发治疗方法的努力,并开发和评估未来蛋白质结构和功能的预测器。DescribePROT 包含 78 亿个预测,针对 83 个流行的模式生物的完整蛋白质组中的 140 万个蛋白质。此信息可在多个范围级别(整个数据库、特定生物体和单个蛋白质)下载,并可使用图形界面进行交互,同时显示多个描述符的数据。我们描述了此资源的内容,提供了使用其界面的说明,并提供了有关如何获取和交互基础数据的说明。此外,我们简要讨论了未来扩展此数据库的计划。DescribePROT 可在 http://biomine.cs.vcu.edu/servers/DESCRIBEPROT/ 获得。