Vecna Technologies, Inc., Greenbelt, MD 20770.
Northrop Grumman, Rockville, MD 20850, USA.
Bioinformatics. 2020 Mar 1;36(5):1627-1628. doi: 10.1093/bioinformatics/btz777.
Sequence repositories have few well-annotated virus mature peptide sequences. Therefore post-translational proteolytic processing of polyproteins into mature peptides (MPs) has been performed in silico, with a new computational method, for over 200 species in 5 pathogenic virus families (Caliciviridae, Coronaviridae, Flaviviridae, Picornaviridae and Togaviridae).
Using pairwise alignment with reference sequences, MPs have been annotated and their sequences made available for search, analysis and download. At publication the method had produced 156 216 sequences, a large portion of the protein sequences now available in https://www.viprbrc.org. It represents a new and comprehensive mature peptide collection.
The data are available at the Virus Pathogen Resource https://www.viprbrc.org, and the software at https://github.com/VirusBRC/vipr_mat_peptide.
序列存储库中很少有经过良好注释的病毒成熟肽序列。因此,针对超过 200 种 5 种致病病毒科(杯状病毒科、冠状病毒科、黄病毒科、小核糖核酸病毒科和披膜病毒科)的多蛋白,已通过一种新的计算方法进行了基于计算机的翻译后蛋白水解加工,以获得成熟肽(MPs)。
使用与参考序列的两两比对,已对 MPs 进行了注释,并提供了它们的序列,以供搜索、分析和下载。在发表时,该方法已产生了 156216 个序列,其中大部分蛋白质序列现在可在 https://www.viprbrc.org 上获得。它代表了一个新的、全面的成熟肽集合。
数据可在病毒病原体资源 https://www.viprbrc.org 上获得,软件可在 https://github.com/VirusBRC/vipr_mat_peptide 上获得。