Univ. Lille, CNRS, Centrale Lille, UMR 9189 - CRIStAL - Centre de Recherche en Informatique Signal et Automatique de Lille, F-59000 Lille, France.
Proteome Informatics Group, SIB Swiss Institute of Bioinformatics, CMU, Rue Michel-Servet 1, 1211 Geneva, Switzerland.
Nucleic Acids Res. 2020 Jan 8;48(D1):D465-D469. doi: 10.1093/nar/gkz1000.
Norine, the unique resource dedicated to nonribosomal peptides (NRPs), has been updated with a new pipeline that automates large-scale data sourcing and enhances annotation. External databases are mined to extract NRPs that are not yet in Norine. To maintain high data quality, successive filters are applied to automatically validate the NRP annotations, and only validated data are inserted into the database. External databases are also used to complete the annotations of NRPs already in Norine. In addition, consistency checks within Norine and between Norine and external sources have revealed annotation errors. Some of these can be corrected automatically, while others need manual curation. This new approach led to the insertion of 539 new NRPs and the addition or correction of annotations for nearly all Norine entries. Two new tools were also integrated: rBAN, which analyses the chemical structures of NRPs, and Kendrick Formula Predictor, which infers a molecular formula from the mass-to-charge ratio of an NRP. Norine is freely accessible at https://bioinfo.cristal.univ-lille.fr/norine/.