Computational Systems Biology Group, Systems Biology Department, National Center for Biotechnology (CNB-CSIC), c/ Darwin, 3, Madrid 28049, Spain.
J Mol Biol. 2022 Jun 15;434(11):167568. doi: 10.1016/j.jmb.2022.167568. Epub 2022 Mar 30.
Mining the massive amount of available biomedical information is hindered by the limited use of formal vocabularies and ontologies to represent these data. Such representation is necessary for cross-linking conceptual entities between different resources and, more generally, for encoding the information in a computer-tractable way. Basic tasks, such as retrieving a comprehensive list of associations between complex diseases and their reported symptoms or underlying biological processes, expressed in terms of formal identifiers, are not trivial. In many cases these associations have to be generated by manual curation or inferred/predicted from indirect evidence. In this work, using a text-mining approach based on detecting significant co-mentions in the scientific literature, we generated a resource with millions of relationships between thousands of terms representing diseases, symptoms, biological processes, molecular functions and cellular compartments, each given by its formal identifier in the main resource that covers it. We show examples that highlight the differences between these relationships and those available in other resources. The relationships can be queried and inspected in an interactive web interface freely available at: https://sysbiol.cnb.csic.es/CoMent.
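To make the idea of a "significant co-mention" concrete, the following is a minimal sketch of one common way to score term pairs. It is an assumption for illustration, not the method described in the paper: the abstract does not state which statistical test is used. Here each document is reduced to the set of term identifiers it mentions, and a pair is scored with the upper tail of a hypergeometric distribution (equivalent to a one-sided Fisher's exact test). The identifiers (`DISEASE:1`, `SYMPTOM:1`, and so on) and the function names are hypothetical placeholders.

```python
from collections import Counter
from itertools import combinations
from math import comb


def comention_pvalue(n_docs, n_a, n_b, n_ab):
    """Probability of seeing at least n_ab co-mentions by chance.

    Assumes term A appears in n_a of n_docs documents, term B in n_b,
    and that mentions are independent (hypergeometric upper tail).
    """
    upper = min(n_a, n_b)
    tail = sum(
        comb(n_a, k) * comb(n_docs - n_a, n_b - k)
        for k in range(n_ab, upper + 1)
    )
    return tail / comb(n_docs, n_b)


def score_corpus(docs):
    """Return {(term_a, term_b): p_value} for every co-mentioned pair.

    docs is a list of sets of term identifiers, one set per document.
    """
    term_counts = Counter()
    pair_counts = Counter()
    for terms in docs:
        term_counts.update(terms)
        # Sort so each unordered pair has a single canonical key.
        pair_counts.update(combinations(sorted(terms), 2))
    n_docs = len(docs)
    return {
        (a, b): comention_pvalue(n_docs, term_counts[a], term_counts[b], n_ab)
        for (a, b), n_ab in pair_counts.items()
    }


# Toy corpus with made-up identifiers (not real ontology terms).
DOCS = [
    {"DISEASE:1", "SYMPTOM:1"},
    {"DISEASE:1", "SYMPTOM:1"},
    {"DISEASE:1", "SYMPTOM:1", "PROCESS:1"},
    {"DISEASE:2", "PROCESS:1"},
    {"DISEASE:2"},
    {"SYMPTOM:2"},
]

if __name__ == "__main__":
    for pair, p in sorted(score_corpus(DOCS).items(), key=lambda kv: kv[1]):
        print(pair, round(p, 4))
```

In a real setting the p-values would need correction for multiple testing, since millions of pairs are evaluated, and the term sets would come from an entity-recognition step that maps text mentions to ontology identifiers.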