An ontology-based knowledge graph for representing interactions involving RNA molecules.

Affiliations

AnacletoLab, Computer Science Department, University of Milan, Milan, 20133, Italy.

Department of Biomedical Informatics, Columbia University Irving Medical Center, New York, NY, 10032, USA.

Publication information

Sci Data. 2024 Aug 22;11(1):906. doi: 10.1038/s41597-024-03673-7.

Abstract

The "RNA world" represents a novel frontier for the study of fundamental biological processes and human diseases and is paving the way for the development of new drugs tailored to each patient's biomolecular characteristics. Although scientific data about coding and non-coding RNA molecules are constantly produced and available from public repositories, they are scattered across different databases and a centralized, uniform, and semantically consistent representation of the "RNA world" is still lacking. We propose RNA-KG, a knowledge graph (KG) encompassing biological knowledge about RNAs gathered from more than 60 public databases, integrating functional relationships with genes, proteins, and chemicals and ontologically grounded biomedical concepts. To develop RNA-KG, we first identified, pre-processed, and characterized each data source; next, we built a meta-graph that provides an ontological description of the KG by representing all the bio-molecular entities and medical concepts of interest in this domain, as well as the types of interactions connecting them. Finally, we leveraged an instance-based semantically abstracted knowledge model to specify the ontological alignment according to which RNA-KG was generated. RNA-KG can be downloaded in different formats and also queried by a SPARQL endpoint. A thorough topological analysis of the resulting heterogeneous graph provides further insights into the characteristics of the "RNA world". RNA-KG can be both directly explored and visualized, and/or analyzed by applying computational methods to infer bio-medical knowledge from its heterogeneous nodes and edges. The resource can be easily updated with new experimental data, and specific views of the overall KG can be extracted according to the bio-medical problem to be studied.
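The idea of extracting a specific view from a heterogeneous graph, and of looking at its topology, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the node identifiers, relation names, node types, and the helper functions `extract_view` and `degree_by_node` are hypothetical and do not come from RNA-KG or its schema.

```python
from collections import Counter

# Hypothetical toy triples (subject, relation, object). None of these
# identifiers or relation names are taken from RNA-KG itself.
TRIPLES = [
    ("miRNA_A", "regulates", "gene_X"),
    ("miRNA_A", "associated_with", "disease_D"),
    ("lncRNA_B", "interacts_with", "protein_P"),
    ("gene_X", "encodes", "protein_P"),
    ("chemical_C", "targets", "miRNA_A"),
]

# Hypothetical mapping from each node to its type.
NODE_TYPES = {
    "miRNA_A": "RNA",
    "lncRNA_B": "RNA",
    "gene_X": "gene",
    "protein_P": "protein",
    "disease_D": "disease",
    "chemical_C": "chemical",
}


def extract_view(triples, node_types, allowed_types):
    """Keep only the edges whose two endpoints both have an allowed type."""
    return [
        (s, r, o)
        for s, r, o in triples
        if node_types[s] in allowed_types and node_types[o] in allowed_types
    ]


def degree_by_node(triples):
    """Count the edges touching each node (in-degree plus out-degree)."""
    degree = Counter()
    for s, _, o in triples:
        degree[s] += 1
        degree[o] += 1
    return degree


# A view restricted to RNA and gene nodes, and a simple degree count
# over the whole toy graph.
rna_gene_view = extract_view(TRIPLES, NODE_TYPES, {"RNA", "gene"})
degrees = degree_by_node(TRIPLES)
```

In this toy setting, a view is just a filter over typed edges, and node degree is one of the simplest topological measures; a real analysis of a graph of this kind would likely rely on a dedicated graph library and the published node and edge types.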

Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1deb/11341713/7c57d26d95f3/41597_2024_3673_Fig1_HTML.jpg
