• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

枚举有向无环图的一致子图:生物医学本体论的一个见解。

Enumerating consistent sub-graphs of directed acyclic graphs: an insight into biomedical ontologies.

机构信息

Department of Computer Science, Indiana University, Bloomington, USA.

出版信息

Bioinformatics. 2018 Jul 1;34(13):i313-i322. doi: 10.1093/bioinformatics/bty268.

DOI:10.1093/bioinformatics/bty268
PMID:29949985
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6022688/
Abstract

MOTIVATION

Modern problems of concept annotation associate an object of interest (gene, individual, text document) with a set of interrelated textual descriptors (functions, diseases, topics), often organized in concept hierarchies or ontologies. Most ontology can be seen as directed acyclic graphs (DAGs), where nodes represent concepts and edges represent relational ties between these concepts. Given an ontology graph, each object can only be annotated by a consistent sub-graph; that is, a sub-graph such that if an object is annotated by a particular concept, it must also be annotated by all other concepts that generalize it. Ontologies therefore provide a compact representation of a large space of possible consistent sub-graphs; however, until now we have not been aware of a practical algorithm that can enumerate such annotation spaces for a given ontology.

RESULTS

We propose an algorithm for enumerating consistent sub-graphs of DAGs. The algorithm recursively partitions the graph into strictly smaller graphs until the resulting graph becomes a rooted tree (forest), for which a linear-time solution is computed. It then combines the tallies from graphs created in the recursion to obtain the final count. We prove the correctness of this algorithm, propose several practical accelerations, evaluate it on random graphs and then apply it to characterize four major biomedical ontologies. We believe this work provides valuable insights into the complexity of concept annotation spaces and its potential influence on the predictability of ontological annotation.

AVAILABILITY AND IMPLEMENTATION

https://github.com/shawn-peng/counting-consistent-sub-DAG.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

现代的概念注释问题将感兴趣的对象(基因、个体、文本文档)与一组相关的文本描述符(功能、疾病、主题)联系起来,这些描述符通常组织在概念层次结构或本体中。大多数本体可以看作是有向无环图(DAG),其中节点表示概念,边表示这些概念之间的关系纽带。给定一个本体图,每个对象只能通过一个一致的子图进行注释;也就是说,一个子图,使得如果一个对象被一个特定的概念注释,它也必须被所有其他概括它的概念注释。本体因此提供了一个紧凑的表示,用于表示一个可能的一致子图的大空间;然而,直到现在,我们还没有意识到一个实用的算法可以为给定的本体枚举这样的注释空间。

结果

我们提出了一种用于枚举 DAG 一致子图的算法。该算法递归地将图划分为严格较小的图,直到得到的图成为一个有根树(森林),对于这种情况,计算出一个线性时间的解决方案。然后,它将递归中创建的图形的计数合并起来,以获得最终的计数。我们证明了该算法的正确性,提出了几种实用的加速方法,在随机图上进行了评估,然后将其应用于描述四个主要的生物医学本体。我们相信这项工作为概念注释空间的复杂性及其对本体注释的可预测性的潜在影响提供了有价值的见解。

可用性和实现

https://github.com/shawn-peng/counting-consistent-sub-DAG。

补充信息

补充数据可在 Bioinformatics 在线获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d1a/6022688/b132b68a54e3/bty268f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d1a/6022688/305f5813d85c/bty268f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d1a/6022688/623f489642f1/bty268f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d1a/6022688/069325422cbf/bty268f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d1a/6022688/ea944cccf102/bty268f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d1a/6022688/d27de3036358/bty268f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d1a/6022688/b132b68a54e3/bty268f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d1a/6022688/305f5813d85c/bty268f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d1a/6022688/623f489642f1/bty268f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d1a/6022688/069325422cbf/bty268f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d1a/6022688/ea944cccf102/bty268f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d1a/6022688/d27de3036358/bty268f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8d1a/6022688/b132b68a54e3/bty268f6.jpg

相似文献

1
Enumerating consistent sub-graphs of directed acyclic graphs: an insight into biomedical ontologies.枚举有向无环图的一致子图:生物医学本体论的一个见解。
Bioinformatics. 2018 Jul 1;34(13):i313-i322. doi: 10.1093/bioinformatics/bty268.
2
Bit-parallel sequence-to-graph alignment.位并行序列到图的对齐。
Bioinformatics. 2019 Oct 1;35(19):3599-3607. doi: 10.1093/bioinformatics/btz162.
3
OPA2Vec: combining formal and informal content of biomedical ontologies to improve similarity-based prediction.OPA2Vec:结合生物医学本体的正式和非正式内容以改进基于相似度的预测。
Bioinformatics. 2019 Jun 1;35(12):2133-2140. doi: 10.1093/bioinformatics/bty933.
4
Inferring ontology graph structures using OWL reasoning.利用 owl 推理推断本体图结构。
BMC Bioinformatics. 2018 Jan 5;19(1):7. doi: 10.1186/s12859-017-1999-8.
5
Automated ontology generation framework powered by linked biomedical ontologies for disease-drug domain.基于链接生物医学本体的疾病-药物领域自动化本体生成框架。
Comput Methods Programs Biomed. 2018 Oct;165:117-128. doi: 10.1016/j.cmpb.2018.08.010. Epub 2018 Aug 16.
6
Where to search top-K biomedical ontologies?在哪里搜索顶级生物医学本体?
Brief Bioinform. 2019 Jul 19;20(4):1477-1491. doi: 10.1093/bib/bby015.
7
An efficient, large-scale, non-lattice-detection algorithm for exhaustive structural auditing of biomedical ontologies.一种用于生物医学本体全面结构审核的高效、大规模、非格检测算法。
J Biomed Inform. 2018 Apr;80:106-119. doi: 10.1016/j.jbi.2018.03.004. Epub 2018 Mar 13.
8
Elucidating high-dimensional cancer hallmark annotation via enriched ontology.
J Biomed Inform. 2017 Sep;73:84-94. doi: 10.1016/j.jbi.2017.07.011. Epub 2017 Jul 16.
9
mOWL: Python library for machine learning with biomedical ontologies.mOWL:用于生物医学本体机器学习的 Python 库。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac811.
10
Disease ontologies for knowledge graphs.疾病本体在知识图谱中的应用。
BMC Bioinformatics. 2021 Jul 21;22(1):377. doi: 10.1186/s12859-021-04173-w.

引用本文的文献

1
Optimizing gene selection and module identification via ontology-based scoring and deep learning.通过基于本体的评分和深度学习优化基因选择与模块识别。
Bioinform Adv. 2025 Feb 26;5(1):vbaf034. doi: 10.1093/bioadv/vbaf034. eCollection 2025.
2
Mutual annotation-based prediction of protein domain functions with Domain2GO.基于互注释的蛋白质结构域功能预测与 Domain2GO。
Protein Sci. 2024 Jun;33(6):e4988. doi: 10.1002/pro.4988.
3
CAFA-evaluator: a Python tool for benchmarking ontological classification methods.CAFA评估器:一种用于对本体分类方法进行基准测试的Python工具。

本文引用的文献

1
Semantic Similarity in the Gene Ontology.基因本体论中的语义相似性。
Methods Mol Biol. 2017;1446:161-173. doi: 10.1007/978-1-4939-3743-1_12.
2
Community-Wide Evaluation of Computational Function Prediction.计算功能预测的全社区评估
Methods Mol Biol. 2017;1446:133-146. doi: 10.1007/978-1-4939-3743-1_10.
3
Best Practices in Manual Annotation with the Gene Ontology.使用基因本体进行人工注释的最佳实践
Bioinform Adv. 2024 Mar 14;4(1):vbae043. doi: 10.1093/bioadv/vbae043. eCollection 2024.
4
The field of protein function prediction as viewed by different domain scientists.不同领域科学家眼中的蛋白质功能预测领域。
Bioinform Adv. 2022 Aug 17;2(1):vbac057. doi: 10.1093/bioadv/vbac057. eCollection 2022.
5
Hierarchical deep learning for predicting GO annotations by integrating protein knowledge.基于蛋白质知识的 GO 注释预测的分层深度学习
Bioinformatics. 2022 Sep 30;38(19):4488-4496. doi: 10.1093/bioinformatics/btac536.
6
Semantic similarity and machine learning with ontologies.语义相似性和本体论的机器学习。
Brief Bioinform. 2021 Jul 20;22(4). doi: 10.1093/bib/bbaa199.
7
The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens.CAFA 挑战赛报告称,通过实验筛选,提高了数百个基因的蛋白质功能预测和新的功能注释。
Genome Biol. 2019 Nov 19;20(1):244. doi: 10.1186/s13059-019-1835-8.
Methods Mol Biol. 2017;1446:41-54. doi: 10.1007/978-1-4939-3743-1_4.
4
FEDRR: fast, exhaustive detection of redundant hierarchical relations for quality improvement of large biomedical ontologies.FEDRR:用于大型生物医学本体质量改进的冗余层次关系快速详尽检测
BioData Min. 2016 Oct 10;9:31. doi: 10.1186/s13040-016-0110-8. eCollection 2016.
5
An expanded evaluation of protein function prediction methods shows an improvement in accuracy.对蛋白质功能预测方法的扩展评估显示准确性有所提高。
Genome Biol. 2016 Sep 7;17(1):184. doi: 10.1186/s13059-016-1037-6.
6
The GOA database: gene Ontology annotation updates for 2015.基因本体注释数据库(GOA):2015年基因本体注释更新
Nucleic Acids Res. 2015 Jan;43(Database issue):D1057-63. doi: 10.1093/nar/gku1113. Epub 2014 Nov 6.
7
The impact of incomplete knowledge on the evaluation of protein function prediction: a structured-output learning perspective.知识不完整对蛋白质功能预测评估的影响:结构化输出学习视角
Bioinformatics. 2014 Sep 1;30(17):i609-16. doi: 10.1093/bioinformatics/btu472.
8
Variation Ontology for annotation of variation effects and mechanisms.变异本体论用于注释变异的影响和机制。
Genome Res. 2014 Feb;24(2):356-64. doi: 10.1101/gr.157495.113. Epub 2013 Oct 25.
9
Information-theoretic evaluation of predicted ontological annotations.基于信息论的预测本体论注释评估。
Bioinformatics. 2013 Jul 1;29(13):i53-61. doi: 10.1093/bioinformatics/btt228.
10
Biases in the experimental annotations of protein function and their effect on our understanding of protein function space.蛋白质功能的实验注释中的偏差及其对我们理解蛋白质功能空间的影响。
PLoS Comput Biol. 2013;9(5):e1003063. doi: 10.1371/journal.pcbi.1003063. Epub 2013 May 30.