Toward Synthesizing Our Knowledge of Morphology: Using Ontologies and Machine Reasoning to Extract Presence/Absence Evolutionary Phenotypes across Studies.

Authors

Dececchi T Alexander, Balhoff James P, Lapp Hilmar, Mabee Paula M

Affiliations

Department of Biology, University of South Dakota, Vermillion, SD 57069, USA

National Evolutionary Synthesis Center, Durham, NC 27705, USA; University of North Carolina, Chapel Hill, NC 27599, USA

Publication information

Syst Biol. 2015 Nov;64(6):936-52. doi: 10.1093/sysbio/syv031. Epub 2015 May 26.

DOI:10.1093/sysbio/syv031
PMID:26018570
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4604830/
Abstract

The reality of larger and larger molecular databases and the need to integrate data scalably have presented a major challenge for the use of phenotypic data. Morphology is currently primarily described in discrete publications, entrenched in noncomputer readable text, and requires enormous investments of time and resources to integrate across large numbers of taxa and studies. Here we present a new methodology, using ontology-based reasoning systems working with the Phenoscape Knowledgebase (KB; kb.phenoscape.org), to automatically integrate large amounts of evolutionary character state descriptions into a synthetic character matrix of neomorphic (presence/absence) data. Using the KB, which includes more than 55 studies of sarcopterygian taxa, we generated a synthetic supermatrix of 639 variable characters scored for 1051 taxa, resulting in over 145,000 populated cells. Of these characters, over 76% were made variable through the addition of inferred presence/absence states derived by machine reasoning over the formal semantics of the source ontologies. Inferred data reduced the missing data in the variable character-subset from 98.5% to 78.2%. Machine reasoning also enables the isolation of conflicts in the data, that is, cells where both presence and absence are indicated; reports regarding conflicting data provenance can be generated automatically. Further, reasoning enables quantification and new visualizations of the data, here for example, allowing identification of character space that has been undersampled across the fin-to-limb transition. The approach and methods demonstrated here to compute synthetic presence/absence supermatrices are applicable to any taxonomic and phenotypic slice across the tree of life, providing the data are semantically annotated. 
Because such data can also be linked to model organism genetics through computational scoring of phenotypic similarity, they open a rich set of future research questions into phenotype-to-genome relationships.
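
The abstract does not spell out the inference rules, so the following is only a minimal sketch of the general idea, not the Phenoscape implementation. It assumes two simple rules (presence of a part implies presence of every structure it is part of; absence of a whole implies absence of all its parts), a hypothetical toy `PART_OF` hierarchy, and made-up taxon names. It then shows how inferred states fill matrix cells, how a cell holding both states is flagged as a conflict, and how the missing-data fraction changes.

```python
from collections import defaultdict

# Hypothetical toy part_of hierarchy (part -> whole); not taken from any real ontology.
PART_OF = {"digit": "autopod", "autopod": "limb", "radius": "limb"}

def ancestors(term):
    """Yield every structure that `term` is transitively part of."""
    while term in PART_OF:
        term = PART_OF[term]
        yield term

def descendants(term):
    """Return every structure that is transitively part of `term`."""
    return {part for part in PART_OF if term in ancestors(part)}

def infer(asserted):
    """Expand (taxon, structure, state) triples under the two assumed rules.

    Returns a dict mapping (taxon, structure) to the set of states seen.
    """
    cells = defaultdict(set)
    for taxon, structure, state in asserted:
        cells[(taxon, structure)].add(state)
        # Presence propagates up to wholes; absence propagates down to parts.
        implied = ancestors(structure) if state == "present" else descendants(structure)
        for other in implied:
            cells[(taxon, other)].add(state)
    return dict(cells)

def conflicts(cells):
    """Cells where both presence and absence are indicated."""
    return sorted(key for key, states in cells.items() if states == {"present", "absent"})

def missing_fraction(cells, taxa, structures):
    """Fraction of taxon-by-structure cells with no state at all."""
    filled = sum((t, s) in cells for t in taxa for s in structures)
    return 1 - filled / (len(taxa) * len(structures))

# Made-up assertions for three hypothetical taxa.
asserted = [
    ("taxon_A", "digit", "present"),
    ("taxon_B", "limb", "absent"),
    ("taxon_C", "autopod", "absent"),
    ("taxon_C", "digit", "present"),
]
TAXA = ["taxon_A", "taxon_B", "taxon_C"]
STRUCTURES = ["digit", "autopod", "limb", "radius"]

asserted_only = {(t, s): {state} for t, s, state in asserted}
cells = infer(asserted)

print(f"missing before inference: {missing_fraction(asserted_only, TAXA, STRUCTURES):.1%}")  # 66.7%
print(f"missing after inference:  {missing_fraction(cells, TAXA, STRUCTURES):.1%}")  # 16.7%
print("conflicts:", conflicts(cells))  # taxon_C autopod and digit
```

For scale, the 639 characters and 1051 taxa reported above give 639 × 1051 = 671,589 cells; a populated share of 21.8% (100% minus 78.2% missing) is roughly 146,400 cells, consistent with the reported "over 145,000 populated cells".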


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89f9/4604830/d82ea9caa53e/syv031f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89f9/4604830/e3e6271878e4/syv031f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89f9/4604830/058f88872168/syv031f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89f9/4604830/fcec2a7dd17f/syv031f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89f9/4604830/b01826afd7e6/syv031f4.jpg
