• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

帮助作者生成可实现公平原则的分类学数据:对作者驱动的表型数据生成原型的评估

Helping authors produce FAIR taxonomic data: evaluation of an author-driven phenotype data production prototype.

作者信息

Zhang Limin, Starr Julian, Ford Bruce, Reznicek Anton, Zhou Yuxuan, Léveillé-Bourret Étienne, Lacroix-Carignan Étienne, Cayouette Jacques, Smith Tyler W, Sutherland Donald, Catling Paul, Saarela Jeffery M, Cui Hong, Macklin James

机构信息

School of Information, University of Arizona, 1103 E. 2nd Street, Tucson, AZ 85719, USA.

School of Fine Arts, Huaiyin Normal University, 71 Jiaotong Road, Huaian, Jiangsu 223001, China.

出版信息

Database (Oxford). 2025 Jan 29;2025. doi: 10.1093/database/baae097.

DOI:10.1093/database/baae097
PMID:39879563
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11928229/
Abstract

It is well-known that the use of vocabulary in phenotype treatments is often inconsistent. An earlier survey of biologists who create or use phenotypic characters revealed that this lack of standardization leads to ambiguities, frustrating both the consumers and producers of phenotypic data. Such ambiguities are challenging for biologists, and more so for Artificial Intelligence, to resolve. That survey also indicated a strong interest in a new authoring workflow supported by ontologies to ensure published phenotype data are FAIR (Findable, Accessible, Interoperable, and Reusable) and suitable for large-scale computational analyses. In this article, we introduce a prototype software system designed for authors to produce computational phenotype data. This platform includes a web-based, ontology-enhanced editor for taxonomic characters (Character Recorder), an Ontology Backend holding standardized vocabulary (the Cared Ontology), and a mobile application for resolving ontological conflicts (Conflict Resolver). We present two formal user evaluations of Character Recorder, the main interface authors would interact with to produce FAIR data. The evaluations were conducted with undergraduate biology students and Carex experts. We evaluated Character Recorder against Microsoft Excel on their effectiveness, efficiency, and the cognitive demands of the users in producing computable taxon-by-character matrices. The evaluations showed that Character Recorder is quickly learnable for both student and professional participants, with its cognitive demand comparable to Excel's. Participants agreed that the quality of the data Character Recorder yielded was superior. Students praised Character Recorder's educational value, while Carex experts were keen to recommend it and help evolve it from a prototype into a comprehensive tool. Feature improvements recommended by expert participants have been implemented after the evaluation.

摘要

众所周知,在表型处理中词汇的使用往往不一致。一项针对创建或使用表型特征的生物学家的早期调查显示,这种缺乏标准化的情况会导致歧义,令表型数据的使用者和生产者都感到沮丧。对于生物学家来说,解决这些歧义具有挑战性,而对于人工智能来说更是如此。该调查还表明,人们对由本体支持的新创作工作流程有着浓厚兴趣,以确保已发表的表型数据是FAIR的(可查找、可访问、可互操作和可重用),并适用于大规模计算分析。在本文中,我们介绍了一个为作者设计的用于生成计算表型数据的原型软件系统。这个平台包括一个基于网络的、用于分类特征的本体增强编辑器(特征记录器)、一个保存标准化词汇的本体后端(Cared本体)以及一个用于解决本体冲突的移动应用程序(冲突解决器)。我们对特征记录器进行了两次正式的用户评估,特征记录器是作者为生成FAIR数据而与之交互的主要界面。评估是与本科生物学学生和苔草专家进行的。我们将特征记录器与Microsoft Excel在生成可计算的分类单元-特征矩阵时的有效性、效率以及对用户的认知要求方面进行了评估。评估表明,特征记录器对于学生和专业参与者来说都很容易学习,其认知要求与Excel相当。参与者一致认为特征记录器生成的数据质量更高。学生们称赞了特征记录器的教育价值,而苔草专家则热衷于推荐它,并帮助将其从一个原型发展成为一个全面的工具。专家参与者建议的功能改进在评估后已经得到实施。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/c097b42d9dc3/baae097f18.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/f41201fedbe1/baae097f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/7600f657ea1b/baae097f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/8d35279b6cca/baae097f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/62d99986fb1e/baae097f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/16cd3d749910/baae097f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/3dfd5472a426/baae097f5b.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/c58d7ff571aa/baae097f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/48a9894dd542/baae097f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/5e4103262f52/baae097f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/a95d71853990/baae097f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/33ceddb11d67/baae097f10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/a64e982da080/baae097f11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/88848fa6be44/baae097f12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/a4011be0e4d5/baae097f13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/332c572719a7/baae097f14a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/31a7300438fc/baae097f14b.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/ba7acb49bfa8/baae097f15.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/1c759a83472f/baae097f16.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/da94e2b48374/baae097f17.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/c097b42d9dc3/baae097f18.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/f41201fedbe1/baae097f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/7600f657ea1b/baae097f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/8d35279b6cca/baae097f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/62d99986fb1e/baae097f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/16cd3d749910/baae097f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/3dfd5472a426/baae097f5b.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/c58d7ff571aa/baae097f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/48a9894dd542/baae097f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/5e4103262f52/baae097f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/a95d71853990/baae097f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/33ceddb11d67/baae097f10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/a64e982da080/baae097f11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/88848fa6be44/baae097f12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/a4011be0e4d5/baae097f13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/332c572719a7/baae097f14a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/31a7300438fc/baae097f14b.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/ba7acb49bfa8/baae097f15.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/1c759a83472f/baae097f16.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/da94e2b48374/baae097f17.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ce7/11928229/c097b42d9dc3/baae097f18.jpg

相似文献

1
Helping authors produce FAIR taxonomic data: evaluation of an author-driven phenotype data production prototype.帮助作者生成可实现公平原则的分类学数据:对作者驱动的表型数据生成原型的评估
Database (Oxford). 2025 Jan 29;2025. doi: 10.1093/database/baae097.
2
Measurement Recorder: developing a useful tool for making species descriptions that produces computable phenotypes.记录器:开发一种有用的工具来进行物种描述,生成可计算的表型。
Database (Oxford). 2020 Nov 20;2020. doi: 10.1093/database/baaa079.
3
Modifier Ontologies for frequency, certainty, degree, and coverage phenotype modifier.用于频率、确定性、程度和覆盖表型修饰符的修饰符本体。
Biodivers Data J. 2018 Nov 28(6):e29232. doi: 10.3897/BDJ.6.e29232. eCollection 2018.
4
Microbial phenomics information extractor (MicroPIE): a natural language processing tool for the automated acquisition of prokaryotic phenotypic characters from text sources.微生物表型组学信息提取器(MicroPIE):一种用于从文本来源自动获取原核生物表型特征的自然语言处理工具。
BMC Bioinformatics. 2016 Dec 13;17(1):528. doi: 10.1186/s12859-016-1396-8.
5
Phenex: ontological annotation of phenotypic diversity.凤凰:表型多样性的本体论注释。
PLoS One. 2010 May 5;5(5):e10500. doi: 10.1371/journal.pone.0010500.
6
Which methods are the most effective in enabling novice users to participate in ontology creation? A usability study.哪些方法对于帮助新手用户参与本体创建最为有效?一项可用性研究。
Database (Oxford). 2021 Jun 22;2021. doi: 10.1093/database/baab035.
7
Development and Applications of Interoperable Biomedical Ontologies for Integrative Data and Knowledge Representation and Multiscale Modeling in Systems Medicine.可互操作的生物医学本体在系统医学中的综合数据和知识表示以及多尺度建模中的开发与应用。
Methods Mol Biol. 2022;2486:233-244. doi: 10.1007/978-1-0716-2265-0_12.
8
Technical Note: Ontology-guided radiomics analysis workflow (O-RAW).技术说明:本体引导的放射组学分析工作流程(O-RAW)。
Med Phys. 2019 Dec;46(12):5677-5684. doi: 10.1002/mp.13844. Epub 2019 Oct 25.
9
Toward Synthesizing Our Knowledge of Morphology: Using Ontologies and Machine Reasoning to Extract Presence/Absence Evolutionary Phenotypes across Studies.迈向整合我们的形态学知识:利用本体和机器推理提取跨研究的存在/缺失进化表型。
Syst Biol. 2015 Nov;64(6):936-52. doi: 10.1093/sysbio/syv031. Epub 2015 May 26.
10
A Data Transformation Methodology to Create Findable, Accessible, Interoperable, and Reusable Health Data: Software Design, Development, and Evaluation Study.一种创建可发现、可访问、可互操作和可重用健康数据的数据转换方法:软件设计、开发和评估研究。
J Med Internet Res. 2023 Mar 8;25:e42822. doi: 10.2196/42822.

本文引用的文献

1
Authors' attitude toward adopting a new workflow to improve the computability of phenotype publications.作者对采用新工作流程以提高表型出版物可计算性的态度。
Database (Oxford). 2022 Feb 2;2022. doi: 10.1093/database/baac001.
2
Which methods are the most effective in enabling novice users to participate in ontology creation? A usability study.哪些方法对于帮助新手用户参与本体创建最为有效?一项可用性研究。
Database (Oxford). 2021 Jun 22;2021. doi: 10.1093/database/baab035.
3
Measurement Recorder: developing a useful tool for making species descriptions that produces computable phenotypes.
记录器:开发一种有用的工具来进行物种描述,生成可计算的表型。
Database (Oxford). 2020 Nov 20;2020. doi: 10.1093/database/baaa079.
4
Annotation of phenotypes using ontologies: a gold standard for the training and evaluation of natural language processing systems.使用本体论对表型进行注释:自然语言处理系统的培训和评估的黄金标准。
Database (Oxford). 2018 Jan 1;2018:bay110. doi: 10.1093/database/bay110.
5
Modifier Ontologies for frequency, certainty, degree, and coverage phenotype modifier.用于频率、确定性、程度和覆盖表型修饰符的修饰符本体。
Biodivers Data J. 2018 Nov 28(6):e29232. doi: 10.3897/BDJ.6.e29232. eCollection 2018.
6
Incentivising use of structured language in biological descriptions: Author-driven phenotype data and ontology production.激励在生物学描述中使用结构化语言:作者驱动的表型数据与本体生成。
Biodivers Data J. 2018 Nov 7(6):e29616. doi: 10.3897/BDJ.6.e29616. eCollection 2018.
7
Crowd-sourcing and author submission as alternatives to professional curation.众包和作者投稿作为专业编目的替代方式。
Database (Oxford). 2016 Dec 26;2016. doi: 10.1093/database/baw149. Print 2016.
8
Introducing Explorer of Taxon Concepts with a case study on spider measurement matrix building.通过蜘蛛测量矩阵构建的案例研究介绍分类单元概念探索器。
BMC Bioinformatics. 2016 Nov 17;17(1):471. doi: 10.1186/s12859-016-1352-7.
9
Development of a classification scheme for disease-related enzyme information.疾病相关酶信息分类方案的制定。
BMC Bioinformatics. 2011 Aug 9;12:329. doi: 10.1186/1471-2105-12-329.
10
Text mining and manual curation of chemical-gene-disease networks for the comparative toxicogenomics database (CTD).文本挖掘和化学-基因-疾病网络的人工整理用于比较毒理学基因组数据库(CTD)。
BMC Bioinformatics. 2009 Oct 8;10:326. doi: 10.1186/1471-2105-10-326.