• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

数字化生物样本库作为性状数据来源和VertNet新资源的重要性。

The importance of digitized biocollections as a source of trait data and a new VertNet resource.

作者信息

Guralnick Robert P, Zermoglio Paula F, Wieczorek John, LaFrance Raphael, Bloom David, Russell Laura

机构信息

University of Florida Museum of Natural History University of Florida at Gainesville, Gainesville, FL, USA

Departamento de Ecología, Genética y Evolución, Instituto IEGEBA (CONICET-UBA), Facultad de Ciencias Exactas y Naturales, Universidad de Buenos Aires, Buenos Aires, Argentina.

出版信息

Database (Oxford). 2016 Dec 26;2016. doi: 10.1093/database/baw158. Print 2016.

DOI:10.1093/database/baw158
PMID:28025346
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5199146/
Abstract

For vast areas of the globe and large parts of the tree of life, data needed to inform trait diversity is incomplete. Such trait data, when fully assembled, however, form the link between the evolutionary history of organisms, their assembly into communities, and the nature and functioning of ecosystems. Recent efforts to close data gaps have focused on collating trait-by-species databases, which only provide species-level, aggregated value ranges for traits of interest and often lack the direct observations on which those ranges are based. Perhaps under-appreciated is that digitized biocollection records collectively contain a vast trove of trait data measured directly from individuals, but this content remains hidden and highly heterogeneous, impeding discoverability and use. We developed and deployed a suite of openly accessible software tools in order to collate a full set of trait descriptions and extract two key traits, body length and mass, from >18 million specimen records in VertNet, a global biodiversity data publisher and aggregator. We tested success rate of these tools against hand-checked validation data sets and characterized quality and quantity. A post-processing toolkit was developed to standardize and harmonize data sets, and to integrate this improved content into VertNet for broadest reuse. The result of this work was to add more than 1.5 million harmonized measurements on vertebrate body mass and length directly to specimen records. Rates of false positives and negatives for extracted data were extremely low. We also created new tools for filtering, querying, and assembling this research-ready vertebrate trait content for view and download. Our work has yielded a novel database and platform for harmonized trait content that will grow as tools introduced here become part of publication workflows. We close by noting how this effort extends to new communities already developing similar digitized content.Database URL: http://portal.vertnet.org/search?advanced=1.

摘要

在地球上的广大区域以及生命之树的大部分分支中,用于了解性状多样性的数据并不完整。然而,这些性状数据一旦完全汇集起来,便构成了生物体进化史、它们组成群落的方式以及生态系统的性质和功能之间的联系。近期为填补数据空白所做的努力主要集中在整理按物种分类的性状数据库,这些数据库仅提供感兴趣性状的物种层面的汇总值域,而且往往缺乏这些值域所依据的直接观测数据。或许未得到充分重视的是,数字化生物标本记录集合中集体包含了大量直接从个体测量得到的性状数据,但这些内容仍然隐藏且高度异质,阻碍了发现和利用。我们开发并部署了一套可公开访问的软件工具,以便整理出一套完整的性状描述,并从全球生物多样性数据发布和聚合平台VertNet中的1800多万条标本记录中提取两个关键性状——体长和体重。我们对照人工检查的验证数据集测试了这些工具的成功率,并对质量和数量进行了表征。开发了一个后处理工具包,用于标准化和协调数据集,并将这些改进后的内容整合到VertNet中以便最广泛地重复使用。这项工作的结果是直接在标本记录中增加了超过150万个关于脊椎动物体重和体长的统一测量值。提取数据的假阳性和假阴性率极低。我们还创建了新工具,用于筛选、查询和整理这些可供研究使用的脊椎动物性状内容以供查看和下载。我们的工作产生了一个用于统一性状内容的新颖数据库和平台,随着此处介绍的工具成为出版工作流程的一部分,该数据库和平台将会不断发展。最后,我们指出这项工作如何扩展到已经在开发类似数字化内容的新群落。数据库网址:http://portal.vertnet.org/search?advanced=1

相似文献

1
The importance of digitized biocollections as a source of trait data and a new VertNet resource.数字化生物样本库作为性状数据来源和VertNet新资源的重要性。
Database (Oxford). 2016 Dec 26;2016. doi: 10.1093/database/baw158. Print 2016.
2
A Standardized Reference Data Set for Vertebrate Taxon Name Resolution.脊椎动物分类名称解析的标准化参考数据集。
PLoS One. 2016 Jan 13;11(1):e0146894. doi: 10.1371/journal.pone.0146894. eCollection 2016.
3
Integrative Functional Genomics for Systems Genetics in GeneWeaver.org.GeneWeaver.org中用于系统遗传学的整合功能基因组学
Methods Mol Biol. 2017;1488:131-152. doi: 10.1007/978-1-4939-6427-7_6.
4
The trouble with triplets in biodiversity informatics: a data-driven case against current identifier practices.生物多样性信息学中三胞胎的问题:一个基于数据反对当前标识符做法的案例。
PLoS One. 2014 Dec 3;9(12):e114069. doi: 10.1371/journal.pone.0114069. eCollection 2014.
5
The bovine QTL viewer: a web accessible database of bovine Quantitative Trait Loci.牛QTL浏览器:一个可通过网络访问的牛数量性状基因座数据库。
BMC Bioinformatics. 2006 Jun 5;7:283. doi: 10.1186/1471-2105-7-283.
6
InterStoreDB: a generic integration resource for genetic and genomic data.InterStoreDB:用于遗传和基因组数据的通用集成资源。
J Integr Plant Biol. 2012 May;54(5):345-55. doi: 10.1111/j.1744-7909.2012.01120.x.
7
Genome-wide in silico screening for microRNA genetic variability in livestock species.全基因组计算机筛选家畜物种中的 microRNA 遗传变异。
Anim Genet. 2013 Dec;44(6):669-77. doi: 10.1111/age.12072. Epub 2013 Jul 19.
8
TraitMap: an XML-based genetic-map database combining multigenic loci and biomolecular networks.特质图谱:一个基于XML的遗传图谱数据库,整合了多基因座和生物分子网络。
Bioinformatics. 2004 Aug 4;20 Suppl 1:i152-60. doi: 10.1093/bioinformatics/bth940.
9
MADA: Malagasy Animal trait Data Archive.马达加斯加动物性状数据存档库
Ecology. 2018 Apr;99(4):990. doi: 10.1002/ecy.2167.
10
A gene-based high-resolution comparative radiation hybrid map as a framework for genome sequence assembly of a bovine chromosome 6 region associated with QTL for growth, body composition, and milk performance traits.基于基因的高分辨率比较辐射杂种图谱,作为与生长、体组成和乳性能性状的QTL相关的牛6号染色体区域基因组序列组装的框架。
BMC Genomics. 2006 Mar 16;7:53. doi: 10.1186/1471-2164-7-53.

引用本文的文献

1
Integrative species delimitation reveals an Idaho-endemic ground squirrel, (Merriam 1913).综合物种界定揭示了一种爱达荷州特有的地松鼠(Merriam,1913年)。
J Mammal. 2024 Dec 12;106(2):406-430. doi: 10.1093/jmammal/gyae135. eCollection 2025 Apr.
2
Integrating animal tracking and trait data to facilitate global ecological discoveries.整合动物追踪与特征数据以推动全球生态发现。
J Exp Biol. 2025 Feb 15;228(Suppl_1). doi: 10.1242/jeb.247981. Epub 2025 Feb 20.
3
Arctos: Community-driven innovations for managing natural and cultural history collections.

本文引用的文献

1
The environment ontology in 2016: bridging domains with increased scope, semantic density, and interoperation.2016年的环境本体:通过扩大范围、增加语义密度和实现互操作性来弥合各领域之间的差距。
J Biomed Semantics. 2016 Sep 23;7(1):57. doi: 10.1186/s13326-016-0097-6.
2
Biodiversity analysis in the digital era.数字时代的生物多样性分析。
Philos Trans R Soc Lond B Biol Sci. 2016 Sep 5;371(1702). doi: 10.1098/rstb.2015.0337.
3
Monitoring plant functional diversity from space.从太空监测植物功能多样性。
Arctos:用于管理自然和文化历史收藏的社区驱动创新。
PLoS One. 2024 May 31;19(5):e0296478. doi: 10.1371/journal.pone.0296478. eCollection 2024.
4
FloraTraiter: Automated parsing of traits from descriptive biodiversity literature.植物特征提取器:从描述性生物多样性文献中自动解析特征
Appl Plant Sci. 2024 Jan 18;12(1):e11563. doi: 10.1002/aps3.11563. eCollection 2024 Jan-Feb.
5
Recent advances in availability and synthesis of the economic costs of biological invasions.生物入侵经济成本的可得性与综合研究的最新进展。
Bioscience. 2023 Aug 22;73(8):560-574. doi: 10.1093/biosci/biad060. eCollection 2023 Aug.
6
Insect collecting bias in Arizona with a preliminary checklist of the beetles from the Sand Tank Mountains.亚利桑那州昆虫采集偏差及桑德坦克山甲虫初步清单
Biodivers Data J. 2023 Jun 28;11:e101960. doi: 10.3897/BDJ.11.e101960. eCollection 2023.
7
A solution to the challenges of interdisciplinary aggregation and use of specimen-level trait data.跨学科整合与使用样本水平特征数据挑战的解决方案。
iScience. 2022 Sep 13;25(10):105101. doi: 10.1016/j.isci.2022.105101. eCollection 2022 Oct 21.
8
Digitized collections elucidate invasion history and patterns of awn polymorphism in Microstegium vimineum.数字化馆藏阐明了芒属植物芒的入侵历史和芒刺多态性模式。
Am J Bot. 2022 May;109(5):689-705. doi: 10.1002/ajb2.1852. Epub 2022 May 27.
9
Rapid phenotypic change in a polymorphic salamander over 43 years.43 年来,一种多态性蝾螈的快速表型变化。
Sci Rep. 2021 Nov 22;11(1):22681. doi: 10.1038/s41598-021-02124-2.
10
Mammalian body size is determined by interactions between climate, urbanization, and ecological traits.哺乳动物的体型由气候、城市化和生态特征之间的相互作用决定。
Commun Biol. 2021 Aug 16;4(1):972. doi: 10.1038/s42003-021-02505-3.
Nat Plants. 2016 Mar 2;2:16024. doi: 10.1038/nplants.2016.24.
4
A Natural Language Processing Tool for Large-Scale Data Extraction from Echocardiography Reports.一种用于从超声心动图报告中大规模提取数据的自然语言处理工具。
PLoS One. 2016 Apr 28;11(4):e0153749. doi: 10.1371/journal.pone.0153749. eCollection 2016.
5
Bridging Inter- and Intraspecific Trait Evolution with a Hierarchical Bayesian Approach.用分层贝叶斯方法连接种间和种内性状进化
Syst Biol. 2016 May;65(3):417-31. doi: 10.1093/sysbio/syw010. Epub 2016 Feb 23.
6
EXTRACT: interactive extraction of environment metadata and term suggestion for metagenomic sample annotation.摘要:用于宏基因组样本注释的环境元数据交互式提取和术语建议
Database (Oxford). 2016 Feb 20;2016. doi: 10.1093/database/baw005. Print 2016.
7
A Standardized Reference Data Set for Vertebrate Taxon Name Resolution.脊椎动物分类名称解析的标准化参考数据集。
PLoS One. 2016 Jan 13;11(1):e0146894. doi: 10.1371/journal.pone.0146894. eCollection 2016.
8
Natural history collections as windows on evolutionary processes.作为洞察进化过程窗口的自然历史收藏。
Mol Ecol. 2016 Feb;25(4):864-81. doi: 10.1111/mec.13529.
9
Plant functional traits have globally consistent effects on competition.植物功能性状对竞争具有全球一致的影响。
Nature. 2016 Jan 14;529(7585):204-7. doi: 10.1038/nature16476. Epub 2015 Dec 23.
10
Use of model organism and disease databases to support matchmaking for human disease gene discovery.利用模式生物和疾病数据库支持人类疾病基因发现的匹配工作。
Hum Mutat. 2015 Oct;36(10):979-84. doi: 10.1002/humu.22857. Epub 2015 Sep 8.