High-throughput methods for efficiently building massive phylogenies from natural history collections.

Author information

Folk Ryan A, Kates Heather R, LaFrance Raphael, Soltis Douglas E, Soltis Pamela S, Guralnick Robert P

Affiliations

Department of Biological Sciences, Mississippi State University, Mississippi State, Mississippi, USA.

Florida Museum of Natural History, University of Florida, Gainesville, Florida, USA.

Publication information

Appl Plant Sci. 2021 Feb 27;9(2):e11410. doi: 10.1002/aps3.11410. eCollection 2021 Feb.

DOI:10.1002/aps3.11410

PMID:33680581

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7910806/

Abstract

PREMISE

Large phylogenetic data sets have often been restricted to small numbers of loci from GenBank, and a vetted sampling-to-sequencing phylogenomic protocol scaling to thousands of species is not yet available. Here, we report a high-throughput collections-based approach that empowers researchers to explore more branches of the tree of life with numerous loci.

METHODS

We developed an integrated Specimen-to-Laboratory Information Management System (SLIMS), connecting sampling and wet lab efforts with progress tracking at each stage. Using unique identifiers encoded in QR codes and a taxonomic database, a research team can sample herbarium specimens, efficiently record the sampling event, and capture specimen images. After sampling in herbaria, images are uploaded to a citizen science platform for metadata generation, and tissue samples are moved through a simple, high-throughput, plate-based herbarium DNA extraction and sequencing protocol.
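
The tracking idea in this paragraph can be illustrated with a minimal sketch. It is not the authors' SLIMS: the stage names, the `SampleRecord` class, the `well_for` helper, and the 96-well (8 x 12) plate layout are all assumptions made for illustration, since the abstract does not specify them. The sketch only shows how a unique identifier (the text a QR code would carry) can tie a sampling event to ordered progress tracking and to a plate position.

```python
import uuid
from dataclasses import dataclass, field
from typing import List, Optional, Tuple

# Hypothetical stage names; the abstract only says progress is tracked "at each stage".
STAGES = ["sampled", "imaged", "extracted", "sequenced"]


@dataclass
class SampleRecord:
    taxon: str       # name checked against a taxonomic database (lookup not shown)
    herbarium: str   # collection where the specimen was sampled
    sample_id: str = field(default_factory=lambda: str(uuid.uuid4()))
    completed: List[str] = field(default_factory=list)

    def qr_payload(self) -> str:
        """Text that would be encoded in the QR code (image rendering not shown)."""
        return self.sample_id

    def advance(self, stage: str) -> None:
        """Mark a stage complete, refusing to skip or repeat steps."""
        if len(self.completed) == len(STAGES):
            raise ValueError("all stages already completed")
        expected = STAGES[len(self.completed)]
        if stage != expected:
            raise ValueError(f"expected stage {expected!r}, got {stage!r}")
        self.completed.append(stage)

    @property
    def current_stage(self) -> Optional[str]:
        return self.completed[-1] if self.completed else None


def well_for(index: int, rows: str = "ABCDEFGH", cols: int = 12) -> Tuple[int, str]:
    """Map a 0-based sample index to (plate number, well label), filling row by row."""
    plate, offset = divmod(index, len(rows) * cols)
    row, col = divmod(offset, cols)
    return plate + 1, f"{rows[row]}{col + 1:02d}"


record = SampleRecord(taxon="Genus species", herbarium="HERB")  # placeholder values
record.advance("sampled")
record.advance("imaged")
print(record.current_stage)                       # imaged
print(well_for(0), well_for(95), well_for(96))    # (1, 'A01') (1, 'H12') (2, 'A01')
```

Enforcing stage order in `advance` is one simple way to make gaps in the workflow visible early; a production system would more likely keep these records in a database than in memory.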

RESULTS

We applied this sampling-to-sequencing workflow to ~15,000 species, producing for the first time a data set with ~50% taxonomic representation of the "nitrogen-fixing clade" of angiosperms.
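
As a rough check on the scale these figures imply, the short calculation below derives the approximate clade size from the two rounded numbers in this paragraph, and the number of plates needed to hold the samples. The 96-well plate format and the one-sample-per-species simplification are assumptions, not details given in the abstract.

```python
import math

species_sampled = 15_000   # "~15,000 species" (rounded figure from the abstract)
representation = 0.50      # "~50% taxonomic representation"
wells_per_plate = 96       # assumption: plate format is not stated in the abstract

# If ~15,000 species are ~50% of the clade, the clade holds roughly twice that.
implied_clade_size = species_sampled / representation

# One well per species, ignoring controls and replicates (a simplification).
plates_needed = math.ceil(species_sampled / wells_per_plate)

print(int(implied_clade_size))  # 30000
print(plates_needed)            # 157
```

Both results inherit the rounding of the inputs, so they are order-of-magnitude estimates only.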

DISCUSSION

The approach we present is appropriate at any taxonomic scale and is extensible to other collection types. The widespread use of large-scale sampling strategies repositions herbaria as accessible but largely untapped resources for broad taxonomic sampling with thousands of species.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da38/7910806/869ac75f0ca5/APS3-9-e11410-g001.jpg
