GenErode：一个用于研究濒危和灭绝物种基因组侵蚀的生物信息学管道。

GenErode: a bioinformatics pipeline to investigate genome erosion in endangered and extinct species.

机构信息

Department of Biochemistry and Biophysics, National Bioinformatics Infrastructure Sweden, Science for Life Laboratory, Stockholm University, Solna, Sweden.

Department of Cell and Molecular Biology, National Bioinformatics Infrastructure Sweden, Science for Life Laboratory, Uppsala University, Uppsala, Sweden.

出版信息

BMC Bioinformatics. 2022 Jun 13;23(1):228. doi: 10.1186/s12859-022-04757-0.

DOI:10.1186/s12859-022-04757-0

PMID:35698034

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9195343/

Abstract

BACKGROUND

Many wild species have suffered drastic population size declines over the past centuries, which have led to 'genomic erosion' processes characterized by reduced genetic diversity, increased inbreeding, and accumulation of harmful mutations. Yet, genomic erosion estimates of modern-day populations often lack concordance with dwindling population sizes and conservation status of threatened species. One way to directly quantify the genomic consequences of population declines is to compare genome-wide data from pre-decline museum samples and modern samples. However, doing so requires computational data processing and analysis tools specifically adapted to comparative analyses of degraded, ancient or historical, DNA data with modern DNA data as well as personnel trained to perform such analyses.

RESULTS

Here, we present a highly flexible, scalable, and modular pipeline to compare patterns of genomic erosion using samples from disparate time periods. The GenErode pipeline uses state-of-the-art bioinformatics tools to simultaneously process whole-genome re-sequencing data from ancient/historical and modern samples, and to produce comparable estimates of several genomic erosion indices. No programming knowledge is required to run the pipeline and all bioinformatic steps are well-documented, making the pipeline accessible to users with different backgrounds. GenErode is written in Snakemake and Python3 and uses Conda and Singularity containers to achieve reproducibility on high-performance compute clusters. The source code is freely available on GitHub ( https://github.com/NBISweden/GenErode ).

CONCLUSIONS

GenErode is a user-friendly and reproducible pipeline that enables the standardization of genomic erosion indices from temporally sampled whole genome re-sequencing data.

摘要

背景

在过去的几个世纪里，许多野生物种的种群数量急剧减少，导致了“基因组侵蚀”过程，其特征是遗传多样性减少、近亲繁殖增加和有害突变积累。然而，现代种群的基因组侵蚀估计往往与濒危物种的种群减少和保护状况缺乏一致性。一种直接量化种群减少对基因组影响的方法是比较衰退前博物馆样本和现代样本的全基因组数据。然而，这样做需要专门用于比较降解、古老或历史 DNA 数据与现代 DNA 数据的计算数据处理和分析工具，以及受过执行此类分析培训的人员。

结果

在这里，我们提出了一个高度灵活、可扩展和模块化的管道，用于使用来自不同时期的样本比较基因组侵蚀模式。GenErode 管道使用最先进的生物信息学工具，同时处理来自古代/历史和现代样本的全基因组重测序数据，并生成几个基因组侵蚀指数的可比估计值。运行管道不需要编程知识，并且所有生物信息学步骤都有详细的文档记录，这使得不同背景的用户都可以使用该管道。GenErode 是用 Snakemake 和 Python3 编写的，并使用 Conda 和 Singularity 容器在高性能计算集群上实现可重复性。源代码可在 GitHub 上免费获得（https://github.com/NBISweden/GenErode）。

结论

GenErode 是一个用户友好且可重复的管道，它能够标准化来自时间采样的全基因组重测序数据的基因组侵蚀指数。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/55ab/9195343/2c36539a82fe/12859_2022_4757_Fig1_HTML.jpg

相似文献

GenErode: a bioinformatics pipeline to investigate genome erosion in endangered and extinct species.GenErode：一个用于研究濒危和灭绝物种基因组侵蚀的生物信息学管道。

BMC Bioinformatics. 2022 Jun 13;23(1):228. doi: 10.1186/s12859-022-04757-0.

Quantifying Temporal Genomic Erosion in Endangered Species.量化濒危物种的时间性基因组侵蚀。

Trends Ecol Evol. 2018 Mar;33(3):176-185. doi: 10.1016/j.tree.2017.12.002. Epub 2017 Dec 27.

Historical Genomes Reveal the Genomic Consequences of Recent Population Decline in Eastern Gorillas.历史基因组揭示了东部大猩猩近期种群减少的基因组后果。

Curr Biol. 2019 Jan 7;29(1):165-170.e6. doi: 10.1016/j.cub.2018.11.055. Epub 2018 Dec 27.

ILIAD: a suite of automated Snakemake workflows for processing genomic data for downstream applications.ILIAD：一套用于处理基因组数据以用于下游应用的自动化 Snakemake 工作流程套件。

BMC Bioinformatics. 2023 Nov 8;24(1):424. doi: 10.1186/s12859-023-05548-x.

Mapache: a flexible pipeline to map ancient DNA.Mapache：一个灵活的古 DNA 映射管道。

Bioinformatics. 2023 Feb 3;39(2). doi: 10.1093/bioinformatics/btad028.

RASflow: an RNA-Seq analysis workflow with Snakemake.RASflow：一个基于 Snakemake 的 RNA-Seq 分析工作流程。

BMC Bioinformatics. 2020 Mar 18;21(1):110. doi: 10.1186/s12859-020-3433-x.

kGWASflow: a modular, flexible, and reproducible Snakemake workflow for k-mers-based GWAS.kGWASflow：一种基于 k-mer 的 GWAS 的模块化、灵活和可重复的 Snakemake 工作流程。

G3 (Bethesda). 2023 Dec 29;14(1). doi: 10.1093/g3journal/jkad246.

Functional genomic diversity is correlated with neutral genomic diversity in populations of an endangered rattlesnake.功能基因组多样性与濒危响尾蛇种群中性基因组多样性相关。

Proc Natl Acad Sci U S A. 2023 Oct 24;120(43):e2303043120. doi: 10.1073/pnas.2303043120. Epub 2023 Oct 16.

Natrix: a Snakemake-based workflow for processing, clustering, and taxonomically assigning amplicon sequencing reads.Natrix：一个基于 SnakeMake 的工作流程，用于处理、聚类和分类分配扩增子测序reads。

BMC Bioinformatics. 2020 Nov 16;21(1):526. doi: 10.1186/s12859-020-03852-4.

P-DOR, an easy-to-use pipeline to reconstruct bacterial outbreaks using genomics.P-DOR，一个使用基因组学重建细菌爆发的简单易用的管道。

Bioinformatics. 2023 Sep 2;39(9). doi: 10.1093/bioinformatics/btad571.

引用本文的文献

The Genomic Basis of the Svalbard Reindeer's Adaptation to an Extreme Arctic Environment.斯瓦尔巴德驯鹿适应极端北极环境的基因组基础。

Genome Biol Evol. 2025 Sep 2;17(9). doi: 10.1093/gbe/evaf160.

Using sedimentary ancient DNA in coastal and marine contexts to explore past human-environmental interactions in Australia.利用沿海和海洋环境中的沉积古代DNA探索澳大利亚过去的人类与环境相互作用。

Philos Trans R Soc Lond B Biol Sci. 2025 Jul 10;380(1930):20240032. doi: 10.1098/rstb.2024.0032.

Genome analyses suggest recent speciation and postglacial isolation in the Norwegian lemming.基因组分析表明，挪威旅鼠近期出现了物种形成和冰期后隔离现象。

Proc Natl Acad Sci U S A. 2025 Jul 15;122(28):e2424333122. doi: 10.1073/pnas.2424333122. Epub 2025 Jun 30.

Potential risk of dipterocarps in the marginal Asian rainforests: low population size and high genomic erosion.亚洲边缘雨林中龙脑香科植物的潜在风险：种群规模小且基因组侵蚀严重。

BMC Biol. 2025 Jun 9;23(1):161. doi: 10.1186/s12915-025-02275-y.

Constraints to gene flow increase the risk of genome erosion in the Ngorongoro Crater lion population.基因流动的限制增加了恩戈罗恩戈罗火山口狮子种群基因组侵蚀的风险。

Commun Biol. 2025 Apr 21;8(1):640. doi: 10.1038/s42003-025-07986-0.

PopGLen-a Snakemake pipeline for performing population genomic analyses using genotype likelihood-based methods.PopGLen——一个使用基于基因型似然性方法进行群体基因组分析的Snakemake工作流。

Bioinformatics. 2025 Mar 4;41(3). doi: 10.1093/bioinformatics/btaf105.

A High-Throughput Ancient DNA Extraction Method for Large-Scale Sample Screening.一种用于大规模样本筛选的高通量古DNA提取方法。

Mol Ecol Resour. 2025 May;25(4):e14077. doi: 10.1111/1755-0998.14077. Epub 2025 Feb 6.

Heterochronous mitogenomes shed light on the Holocene history of the Scandinavian brown bear.异时性线粒体基因组揭示了全新世斯堪的纳维亚棕熊的历史。

Sci Rep. 2024 Oct 22;14(1):24917. doi: 10.1038/s41598-024-75028-6.

Range-wide and temporal genomic analyses reveal the consequences of near-extinction in Swedish moose.全范围和时间的基因组分析揭示了瑞典驼鹿近灭绝的后果。

Commun Biol. 2023 Oct 17;6(1):1035. doi: 10.1038/s42003-023-05385-x.

Adaptation to the High-Arctic island environment despite long-term reduced genetic variation in Svalbard reindeer.尽管斯瓦尔巴德驯鹿的遗传变异长期减少，但仍能适应北极高纬度岛屿环境。

iScience. 2023 Sep 3;26(10):107811. doi: 10.1016/j.isci.2023.107811. eCollection 2023 Oct 20.

本文引用的文献

Ancient and historical DNA in conservation policy.保护政策中的古代和历史DNA

Trends Ecol Evol. 2022 May;37(5):420-429. doi: 10.1016/j.tree.2021.12.010. Epub 2022 Jan 24.

Genomic insights into the conservation status of the world's last remaining Sumatran rhinoceros populations.对世界上最后剩余的苏门答腊犀牛种群的保护状况的基因组分析。

Nat Commun. 2021 Apr 26;12(1):2393. doi: 10.1038/s41467-021-22386-8.

Million-year-old DNA sheds light on the genomic history of mammoths.百万年前的 DNA 揭示了猛犸象的基因组历史。

Nature. 2021 Mar;591(7849):265-269. doi: 10.1038/s41586-021-03224-9. Epub 2021 Feb 17.

Twelve years of SAMtools and BCFtools.SAMtools 和 BCFtools 十二年。

Gigascience. 2021 Feb 16;10(2). doi: 10.1093/gigascience/giab008.

Population genomics reveals lack of greater white-fronted introgression into the Swedish lesser white-fronted goose.种群基因组学揭示了瑞典小白额雁中大白额雁的基因渗入较少。

Sci Rep. 2020 Oct 27;10(1):18347. doi: 10.1038/s41598-020-75315-y.

A demonstration of conservation genomics for threatened species management.保护遗传学在濒危物种管理中的应用示范。

Mol Ecol Resour. 2020 Nov;20(6):1526-1541. doi: 10.1111/1755-0998.13211. Epub 2020 Jul 24.

The Genomic Footprints of the Fall and Recovery of the Crested Ibis.朱鹮的兴衰基因组印记。

Curr Biol. 2019 Jan 21;29(2):340-349.e7. doi: 10.1016/j.cub.2018.12.008. Epub 2019 Jan 10.

Historical Genomes Reveal the Genomic Consequences of Recent Population Decline in Eastern Gorillas.历史基因组揭示了东部大猩猩近期种群减少的基因组后果。

Curr Biol. 2019 Jan 7;29(1):165-170.e6. doi: 10.1016/j.cub.2018.11.055. Epub 2018 Dec 27.

fastp: an ultra-fast all-in-one FASTQ preprocessor.fastp：一个超快速的一体化 FASTQ 预处理程序。

Bioinformatics. 2018 Sep 1;34(17):i884-i890. doi: 10.1093/bioinformatics/bty560.

Quantifying and reducing spurious alignments for the analysis of ultra-short ancient DNA sequences.量化和减少超短古 DNA 序列分析中的虚假比对。

BMC Biol. 2018 Oct 25;16(1):121. doi: 10.1186/s12915-018-0581-9.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

GenErode：一个用于研究濒危和灭绝物种基因组侵蚀的生物信息学管道。

GenErode: a bioinformatics pipeline to investigate genome erosion in endangered and extinct species.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献