• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DiscoSnp-RAD:用于RAD-Seq群体基因组学的小变异体从头检测

DiscoSnp-RAD: de novo detection of small variants for RAD-Seq population genomics.

作者信息

Gauthier Jérémy, Mouden Charlotte, Suchan Tomasz, Alvarez Nadir, Arrigo Nils, Riou Chloé, Lemaitre Claire, Peterlongo Pierre

机构信息

Univ. Rennes, Inria, CNRS, IRISA, Rennes, France.

W. Szafer Institute of Botany, Polish Academy of Sciences, Krakow, Poland.

出版信息

PeerJ. 2020 Jun 10;8:e9291. doi: 10.7717/peerj.9291. eCollection 2020.

DOI:10.7717/peerj.9291
PMID:32566401
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7293188/
Abstract

Restriction site Associated DNA Sequencing (RAD-Seq) is a technique characterized by the sequencing of specific loci along the genome that is widely employed in the field of evolutionary biology since it allows to exploit variants (mainly Single Nucleotide Polymorphism-SNPs) information from entire populations at a reduced cost. Common RAD dedicated tools, such as or , are based on all-vs-all read alignments, which require consequent time and computing resources. We present an original method, DiscoSnp-RAD, that avoids this pitfall since variants are detected by exploiting specific parts of the assembly graph built from the reads, hence preventing all-vs-all read alignments. We tested the implementation on simulated datasets of increasing size, up to 1,000 samples, and on real RAD-Seq data from 259 specimens of flies, morphologically assigned to seven species. All individuals were successfully assigned to their species using both STRUCTURE and Maximum Likelihood phylogenetic reconstruction. Moreover, identified variants succeeded to reveal a within-species genetic structure linked to the geographic distribution. Furthermore, our results show that DiscoSnp-RAD is significantly faster than state-of-the-art tools. The overall results show that DiscoSnp-RAD is suitable to identify variants from RAD-Seq data, it does not require time-consuming parameterization steps and it stands out from other tools due to its completely different principle, making it substantially faster, in particular on large datasets.

摘要

限制性酶切位点关联DNA测序(RAD-Seq)是一种通过对基因组上特定位点进行测序的技术,因其能够以较低成本利用整个群体的变异(主要是单核苷酸多态性-SNP)信息,而在进化生物学领域得到广泛应用。常见的RAD专用工具,如 或 ,基于全对全读段比对,这需要大量的时间和计算资源。我们提出了一种原创方法DiscoSnp-RAD,该方法避免了这一缺陷,因为变异是通过利用从读段构建的组装图的特定部分来检测的,从而避免了全对全读段比对。我们在不断增大规模直至1000个样本的模拟数据集以及来自259个形态学上归为7个物种的果蝇样本的真实RAD-Seq数据上测试了该实现方法。使用STRUCTURE和最大似然系统发育重建方法,所有个体都成功地被归为其所属物种。此外,鉴定出的变异成功揭示了与地理分布相关的种内遗传结构。此外,我们的结果表明DiscoSnp-RAD比现有工具显著更快。总体结果表明,DiscoSnp-RAD适用于从RAD-Seq数据中鉴定变异,不需要耗时的参数化步骤,并且由于其完全不同的原理而在其他工具中脱颖而出,使其速度大幅提升,尤其是在大型数据集上。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca43/7293188/e997493694af/peerj-08-9291-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca43/7293188/d6cb87ffc5ad/peerj-08-9291-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca43/7293188/98a9bcb8899a/peerj-08-9291-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca43/7293188/e7d7d2350dfe/peerj-08-9291-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca43/7293188/e997493694af/peerj-08-9291-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca43/7293188/d6cb87ffc5ad/peerj-08-9291-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca43/7293188/98a9bcb8899a/peerj-08-9291-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca43/7293188/e7d7d2350dfe/peerj-08-9291-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca43/7293188/e997493694af/peerj-08-9291-g004.jpg

相似文献

1
DiscoSnp-RAD: de novo detection of small variants for RAD-Seq population genomics.DiscoSnp-RAD:用于RAD-Seq群体基因组学的小变异体从头检测
PeerJ. 2020 Jun 10;8:e9291. doi: 10.7717/peerj.9291. eCollection 2020.
2
Low impact of different SNP panels from two building-loci pipelines on RAD-Seq population genomic metrics: case study on five diverse aquatic species.来自两个构建位点流程的不同单核苷酸多态性(SNP)面板对简化基因组测序(RAD-Seq)群体基因组指标的影响较小:以五种不同水生物种为例
BMC Genomics. 2021 Mar 2;22(1):150. doi: 10.1186/s12864-021-07465-w.
3
Selecting RAD-Seq Data Analysis Parameters for Population Genetics: The More the Better?为群体遗传学选择RAD-Seq数据分析参数:越多越好?
Front Genet. 2019 May 29;10:533. doi: 10.3389/fgene.2019.00533. eCollection 2019.
4
Exploring the utility of cross-laboratory RAD-sequencing datasets for phylogenetic analysis.探索跨实验室RAD测序数据集在系统发育分析中的效用。
BMC Res Notes. 2015 Jul 8;8:299. doi: 10.1186/s13104-015-1261-2.
5
PMERGE: Computational filtering of paralogous sequences from RAD-seq data.PMERGE:从RAD-seq数据中对旁系同源序列进行计算过滤
Ecol Evol. 2018 Jun 11;8(14):7002-7013. doi: 10.1002/ece3.4219. eCollection 2018 Jul.
6
Population structure of Atlantic mackerel inferred from RAD-seq-derived SNP markers: effects of sequence clustering parameters and hierarchical SNP selection.基于RAD测序衍生的SNP标记推断大西洋鲭鱼的种群结构:序列聚类参数和分层SNP选择的影响
Mol Ecol Resour. 2016 Jul;16(4):991-1001. doi: 10.1111/1755-0998.12518. Epub 2016 Mar 22.
7
Stacks 2: Analytical methods for paired-end sequencing improve RADseq-based population genomics.Stacks 2:用于双端测序的分析方法改进了基于 RADseq 的群体基因组学。
Mol Ecol. 2019 Nov;28(21):4737-4754. doi: 10.1111/mec.15253. Epub 2019 Oct 17.
8
An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individuals.一种用于对来自多个个体的重叠双末端RAD reads进行本地从头组装的优化方法。
R Soc Open Sci. 2018 Feb 28;5(2):171589. doi: 10.1098/rsos.171589. eCollection 2018 Feb.
9
Assessing the potential of RAD-sequencing to resolve phylogenetic relationships within species radiations: The fly genus Chiastocheta (Diptera: Anthomyiidae) as a case study.评估RAD测序在解析物种辐射内系统发育关系方面的潜力:以果蝇属Chiastocheta(双翅目:花蝇科)为例进行研究。
Mol Phylogenet Evol. 2017 Sep;114:189-198. doi: 10.1016/j.ympev.2017.06.012. Epub 2017 Jun 21.
10
Reference-free detection of isolated SNPs.孤立单核苷酸多态性的无参考检测
Nucleic Acids Res. 2015 Jan;43(2):e11. doi: 10.1093/nar/gku1187. Epub 2014 Nov 17.

引用本文的文献

1
A scalable distributed pipeline for reference-free variants calling.一种用于无参考变异检测的可扩展分布式流程。
BMC Genomics. 2025 Jun 3;26(Suppl 1):557. doi: 10.1186/s12864-025-11722-7.
2
K-mer-based Approaches to Bridging Pangenomics and Population Genetics.基于K-mer的泛基因组学与群体遗传学关联方法。
Mol Biol Evol. 2025 Mar 5;42(3). doi: 10.1093/molbev/msaf047.
3
Genetic Homogeneity of subsp. Strains in Kazakhstan.哈萨克斯坦亚种菌株的遗传同质性。

本文引用的文献

1
ipyrad: Interactive assembly and analysis of RADseq datasets.ipyrad:RADseq 数据集的交互式组装和分析。
Bioinformatics. 2020 Apr 15;36(8):2592-2594. doi: 10.1093/bioinformatics/btz966.
2
Stacks 2: Analytical methods for paired-end sequencing improve RADseq-based population genomics.Stacks 2:用于双端测序的分析方法改进了基于 RADseq 的群体基因组学。
Mol Ecol. 2019 Nov;28(21):4737-4754. doi: 10.1111/mec.15253. Epub 2019 Oct 17.
3
Deriving genotypes from RAD-seq short-read data using Stacks.使用 Stacks 从 RAD-seq 短读数据中推导基因型。
Pathogens. 2024 Jul 12;13(7):581. doi: 10.3390/pathogens13070581.
4
Taxonomic review of (Spix, 1823) (Primates, Callitrichidae), and description of a new species.(Spix,1823)(灵长目,Callitrichidae)的分类学研究及一新种的描述。
PeerJ. 2023 Jan 11;11:e14526. doi: 10.7717/peerj.14526. eCollection 2023.
5
An integrative analysis uncovers a new, pseudo-cryptic species of Amazonian marmoset (Primates: Callitrichidae: Mico) from the arc of deforestation.一项综合分析揭示了一种新的、拟隐匿的亚马孙狨猴(灵长目:卷尾猴科:狨猴属)物种,来自森林砍伐区的弧形地带。
Sci Rep. 2021 Aug 2;11(1):15665. doi: 10.1038/s41598-021-93943-w.
Nat Protoc. 2017 Dec;12(12):2640-2659. doi: 10.1038/nprot.2017.123. Epub 2017 Nov 30.
4
Assessing the potential of RAD-sequencing to resolve phylogenetic relationships within species radiations: The fly genus Chiastocheta (Diptera: Anthomyiidae) as a case study.评估RAD测序在解析物种辐射内系统发育关系方面的潜力:以果蝇属Chiastocheta(双翅目:花蝇科)为例进行研究。
Mol Phylogenet Evol. 2017 Sep;114:189-198. doi: 10.1016/j.ympev.2017.06.012. Epub 2017 Jun 21.
5
RADcap: sequence capture of dual-digest RADseq libraries with identifiable duplicates and reduced missing data.RADcap:具有可识别重复序列和减少缺失数据的双酶切RADseq文库的序列捕获
Mol Ecol Resour. 2016 Sep;16(5):1264-78. doi: 10.1111/1755-0998.12566.
6
Hybridization Capture Using RAD Probes (hyRAD), a New Tool for Performing Genomic Analyses on Collection Specimens.使用RAD探针的杂交捕获技术(hyRAD),一种对采集样本进行基因组分析的新工具。
PLoS One. 2016 Mar 21;11(3):e0151651. doi: 10.1371/journal.pone.0151651. eCollection 2016.
7
Clustering Genes of Common Evolutionary History.具有共同进化历史的基因聚类
Mol Biol Evol. 2016 Jun;33(6):1590-605. doi: 10.1093/molbev/msw038. Epub 2016 Feb 17.
8
Harnessing the power of RADseq for ecological and evolutionary genomics.利用RADseq技术助力生态与进化基因组学研究
Nat Rev Genet. 2016 Feb;17(2):81-92. doi: 10.1038/nrg.2015.28. Epub 2016 Jan 5.
9
Short Tree, Long Tree, Right Tree, Wrong Tree: New Acquisition Bias Corrections for Inferring SNP Phylogenies.短树、长树、正确树、错误树:用于推断单核苷酸多态性系统发育的新获取偏差校正方法
Syst Biol. 2015 Nov;64(6):1032-47. doi: 10.1093/sysbio/syv053. Epub 2015 Jul 29.
10
Reference-free detection of isolated SNPs.孤立单核苷酸多态性的无参考检测
Nucleic Acids Res. 2015 Jan;43(2):e11. doi: 10.1093/nar/gku1187. Epub 2014 Nov 17.