• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

dDocent:一种 RADseq 变体调用管道,专为非模式生物的群体基因组学设计。

dDocent: a RADseq, variant-calling pipeline designed for population genomics of non-model organisms.

机构信息

Marine Genomics Laboratory, Harte Research Institute, Texas A&M University-Corpus Christi , Corpus Christi, TX , USA.

出版信息

PeerJ. 2014 Jun 10;2:e431. doi: 10.7717/peerj.431. eCollection 2014.

DOI:10.7717/peerj.431
PMID:24949246
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4060032/
Abstract

Restriction-site associated DNA sequencing (RADseq) has become a powerful and useful approach for population genomics. Currently, no software exists that utilizes both paired-end reads from RADseq data to efficiently produce population-informative variant calls, especially for non-model organisms with large effective population sizes and high levels of genetic polymorphism. dDocent is an analysis pipeline with a user-friendly, command-line interface designed to process individually barcoded RADseq data (with double cut sites) into informative SNPs/Indels for population-level analyses. The pipeline, written in BASH, uses data reduction techniques and other stand-alone software packages to perform quality trimming and adapter removal, de novo assembly of RAD loci, read mapping, SNP and Indel calling, and baseline data filtering. Double-digest RAD data from population pairings of three different marine fishes were used to compare dDocent with Stacks, the first generally available, widely used pipeline for analysis of RADseq data. dDocent consistently identified more SNPs shared across greater numbers of individuals and with higher levels of coverage. This is due to the fact that dDocent quality trims instead of filtering, incorporates both forward and reverse reads (including reads with INDEL polymorphisms) in assembly, mapping, and SNP calling. The pipeline and a comprehensive user guide can be found at http://dDocent.wordpress.com.

摘要

限制性位点相关 DNA 测序(RADseq)已经成为群体基因组学的一种强大而有用的方法。目前,还没有软件可以利用 RADseq 数据的配对末端读取来有效地生成群体信息变异调用,特别是对于具有大有效种群大小和高水平遗传多态性的非模式生物。dDocent 是一个具有用户友好的命令行界面的分析管道,旨在将单独标记的 RADseq 数据(具有双切割位点)处理成有信息的 SNP/Indels,用于群体水平分析。该管道使用 BASH 编写,利用数据缩减技术和其他独立的软件包来执行质量修剪和接头去除、RAD 位点的从头组装、读映射、SNP 和 Indel 调用以及基线数据过滤。使用来自三种不同海洋鱼类的群体配对的双消化 RAD 数据来比较 dDocent 和 Stacks,Stacks 是第一个可用于 RADseq 数据分析的广泛使用的管道。dDocent 始终能够识别更多的 SNP,这些 SNP 在更多的个体中共享,并且具有更高的覆盖水平。这是因为 dDocent 进行质量修剪而不是过滤,在组装、映射和 SNP 调用中同时包含正向和反向读取(包括具有 INDEL 多态性的读取)。该管道和综合用户指南可在 http://dDocent.wordpress.com 上找到。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d51/4060032/9063b0b1d768/peerj-02-431-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d51/4060032/464c26384d3c/peerj-02-431-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d51/4060032/9063b0b1d768/peerj-02-431-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d51/4060032/464c26384d3c/peerj-02-431-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7d51/4060032/9063b0b1d768/peerj-02-431-g002.jpg

相似文献

1
dDocent: a RADseq, variant-calling pipeline designed for population genomics of non-model organisms.dDocent:一种 RADseq 变体调用管道,专为非模式生物的群体基因组学设计。
PeerJ. 2014 Jun 10;2:e431. doi: 10.7717/peerj.431. eCollection 2014.
2
gmRAD: an integrated SNP calling pipeline for genetic mapping with RADseq across a hybrid population.gmRAD:一种用于在杂交群体中通过RADseq进行遗传图谱绘制的综合单核苷酸多态性(SNP)检测流程。
Brief Bioinform. 2020 Jan 17;21(1):329-337. doi: 10.1093/bib/bby114.
3
PyRAD: assembly of de novo RADseq loci for phylogenetic analyses.PyRAD:用于系统发育分析的从头RADseq位点组装
Bioinformatics. 2014 Jul 1;30(13):1844-9. doi: 10.1093/bioinformatics/btu121. Epub 2014 Mar 5.
4
A reference-free approach to analyse RADseq data using standard next generation sequencing toolkits.一种使用标准下一代测序工具包分析 RADseq 数据的无参考方法。
Mol Ecol Resour. 2021 May;21(4):1085-1097. doi: 10.1111/1755-0998.13324. Epub 2021 Feb 8.
5
Read trimming has minimal effect on bacterial SNP-calling accuracy.reads 修剪对细菌 SNP 调用准确性的影响最小。
Microb Genom. 2020 Dec;6(12). doi: 10.1099/mgen.0.000434. Epub 2020 Dec 11.
6
Stacks 2: Analytical methods for paired-end sequencing improve RADseq-based population genomics.Stacks 2:用于双端测序的分析方法改进了基于 RADseq 的群体基因组学。
Mol Ecol. 2019 Nov;28(21):4737-4754. doi: 10.1111/mec.15253. Epub 2019 Oct 17.
7
AftrRAD: a pipeline for accurate and efficient de novo assembly of RADseq data.AftrRAD:一种用于准确高效地对RADseq数据进行从头组装的流程。
Mol Ecol Resour. 2015 Sep;15(5):1163-71. doi: 10.1111/1755-0998.12378. Epub 2015 Feb 16.
8
Experimental validation of in silico predicted RAD locus frequencies using genomic resources and short read data from a model marine mammal.利用基因组资源和来自模式海洋哺乳动物的短读序列数据对 RAD 基因座频率进行计算机预测的实验验证。
BMC Genomics. 2019 Jan 22;20(1):72. doi: 10.1186/s12864-019-5440-8.
9
Exploring single-sample SNP and INDEL calling with whole-genome de novo assembly.利用全基因组从头组装进行单样本 SNP 和 INDEL 调用的探索。
Bioinformatics. 2012 Jul 15;28(14):1838-44. doi: 10.1093/bioinformatics/bts280. Epub 2012 May 7.
10
Software for pre-processing Illumina next-generation sequencing short read sequences.用于预处理Illumina下一代测序短读序列的软件。
Source Code Biol Med. 2014 May 3;9:8. doi: 10.1186/1751-0473-9-8. eCollection 2014.

引用本文的文献

1
The Genomic Basis of the Tristylous Floral Polymorphism: Evidence for a Role of Gene Duplications in a Region of Restricted Recombination.三型花柱花多态性的基因组基础:基因重复在有限重组区域中作用的证据
Mol Biol Evol. 2025 Jul 30;42(8). doi: 10.1093/molbev/msaf170.
2
Peripheral Budding Following Range Expansion Explains Diversity and Distribution of One-Sided Livebearing Fish.范围扩张后的外周萌芽解释了单侧卵胎生鱼类的多样性和分布。
Mol Ecol. 2025 Aug;34(16):e70023. doi: 10.1111/mec.70023. Epub 2025 Jul 11.
3
Global stock structure of the Silky shark (, Carcharhinidae) assessed with high-throughput DNA sequencing.

本文引用的文献

1
From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline.从FastQ数据到高可信度变异检测:基因组分析工具包最佳实践流程
Curr Protoc Bioinformatics. 2013;43(1110):11.10.1-11.10.33. doi: 10.1002/0471250953.bi1110s43.
2
ezRAD: a simplified method for genomic genotyping in non-model organisms.ezRAD:一种简化的非模式生物基因组基因分型方法。
PeerJ. 2013 Nov 19;1:e203. doi: 10.7717/peerj.203. eCollection 2013.
3
Genome sequencing and population genomics in non-model organisms.非模式生物的基因组测序和群体基因组学。
利用高通量DNA测序评估丝鲨(真鲨科)的全球种群结构
PeerJ. 2025 Jul 7;13:e19493. doi: 10.7717/peerj.19493. eCollection 2025.
4
Non-Random Mortality in an Experimental Oyster Restoration.实验性牡蛎恢复中的非随机死亡率
Evol Appl. 2025 Jul 6;18(7):e70128. doi: 10.1111/eva.70128. eCollection 2025 Jul.
5
Harpy: a pipeline for processing haplotagging linked-read data.哈比:一种用于处理单倍型标记连接读段数据的流程。
Bioinform Adv. 2025 Jun 5;5(1):vbaf133. doi: 10.1093/bioadv/vbaf133. eCollection 2025.
6
Harnessing Genomics for Breeding L.: Genotyping and Ploidy Testing of Clonal Lines Through ddRADseq Applications.利用基因组学进行育种L.:通过ddRADseq应用对克隆系进行基因分型和倍性检测。
Int J Mol Sci. 2025 May 20;26(10):4898. doi: 10.3390/ijms26104898.
7
Strong Environmental and Genome-Wide Population Differentiation Underpins Adaptation and High Genomic Vulnerability in the Dominant Australian Kelp ().强烈的环境和全基因组种群分化是澳大利亚优势海带适应和高基因组脆弱性的基础。
Ecol Evol. 2025 May 12;15(5):e71158. doi: 10.1002/ece3.71158. eCollection 2025 May.
8
Genomic Selection for Pea Grain Yield and Protein Content in Italian Environments for Target and Non-Target Genetic Bases.意大利环境下针对目标和非目标遗传基础的豌豆籽粒产量和蛋白质含量的基因组选择
Int J Mol Sci. 2025 Mar 25;26(7):2991. doi: 10.3390/ijms26072991.
9
Submerged Corridors of Ancient Gene Flow in an Island Amphibian.岛屿两栖动物中古代基因流动的水下通道
Mol Ecol. 2025 May;34(9):e17742. doi: 10.1111/mec.17742. Epub 2025 Apr 3.
10
Genome-wide exploration of soybean domestication traits: integrating association mapping and SNP × SNP interaction analyses.大豆驯化性状的全基因组探索:整合关联图谱和单核苷酸多态性×单核苷酸多态性相互作用分析
Plant Mol Biol. 2025 Apr 3;115(2):55. doi: 10.1007/s11103-025-01583-9.
Trends Ecol Evol. 2014 Jan;29(1):51-63. doi: 10.1016/j.tree.2013.09.008. Epub 2013 Oct 17.
4
Genotyping-by-sequencing in ecological and conservation genomics.生态与保护基因组学中的简化基因组测序
Mol Ecol. 2013 Jun;22(11):2841-7. doi: 10.1111/mec.12350. Epub 2013 May 25.
5
Stacks: an analysis tool set for population genomics.Stacks:用于群体基因组学的分析工具集。
Mol Ecol. 2013 Jun;22(11):3124-40. doi: 10.1111/mec.12354. Epub 2013 May 24.
6
CD-HIT: accelerated for clustering the next-generation sequencing data.CD-HIT:用于加速下一代测序数据聚类的工具。
Bioinformatics. 2012 Dec 1;28(23):3150-2. doi: 10.1093/bioinformatics/bts565. Epub 2012 Oct 11.
7
Rainbow: an integrated tool for efficient clustering and assembling RAD-seq reads.Rainbow:一种用于高效聚类和组装 RAD-seq 读取的集成工具。
Bioinformatics. 2012 Nov 1;28(21):2732-7. doi: 10.1093/bioinformatics/bts482. Epub 2012 Sep 1.
8
Double digest RADseq: an inexpensive method for de novo SNP discovery and genotyping in model and non-model species.双酶切 RAD 测序:一种在模式生物和非模式生物中进行从头 SNP 发现和基因分型的廉价方法。
PLoS One. 2012;7(5):e37135. doi: 10.1371/journal.pone.0037135. Epub 2012 May 31.
9
2b-RAD: a simple and flexible method for genome-wide genotyping.2b-RAD:一种简单灵活的全基因组基因分型方法。
Nat Methods. 2012 May 20;9(8):808-10. doi: 10.1038/nmeth.2023.
10
Pervasive indels and their evolutionary dynamics after the fish-specific genome duplication.鱼类特异性基因组加倍后广泛存在的缺失及其进化动态。
Mol Biol Evol. 2012 Oct;29(10):3005-22. doi: 10.1093/molbev/mss108. Epub 2012 Apr 4.