• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

预测真核生物生命之树上的RAD测序标记数量。

Predicting RAD-seq Marker Numbers across the Eukaryotic Tree of Life.

作者信息

Herrera Santiago, Reyes-Herrera Paula H, Shank Timothy M

机构信息

Biology Department, Woods Hole Oceanographic Institution Biology Department, Massachusetts Institute of Technology

Colombian Corporation for Agricultural Research (CORPOICA), Bogotá, Colombia.

出版信息

Genome Biol Evol. 2015 Nov 3;7(12):3207-25. doi: 10.1093/gbe/evv210.

DOI:10.1093/gbe/evv210
PMID:26537225
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4700943/
Abstract

High-throughput sequencing of reduced representation libraries obtained through digestion with restriction enzymes--generically known as restriction site associated DNA sequencing (RAD-seq)--is a common strategy to generate genome-wide genotypic and sequence data from eukaryotes. A critical design element of any RAD-seq study is knowledge of the approximate number of genetic markers that can be obtained for a taxon using different restriction enzymes, as this number determines the scope of a project, and ultimately defines its success. This number can only be directly determined if a reference genome sequence is available, or it can be estimated if the genome size and restriction recognition sequence probabilities are known. However, both scenarios are uncommon for nonmodel species. Here, we performed systematic in silico surveys of recognition sequences, for diverse and commonly used type II restriction enzymes across the eukaryotic tree of life. Our observations reveal that recognition sequence frequencies for a given restriction enzyme are strikingly variable among broad eukaryotic taxonomic groups, being largely determined by phylogenetic relatedness. We demonstrate that genome sizes can be predicted from cleavage frequency data obtained with restriction enzymes targeting "neutral" elements. Models based on genomic compositions are also effective tools to accurately calculate probabilities of recognition sequences across taxa, and can be applied to species for which reduced representation data are available (including transcriptomes and neutral RAD-seq data sets). The analytical pipeline developed in this study, PredRAD (https://github.com/phrh/PredRAD), and the resulting databases constitute valuable resources that will help guide the design of any study using RAD-seq or related methods.

摘要

通过用限制酶消化获得的简化代表性文库的高通量测序——一般称为限制性位点相关DNA测序(RAD-seq)——是从真核生物中生成全基因组基因型和序列数据的常用策略。任何RAD-seq研究的一个关键设计要素是了解使用不同限制酶可以为一个分类群获得的遗传标记的大致数量,因为这个数量决定了一个项目的范围,并最终决定其成败。只有在有参考基因组序列的情况下才能直接确定这个数量,或者如果已知基因组大小和限制识别序列概率,也可以进行估计。然而,这两种情况对于非模式物种来说都不常见。在这里,我们对真核生物生命树中各种常用的II型限制酶的识别序列进行了系统的计算机模拟调查。我们的观察结果表明,给定限制酶的识别序列频率在广泛的真核生物分类群中存在显著差异,这在很大程度上由系统发育相关性决定。我们证明,可以根据针对“中性”元件的限制酶获得的切割频率数据来预测基因组大小。基于基因组组成的模型也是准确计算不同分类群识别序列概率的有效工具,并且可以应用于有简化代表性数据可用的物种(包括转录组和中性RAD-seq数据集)。本研究开发的分析流程PredRAD(https://github.com/phrh/PredRAD)以及由此产生的数据库构成了宝贵的资源,将有助于指导任何使用RAD-seq或相关方法的研究设计。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3386/4700943/90969d0261f5/evv210f9p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3386/4700943/b80444f94a2f/evv210f1p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3386/4700943/94fd8c9ee9ae/evv210f2p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3386/4700943/7c87937361a1/evv210f3p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3386/4700943/0f1c790fc490/evv210f4p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3386/4700943/36a995965dd3/evv210f5p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3386/4700943/848e9568b10e/evv210f6p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3386/4700943/90fc04917d3b/evv210f7p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3386/4700943/f0a748a26971/evv210f8p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3386/4700943/90969d0261f5/evv210f9p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3386/4700943/b80444f94a2f/evv210f1p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3386/4700943/94fd8c9ee9ae/evv210f2p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3386/4700943/7c87937361a1/evv210f3p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3386/4700943/0f1c790fc490/evv210f4p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3386/4700943/36a995965dd3/evv210f5p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3386/4700943/848e9568b10e/evv210f6p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3386/4700943/90fc04917d3b/evv210f7p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3386/4700943/f0a748a26971/evv210f8p.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3386/4700943/90969d0261f5/evv210f9p.jpg

相似文献

1
Predicting RAD-seq Marker Numbers across the Eukaryotic Tree of Life.预测真核生物生命之树上的RAD测序标记数量。
Genome Biol Evol. 2015 Nov 3;7(12):3207-25. doi: 10.1093/gbe/evv210.
2
Experimental validation of in silico predicted RAD locus frequencies using genomic resources and short read data from a model marine mammal.利用基因组资源和来自模式海洋哺乳动物的短读序列数据对 RAD 基因座频率进行计算机预测的实验验证。
BMC Genomics. 2019 Jan 22;20(1):72. doi: 10.1186/s12864-019-5440-8.
3
Development of a universal double-digest RAD sequencing approach for a group of nonmodel, ecologically and economically important insect and fish taxa.为一组非模式、具有生态和经济重要性的昆虫和鱼类分类群开发通用的双酶切RAD测序方法。
Mol Ecol Resour. 2016 Nov;16(6):1303-1314. doi: 10.1111/1755-0998.12527. Epub 2016 May 17.
4
Population genomic analysis of model and nonmodel organisms using sequenced RAD tags.使用测序RAD标签对模式生物和非模式生物进行群体基因组分析。
Methods Mol Biol. 2012;888:235-60. doi: 10.1007/978-1-61779-870-2_14.
5
Sequence Capture versus Restriction Site Associated DNA Sequencing for Shallow Systematics.靶向捕获测序与限制性位点相关 DNA 测序在浅层系统学中的应用。
Syst Biol. 2016 Sep;65(5):910-24. doi: 10.1093/sysbio/syw036. Epub 2016 Jun 10.
6
Deriving genotypes from RAD-seq short-read data using Stacks.使用 Stacks 从 RAD-seq 短读数据中推导基因型。
Nat Protoc. 2017 Dec;12(12):2640-2659. doi: 10.1038/nprot.2017.123. Epub 2017 Nov 30.
7
Exploring the utility of cross-laboratory RAD-sequencing datasets for phylogenetic analysis.探索跨实验室RAD测序数据集在系统发育分析中的效用。
BMC Res Notes. 2015 Jul 8;8:299. doi: 10.1186/s13104-015-1261-2.
8
Inferring phylogenies from RAD sequence data.从 RAD 序列数据推断系统发育关系。
PLoS One. 2012;7(4):e33394. doi: 10.1371/journal.pone.0033394. Epub 2012 Apr 6.
9
Double-digest RAD sequencing using Ion Proton semiconductor platform (ddRADseq-ion) with nonmodel organisms.使用离子质子半导体平台(ddRADseq-ion)对非模式生物进行双酶切RAD测序。
Mol Ecol Resour. 2015 Nov;15(6):1316-29. doi: 10.1111/1755-0998.12406. Epub 2015 Apr 6.
10
Analytical workflow of double-digest restriction site-associated DNA sequencing based on empirical and in silico optimization in tomato.基于经验和计算机模拟优化的番茄双酶切限制性位点相关DNA测序分析流程
DNA Res. 2016 Apr;23(2):145-53. doi: 10.1093/dnares/dsw004. Epub 2016 Feb 29.

引用本文的文献

1
Evaluating restriction enzyme selection for reduced representation sequencing in conservation genomics.评估保护基因组学中简化代表性测序的限制性内切酶选择
Mol Ecol Resour. 2025 Jul;25(5):e13865. doi: 10.1111/1755-0998.13865. Epub 2023 Sep 14.
2
Population Genomics Analysis with RAD, Reprised: Stacks 2.基于 RAD 技术的群体基因组分析:Stacks2
Methods Mol Biol. 2022;2498:99-149. doi: 10.1007/978-1-0716-2313-8_7.
3
Facilitating population genomics of non-model organisms through optimized experimental design for reduced representation sequencing.

本文引用的文献

1
RAD sequencing enables unprecedented phylogenetic resolution and objective species delimitation in recalcitrant divergent taxa.RAD测序能够在顽固的分歧类群中实现前所未有的系统发育分辨率和客观的物种界定。
Mol Phylogenet Evol. 2016 Jul;100:70-79. doi: 10.1016/j.ympev.2016.03.010. Epub 2016 Mar 15.
2
Synthesis of phylogeny and taxonomy into a comprehensive tree of life.将系统发育学和分类学整合为一个全面的生命之树。
Proc Natl Acad Sci U S A. 2015 Oct 13;112(41):12764-9. doi: 10.1073/pnas.1423041112. Epub 2015 Sep 18.
3
The evolution of animal genomes.
通过优化简化代表性测序的实验设计,促进非模式生物的群体基因组学研究。
BMC Genomics. 2021 Aug 21;22(1):625. doi: 10.1186/s12864-021-07917-3.
4
OpenTree: A Python Package for Accessing and Analyzing Data from the Open Tree of Life.OpenTree:一个用于访问和分析开放生命树数据的 Python 包。
Syst Biol. 2021 Oct 13;70(6):1295-1301. doi: 10.1093/sysbio/syab033.
5
The rise and fall of the ancient northern pike master sex-determining gene.古代白斑狗鱼性别决定主基因的兴衰
Elife. 2021 Jan 28;10:e62858. doi: 10.7554/eLife.62858.
6
Double-digest RAD-sequencing: do pre- and post-sequencing protocol parameters impact biological results?双酶切 RAD 测序:测序前后的协议参数是否会影响生物学结果?
Mol Genet Genomics. 2021 Mar;296(2):457-471. doi: 10.1007/s00438-020-01756-9. Epub 2021 Jan 20.
7
Evolutionary selection against short nucleotide sequences in viruses and their related hosts.病毒及其相关宿主中对短核苷酸序列的进化选择。
DNA Res. 2020 Apr 1;27(2). doi: 10.1093/dnares/dsaa008.
8
Population genomics of rapidly invading lionfish in the Caribbean reveals signals of range expansion in the absence of spatial population structure.加勒比地区迅速入侵的狮子鱼的种群基因组学揭示了在缺乏空间种群结构的情况下范围扩张的信号。
Ecol Evol. 2019 Feb 10;9(6):3306-3320. doi: 10.1002/ece3.4952. eCollection 2019 Mar.
9
Experimental validation of in silico predicted RAD locus frequencies using genomic resources and short read data from a model marine mammal.利用基因组资源和来自模式海洋哺乳动物的短读序列数据对 RAD 基因座频率进行计算机预测的实验验证。
BMC Genomics. 2019 Jan 22;20(1):72. doi: 10.1186/s12864-019-5440-8.
10
Complex signatures of genomic variation of two non-model marine species in a homogeneous environment.两种均一环境下非模式海洋物种基因组变异的复杂特征。
BMC Genomics. 2018 May 9;19(1):347. doi: 10.1186/s12864-018-4721-y.
动物基因组的进化。
Curr Opin Genet Dev. 2015 Dec;35:25-32. doi: 10.1016/j.gde.2015.08.006. Epub 2015 Sep 9.
4
Evolutionary and biogeographical patterns of barnacles from deep-sea hydrothermal vents.深海热液喷口藤壶的进化与生物地理模式。
Mol Ecol. 2015 Feb;24(3):673-89. doi: 10.1111/mec.13054.
5
Amplification biases and consistent recovery of loci in a double-digest RAD-seq protocol.双酶切RAD-seq方案中的扩增偏差和位点的一致回收
PLoS One. 2014 Sep 4;9(9):e106713. doi: 10.1371/journal.pone.0106713. eCollection 2014.
6
Restriction site-associated DNA sequencing, genotyping error estimation and de novo assembly optimization for population genetic inference.基于限制位点相关 DNA 测序的种群遗传推断中的基因分型错误估计和从头组装优化。
Mol Ecol Resour. 2015 Jan;15(1):28-41. doi: 10.1111/1755-0998.12291. Epub 2014 Jul 3.
7
SimRAD: an R package for simulation-based prediction of the number of loci expected in RADseq and similar genotyping by sequencing approaches.SimRAD:一个用于基于模拟预测RADseq及类似测序基因分型方法中预期位点数量的R包。
Mol Ecol Resour. 2014 Nov;14(6):1314-21. doi: 10.1111/1755-0998.12273. Epub 2014 May 26.
8
ezRAD: a simplified method for genomic genotyping in non-model organisms.ezRAD:一种简化的非模式生物基因组基因分型方法。
PeerJ. 2013 Nov 19;1:e203. doi: 10.7717/peerj.203. eCollection 2013.
9
Stacks: an analysis tool set for population genomics.Stacks:用于群体基因组学的分析工具集。
Mol Ecol. 2013 Jun;22(11):3124-40. doi: 10.1111/mec.12354. Epub 2013 May 24.
10
Adaptive evolution during an ongoing range expansion: the invasive bank vole (Myodes glareolus) in Ireland.持续扩张过程中的适应性进化:爱尔兰的入侵田鼠(Myodes glareolus)。
Mol Ecol. 2013 Jun;22(11):2971-85. doi: 10.1111/mec.12343. Epub 2013 May 24.