• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用基因组资源和来自模式海洋哺乳动物的短读序列数据对 RAD 基因座频率进行计算机预测的实验验证。

Experimental validation of in silico predicted RAD locus frequencies using genomic resources and short read data from a model marine mammal.

机构信息

Department of Animal Behavior, University of Bielefeld, Postfach 100131, 33615, Bielefeld, Germany.

British Antarctic Survey, High Cross, Madingley Road, Cambridge, CB3 OET, UK.

出版信息

BMC Genomics. 2019 Jan 22;20(1):72. doi: 10.1186/s12864-019-5440-8.

DOI:10.1186/s12864-019-5440-8
PMID:30669975
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6341687/
Abstract

BACKGROUND

Restriction site-associated DNA sequencing (RADseq) has revolutionized the study of wild organisms by allowing cost-effective genotyping of thousands of loci. However, for species lacking reference genomes, it can be challenging to select the restriction enzyme that offers the best balance between the number of obtained RAD loci and depth of coverage, which is crucial for a successful outcome. To address this issue, PredRAD was recently developed, which uses probabilistic models to predict restriction site frequencies from a transcriptome assembly or other sequence resource based on either GC content or mono-, di- or trinucleotide composition. This program generates predictions that are broadly consistent with estimates of the true number of restriction sites obtained through in silico digestion of available reference genome assemblies. However, in practice the actual number of loci obtained could potentially differ as incomplete enzymatic digestion or patchy sequence coverage across the genome might lead to some loci not being represented in a RAD dataset, while erroneous assembly could potentially inflate the number of loci. To investigate this, we used genome and transcriptome assemblies together with RADseq data from the Antarctic fur seal (Arctocephalus gazella) to compare PredRAD predictions with empirical estimates of the number of loci obtained via in silico digestion and from de novo assemblies.

RESULTS

PredRAD yielded consistently higher predicted numbers of restriction sites for the transcriptome assembly relative to the genome assembly. The trinucleotide and dinucleotide models also predicted higher frequencies than the mononucleotide or GC content models. Overall, the dinucleotide and trinucleotide models applied to the transcriptome and the genome assemblies respectively generated predictions that were closest to the number of restriction sites estimated by in silico digestion. Furthermore, the number of de novo assembled RAD loci mapping to restriction sites was similar to the expectation based on in silico digestion.

CONCLUSIONS

Our study reveals generally high concordance between PredRAD predictions and empirical estimates of the number of RAD loci. This further supports the utility of PredRAD, while also suggesting that it may be feasible to sequence and assemble the majority of RAD loci present in an organism's genome.

摘要

背景

限制酶相关 DNA 测序(RADseq)通过允许对数千个基因座进行经济高效的基因分型,彻底改变了对野生生物的研究。然而,对于缺乏参考基因组的物种来说,选择既能获得最多 RAD 基因座数量又能保证覆盖深度的限制酶是具有挑战性的,而这对成功的结果至关重要。为了解决这个问题,最近开发了 PredRAD,它使用概率模型根据 GC 含量或单、二或三核苷酸组成,从转录组组装或其他序列资源中预测限制酶位点频率。该程序生成的预测结果与通过对现有参考基因组组装进行计算机消化获得的真实限制酶位点数量的估计大致一致。然而,在实践中,实际获得的基因座数量可能会有所不同,因为不完全的酶消化或基因组覆盖不均匀可能导致某些基因座在 RAD 数据集上没有被代表,而错误的组装可能会潜在地增加基因座数量。为了研究这一点,我们使用基因组和转录组组装以及来自南极毛皮海豹(Arctocephalus gazella)的 RADseq 数据,将 PredRAD 预测与通过计算机消化获得的基因座数量的经验估计值以及从头组装进行比较。

结果

PredRAD 对转录组组装的预测结果始终高于对基因组组装的预测结果,三核苷酸和二核苷酸模型的预测频率也高于单核苷酸或 GC 含量模型。总体而言,分别应用于转录组和基因组组装的二核苷酸和三核苷酸模型生成的预测结果与通过计算机消化估计的限制酶位点数量最接近。此外,从头组装的 RAD 基因座映射到限制酶位点的数量与基于计算机消化的预期数量相似。

结论

我们的研究表明,PredRAD 预测与 RAD 基因座数量的经验估计之间存在高度一致性。这进一步支持了 PredRAD 的实用性,同时也表明,对生物体基因组中存在的大多数 RAD 基因座进行测序和组装是可行的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/241d/6341687/5c4ee5b953f4/12864_2019_5440_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/241d/6341687/9e8206ad2499/12864_2019_5440_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/241d/6341687/722964dec246/12864_2019_5440_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/241d/6341687/080cd59b357b/12864_2019_5440_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/241d/6341687/7e2e973306d6/12864_2019_5440_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/241d/6341687/5c4ee5b953f4/12864_2019_5440_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/241d/6341687/9e8206ad2499/12864_2019_5440_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/241d/6341687/722964dec246/12864_2019_5440_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/241d/6341687/080cd59b357b/12864_2019_5440_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/241d/6341687/7e2e973306d6/12864_2019_5440_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/241d/6341687/5c4ee5b953f4/12864_2019_5440_Fig5_HTML.jpg

相似文献

1
Experimental validation of in silico predicted RAD locus frequencies using genomic resources and short read data from a model marine mammal.利用基因组资源和来自模式海洋哺乳动物的短读序列数据对 RAD 基因座频率进行计算机预测的实验验证。
BMC Genomics. 2019 Jan 22;20(1):72. doi: 10.1186/s12864-019-5440-8.
2
Predicting RAD-seq Marker Numbers across the Eukaryotic Tree of Life.预测真核生物生命之树上的RAD测序标记数量。
Genome Biol Evol. 2015 Nov 3;7(12):3207-25. doi: 10.1093/gbe/evv210.
3
RAD Sequencing and a Hybrid Antarctic Fur Seal Genome Assembly Reveal Rapidly Decaying Linkage Disequilibrium, Global Population Structure and Evidence for Inbreeding.RAD测序与一个杂交南极毛皮海狗基因组组装揭示了快速衰减的连锁不平衡、全球种群结构及近亲繁殖证据。
G3 (Bethesda). 2018 Jul 31;8(8):2709-2722. doi: 10.1534/g3.118.200171.
4
Gene discovery in the Antarctic fur seal (Arctocephalus gazella) skin transcriptome.南极软毛海豹(Arctocephalus gazella)皮肤转录组中的基因发现。
Mol Ecol Resour. 2011 Jul;11(4):703-10. doi: 10.1111/j.1755-0998.2011.02999.x. Epub 2011 Mar 16.
5
A draft fur seal genome provides insights into factors affecting SNP validation and how to mitigate them.一份海狗基因组草图为影响单核苷酸多态性(SNP)验证的因素以及如何缓解这些因素提供了见解。
Mol Ecol Resour. 2016 Jul;16(4):909-21. doi: 10.1111/1755-0998.12502. Epub 2016 Jan 20.
6
Breaking RAD: an evaluation of the utility of restriction site-associated DNA sequencing for genome scans of adaptation.打破常规:对限制性位点相关DNA测序在适应性基因组扫描中的效用评估
Mol Ecol Resour. 2017 Mar;17(2):142-152. doi: 10.1111/1755-0998.12635. Epub 2016 Dec 16.
7
Challenges and advances for transcriptome assembly in non-model species.非模式物种转录组组装面临的挑战与进展
PLoS One. 2017 Sep 20;12(9):e0185020. doi: 10.1371/journal.pone.0185020. eCollection 2017.
8
Comparative performance of transcriptome assembly methods for non-model organisms.非模式生物转录组组装方法的比较性能
BMC Genomics. 2016 Jul 27;17:523. doi: 10.1186/s12864-016-2923-8.
9
A novel approach for mining polymorphic microsatellite markers in silico.一种新的基于计算机的挖掘多态性微卫星标记的方法。
PLoS One. 2011;6(8):e23283. doi: 10.1371/journal.pone.0023283. Epub 2011 Aug 10.
10
Genetic sex assignment in wild populations using genotyping-by-sequencing data: A statistical threshold approach.利用测序数据进行基因分型的野生种群遗传性别鉴定:一种基于统计阈值的方法。
Mol Ecol Resour. 2018 Mar;18(2):179-190. doi: 10.1111/1755-0998.12767. Epub 2018 Mar 3.

引用本文的文献

1
Differentiation of European yellow rust subraces within the 'Warrior(-)' genetic group.“勇士(-)”基因组内欧洲条锈菌小种的分化
PLoS One. 2025 May 23;20(5):e0323046. doi: 10.1371/journal.pone.0323046. eCollection 2025.
2
Genetic variability of the 16S rRNA gene of Nocardia brasiliensis, the most common causative agent of actinomycetoma in Latin America and the Caribbean.巴西奴卡菌 16S rRNA 基因的遗传变异性,其为拉丁美洲和加勒比地区导致放线菌病的最常见病原体。
Rev Inst Med Trop Sao Paulo. 2023 Apr 14;65:e31. doi: 10.1590/S1678-9946202365031. eCollection 2023.
3
Reduced metagenome sequencing for strain-resolution taxonomic profiles.

本文引用的文献

1
RAD Sequencing and a Hybrid Antarctic Fur Seal Genome Assembly Reveal Rapidly Decaying Linkage Disequilibrium, Global Population Structure and Evidence for Inbreeding.RAD测序与一个杂交南极毛皮海狗基因组组装揭示了快速衰减的连锁不平衡、全球种群结构及近亲繁殖证据。
G3 (Bethesda). 2018 Jul 31;8(8):2709-2722. doi: 10.1534/g3.118.200171.
2
Developing genome-wide SNPs and constructing an ultrahigh-density linkage map in oil palm.在油棕中开发全基因组 SNPs 并构建超高密度连锁图谱。
Sci Rep. 2018 Jan 12;8(1):691. doi: 10.1038/s41598-017-18613-2.
3
Population genomic footprints of host adaptation, introgression and recombination in coffee leaf rust.
减少宏基因组测序以获得基于菌株分辨率的分类特征。
Microbiome. 2021 Mar 29;9(1):79. doi: 10.1186/s40168-021-01019-8.
咖啡叶锈菌宿主适应性、渐渗和重组的群体基因组足迹。
Mol Plant Pathol. 2018 Jul;19(7):1742-1753. doi: 10.1111/mpp.12657. Epub 2018 Feb 22.
4
De novo transcriptome assembly, annotation and comparison of four ecological and evolutionary model salmonid fish species.从头组装、注释和比较四种生态和进化模式的鲑鱼科鱼类的转录组。
BMC Genomics. 2018 Jan 8;19(1):32. doi: 10.1186/s12864-017-4379-x.
5
RAD sequencing resolves fine-scale population structure in a benthic invertebrate: implications for understanding phenotypic plasticity.RAD测序解析了一种底栖无脊椎动物的精细种群结构:对理解表型可塑性的启示
R Soc Open Sci. 2017 Feb 8;4(2):160548. doi: 10.1098/rsos.160548. eCollection 2017 Feb.
6
Transcriptomic SNP discovery for custom genotyping arrays: impacts of sequence data, SNP calling method and genotyping technology on the probability of validation success.用于定制基因分型阵列的转录组SNP发现:序列数据、SNP检测方法和基因分型技术对验证成功概率的影响。
BMC Res Notes. 2016 Aug 26;9(1):418. doi: 10.1186/s13104-016-2209-x.
7
Next-generation biology: Sequencing and data analysis approaches for non-model organisms.下一代生物学:非模式生物的测序与数据分析方法
Mar Genomics. 2016 Dec;30:3-13. doi: 10.1016/j.margen.2016.04.012. Epub 2016 May 13.
8
Characterization of the mantle transcriptome in bivalves: Pecten maximus, Mytilus edulis and Crassostrea gigas.双壳贝类外套膜转录组的特征分析:大扇贝、紫贻贝和太平洋牡蛎。
Mar Genomics. 2016 Jun;27:9-15. doi: 10.1016/j.margen.2016.04.003. Epub 2016 May 7.
9
Harnessing the power of RADseq for ecological and evolutionary genomics.利用RADseq技术助力生态与进化基因组学研究
Nat Rev Genet. 2016 Feb;17(2):81-92. doi: 10.1038/nrg.2015.28. Epub 2016 Jan 5.
10
Predicting RAD-seq Marker Numbers across the Eukaryotic Tree of Life.预测真核生物生命之树上的RAD测序标记数量。
Genome Biol Evol. 2015 Nov 3;7(12):3207-25. doi: 10.1093/gbe/evv210.