• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于对来自多个个体的重叠双末端RAD reads进行本地从头组装的优化方法。

An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individuals.

作者信息

Li Yu-Long, Xue Dong-Xiu, Zhang Bai-Dong, Liu Jin-Xian

机构信息

CAS Key Laboratory of Marine Ecology and Environmental Sciences, Institute of Oceanology, Chinese Academy of Sciences, 7 Nanhai Road, Qingdao 266071, Shandong, People's Republic of China.

Laboratory for Marine Ecology and Environmental Science, Qingdao National Laboratory for Marine Science and Technology, Qingdao 266071, People's Republic of China.

出版信息

R Soc Open Sci. 2018 Feb 28;5(2):171589. doi: 10.1098/rsos.171589. eCollection 2018 Feb.

DOI:10.1098/rsos.171589
PMID:29515871
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5830760/
Abstract

Restriction site-associated DNA (RAD) sequencing is revolutionizing studies in ecological, evolutionary and conservation genomics. However, the assembly of paired-end RAD reads with random-sheared ends is still challenging, especially for non-model species with high genetic variance. Here, we present an efficient optimized approach with a pipeline software, RADassembler, which makes full use of paired-end RAD reads with random-sheared ends from multiple individuals to assemble RAD contigs. RADassembler integrates the algorithms for choosing the optimal number of mismatches within and across individuals at the clustering stage, and then uses a two-step assembly approach at the assembly stage. RADassembler also uses data reduction and parallelization strategies to promote efficiency. Compared to other tools, both the assembly results based on simulation and real RAD datasets demonstrated that RADassembler could always assemble the appropriate number of contigs with high qualities, and more read pairs were properly mapped to the assembled contigs. This approach provides an optimal tool for dealing with the complexity in the assembly of paired-end RAD reads with random-sheared ends for non-model species in ecological, evolutionary and conservation studies. RADassembler is available at https://github.com/lyl8086/RADscripts.

摘要

限制性内切酶位点相关DNA(RAD)测序正在彻底改变生态、进化和保护基因组学的研究。然而,具有随机剪切末端的双末端RAD读段的组装仍然具有挑战性,尤其是对于具有高遗传变异的非模式物种。在这里,我们提出了一种高效的优化方法,并附带一个管道软件RADassembler,它充分利用来自多个个体的具有随机剪切末端的双末端RAD读段来组装RAD重叠群。RADassembler在聚类阶段集成了用于选择个体内部和个体之间最佳错配数的算法,然后在组装阶段使用两步组装方法。RADassembler还使用数据缩减和并行化策略来提高效率。与其他工具相比,基于模拟和真实RAD数据集的组装结果都表明,RADassembler总能高质量地组装出合适数量的重叠群,并且有更多的读段对被正确地映射到组装好的重叠群上。这种方法为生态、进化和保护研究中的非模式物种处理具有随机剪切末端的双末端RAD读段组装中的复杂性提供了一个最佳工具。RADassembler可在https://github.com/lyl8086/RADscripts上获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/baad/5830760/80a23b11cb33/rsos171589-g5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/baad/5830760/391cf1f684c3/rsos171589-g1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/baad/5830760/5212a25103a7/rsos171589-g2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/baad/5830760/80309ffb6c25/rsos171589-g3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/baad/5830760/03fc6c42696a/rsos171589-g4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/baad/5830760/80a23b11cb33/rsos171589-g5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/baad/5830760/391cf1f684c3/rsos171589-g1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/baad/5830760/5212a25103a7/rsos171589-g2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/baad/5830760/80309ffb6c25/rsos171589-g3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/baad/5830760/03fc6c42696a/rsos171589-g4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/baad/5830760/80a23b11cb33/rsos171589-g5.jpg

相似文献

1
An optimized approach for local de novo assembly of overlapping paired-end RAD reads from multiple individuals.一种用于对来自多个个体的重叠双末端RAD reads进行本地从头组装的优化方法。
R Soc Open Sci. 2018 Feb 28;5(2):171589. doi: 10.1098/rsos.171589. eCollection 2018 Feb.
2
Rainbow: an integrated tool for efficient clustering and assembling RAD-seq reads.Rainbow:一种用于高效聚类和组装 RAD-seq 读取的集成工具。
Bioinformatics. 2012 Nov 1;28(21):2732-7. doi: 10.1093/bioinformatics/bts482. Epub 2012 Sep 1.
3
A rapid and cost-effective approach for the development of polymorphic microsatellites in non-model species using paired-end RAD sequencing.一种利用双末端RAD测序在非模式物种中开发多态微卫星的快速且经济高效的方法。
Mol Genet Genomics. 2017 Oct;292(5):1165-1174. doi: 10.1007/s00438-017-1337-x. Epub 2017 Jun 20.
4
Local de novo assembly of RAD paired-end contigs using short sequencing reads.使用短测序读长进行 RAD 配对末端 contigs 的本地从头组装。
PLoS One. 2011 Apr 13;6(4):e18561. doi: 10.1371/journal.pone.0018561.
5
Index suffix-prefix overlaps by (w, k)-minimizer to generate long contigs for reads compression.通过 (w, k)-最小化子索引后缀-前缀重叠来生成用于读取压缩的长连续体。
Bioinformatics. 2019 Jun 1;35(12):2066-2074. doi: 10.1093/bioinformatics/bty936.
6
Stacks 2: Analytical methods for paired-end sequencing improve RADseq-based population genomics.Stacks 2:用于双端测序的分析方法改进了基于 RADseq 的群体基因组学。
Mol Ecol. 2019 Nov;28(21):4737-4754. doi: 10.1111/mec.15253. Epub 2019 Oct 17.
7
ELOPER: elongation of paired-end reads as a pre-processing tool for improved de novo genome assembly.ELOPER: 作为一种提高从头基因组组装质量的预处理工具,使用配对末端 reads 的延伸。
Bioinformatics. 2013 Jun 1;29(11):1455-7. doi: 10.1093/bioinformatics/btt169. Epub 2013 Apr 19.
8
NucBreak: location of structural errors in a genome assembly by using paired-end Illumina reads.NucBreak:利用 Illumina 配对末端读取来定位基因组组装中的结构错误。
BMC Bioinformatics. 2020 Feb 21;21(1):66. doi: 10.1186/s12859-020-3414-0.
9
NeatFreq: reference-free data reduction and coverage normalization for De Novo sequence assembly.NeatFreq:用于从头序列组装的无参考数据缩减和覆盖度归一化
BMC Bioinformatics. 2014 Nov 19;15(1):357. doi: 10.1186/s12859-014-0357-3.
10
RAD paired-end sequencing for local de novo assembly and SNP discovery in non-model organisms.用于非模式生物中本地从头组装和单核苷酸多态性发现的RAD双端测序。
Methods Mol Biol. 2012;888:135-51. doi: 10.1007/978-1-61779-870-2_9.

引用本文的文献

1
Genetic Analysis of Plant Pathogens Natural Populations.植物病原体自然种群的遗传分析
Methods Mol Biol. 2022;2536:405-422. doi: 10.1007/978-1-0716-2517-0_23.
2
Genome assembly using quantum and quantum-inspired annealing.基于量子和量子启发式退火的基因组组装。
Sci Rep. 2021 Jun 23;11(1):13183. doi: 10.1038/s41598-021-88321-5.
3
RAD genotyping reveals fine-scale population structure and provides evidence for adaptive divergence in a commercially important fish from the northwestern Pacific Ocean.RAD基因分型揭示了精细尺度的种群结构,并为西北太平洋一种具有商业重要性的鱼类的适应性分化提供了证据。

本文引用的文献

1
Harnessing the power of RADseq for ecological and evolutionary genomics.利用RADseq技术助力生态与进化基因组学研究
Nat Rev Genet. 2016 Feb;17(2):81-92. doi: 10.1038/nrg.2015.28. Epub 2016 Jan 5.
2
Development and preliminary evaluation of a genomewide single nucleotide polymorphisms resource generated by RAD-seq for the small yellow croaker (Larimichthys polyactis).利用 RAD-seq 技术开发并初步评估小黄鱼(Larimichthys polyactis)全基因组单核苷酸多态性资源。
Mol Ecol Resour. 2016 May;16(3):755-68. doi: 10.1111/1755-0998.12476. Epub 2015 Oct 29.
3
Similarity thresholds used in DNA sequence assembly from short reads can reduce the comparability of population histories across species.
PeerJ. 2019 Jul 3;7:e7242. doi: 10.7717/peerj.7242. eCollection 2019.
4
Population Genomic Signatures of Genetic Structure and Environmental Selection in the Catadromous Roughskin Sculpin Trachidermus fasciatus.群体基因组揭示有溯洄特性的纹缟杜父鱼遗传结构和环境选择的特征
Genome Biol Evol. 2019 Jul 1;11(7):1751-1764. doi: 10.1093/gbe/evz118.
用于从短读段进行DNA序列组装的相似性阈值可能会降低不同物种群体历史的可比性。
PeerJ. 2015 Apr 21;3:e895. doi: 10.7717/peerj.895. eCollection 2015.
4
Linkage mapping with paralogs exposes regions of residual tetrasomic inheritance in chum salmon (Oncorhynchus keta).利用旁系同源基因进行连锁图谱分析揭示了大麻哈鱼(Oncorhynchus keta)中残余四体遗传的区域。
Mol Ecol Resour. 2016 Jan;16(1):17-28. doi: 10.1111/1755-0998.12394. Epub 2015 Mar 11.
5
Illumina TruSeq synthetic long-reads empower de novo assembly and resolve complex, highly-repetitive transposable elements.Illumina TruSeq合成长读段技术助力从头组装,并解析复杂的、高度重复的转座元件。
PLoS One. 2014 Sep 4;9(9):e106689. doi: 10.1371/journal.pone.0106689. eCollection 2014.
6
Defining loci in restriction-based reduced representation genomic data from nonmodel species: sources of bias and diagnostics for optimal clustering.在非模式物种基于限制性的简化基因组数据中定义基因座:偏差来源及优化聚类的诊断方法
Biomed Res Int. 2014;2014:675158. doi: 10.1155/2014/675158. Epub 2014 Jun 25.
7
dDocent: a RADseq, variant-calling pipeline designed for population genomics of non-model organisms.dDocent:一种 RADseq 变体调用管道,专为非模式生物的群体基因组学设计。
PeerJ. 2014 Jun 10;2:e431. doi: 10.7717/peerj.431. eCollection 2014.
8
Genome-wide single-generation signatures of local selection in the panmictic European eel.泛在性的欧洲鳗中局部选择的全基因组单世代特征。
Mol Ecol. 2014 May;23(10):2514-28. doi: 10.1111/mec.12753.
9
PyRAD: assembly of de novo RADseq loci for phylogenetic analyses.PyRAD:用于系统发育分析的从头RADseq位点组装
Bioinformatics. 2014 Jul 1;30(13):1844-9. doi: 10.1093/bioinformatics/btu121. Epub 2014 Mar 5.
10
High-throughput DNA sequencing errors are reduced by orders of magnitude using circle sequencing.高通量 DNA 测序错误可通过环测序降低数量级。
Proc Natl Acad Sci U S A. 2013 Dec 3;110(49):19872-7. doi: 10.1073/pnas.1319590110. Epub 2013 Nov 15.