• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过大量平行测序 Fosmid 池对大型和复杂基因组进行高效从头组装。

Efficient de novo assembly of large and complex genomes by massively parallel sequencing of Fosmid pools.

机构信息

School of Biotechnology, Science for Life Laboratory, KTH Royal Institute of Technology, Box 1031, 171 21 Solna, Sweden.

出版信息

BMC Genomics. 2014 Jun 6;15(1):439. doi: 10.1186/1471-2164-15-439.

DOI:10.1186/1471-2164-15-439
PMID:24906298
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4070561/
Abstract

BACKGROUND

Sampling genomes with Fosmid vectors and sequencing of pooled Fosmid libraries on the Illumina platform for massive parallel sequencing is a novel and promising approach to optimizing the trade-off between sequencing costs and assembly quality.

RESULTS

In order to sequence the genome of Norway spruce, which is of great size and complexity, we developed and applied a new technology based on the massive production, sequencing, and assembly of Fosmid pools (FP). The spruce chromosomes were sampled with ~40,000 bp Fosmid inserts to obtain around two-fold genome coverage, in parallel with traditional whole genome shotgun sequencing (WGS) of haploid and diploid genomes. Compared to the WGS results, the contiguity and quality of the FP assemblies were high, and they allowed us to fill WGS gaps resulting from repeats, low coverage, and allelic differences. The FP contig sets were further merged with WGS data using a novel software package GAM-NGS.

CONCLUSIONS

By exploiting FP technology, the first published assembly of a conifer genome was sequenced entirely with massively parallel sequencing. Here we provide a comprehensive report on the different features of the approach and the optimization of the process.We have made public the input data (FASTQ format) for the set of pools used in this study:ftp://congenie.org/congenie/Nystedt_2013/Assembly/ProcessedData/FosmidPools/.(alternatively accessible via http://congenie.org/downloads).The software used for running the assembly process is available at http://research.scilifelab.se/andrej_alexeyenko/downloads/fpools/.

摘要

背景

使用 Fosmid 载体对基因组进行抽样,并在 Illumina 平台上对汇集的 Fosmid 文库进行大规模平行测序,这是一种优化测序成本和组装质量之间权衡的新方法。

结果

为了对挪威云杉这一大而复杂的基因组进行测序,我们开发并应用了一种基于大规模生产、测序和 Fosmid 池(FP)组装的新技术。用约 40000 bp 的 Fosmid 插入物对云杉染色体进行抽样,以获得大约两倍的基因组覆盖度,同时对单倍体和二倍体基因组进行传统的全基因组鸟枪法测序(WGS)。与 WGS 结果相比,FP 组装的连续性和质量都很高,它们允许我们填补由于重复、低覆盖率和等位基因差异导致的 WGS 缺口。进一步使用一种新的软件包 GAM-NGS 将 FP 拼接集与 WGS 数据进行合并。

结论

通过利用 FP 技术,首次对针叶树基因组进行了完全的大规模平行测序组装。在此,我们全面报告了该方法的不同特点和流程优化。我们已经公开了本研究中使用的一组汇集池的输入数据(FASTQ 格式):ftp://congenie.org/congenie/Nystedt_2013/Assembly/ProcessedData/FosmidPools/。(也可通过 http://congenie.org/downloads 访问)。用于运行组装过程的软件可在 http://research.scilifelab.se/andrej_alexeyenko/downloads/fpools/ 获得。

相似文献

1
Efficient de novo assembly of large and complex genomes by massively parallel sequencing of Fosmid pools.通过大量平行测序 Fosmid 池对大型和复杂基因组进行高效从头组装。
BMC Genomics. 2014 Jun 6;15(1):439. doi: 10.1186/1471-2164-15-439.
2
A Fosmid Pool-Based Next Generation Sequencing Approach to Haplotype-Resolve Whole Genomes.一种基于Fosmid文库的新一代测序方法用于单倍型解析全基因组。
Methods Mol Biol. 2017;1551:223-269. doi: 10.1007/978-1-4939-6750-6_13.
3
Assembling the 20 Gb white spruce (Picea glauca) genome from whole-genome shotgun sequencing data.利用全基因组鸟枪法测序数据组装 20 Gb 白云杉(Picea glauca)基因组。
Bioinformatics. 2013 Jun 15;29(12):1492-7. doi: 10.1093/bioinformatics/btt178. Epub 2013 May 22.
4
Targeted isolation, sequence assembly and characterization of two white spruce (Picea glauca) BAC clones for terpenoid synthase and cytochrome P450 genes involved in conifer defence reveal insights into a conifer genome.针对参与针叶树防御的萜类合酶和细胞色素P450基因,对两个白云杉(Picea glauca)BAC克隆进行靶向分离、序列组装和表征,揭示了针叶树基因组的相关见解。
BMC Plant Biol. 2009 Aug 6;9:106. doi: 10.1186/1471-2229-9-106.
5
LR_Gapcloser: a tiling path-based gap closer that uses long reads to complete genome assembly.LR_Gapcloser:一种基于平铺路径的缺口闭合器,它使用长读长来完成基因组组装。
Gigascience. 2019 Jan 1;8(1):giy157. doi: 10.1093/gigascience/giy157.
6
Sequencing of 6.7 Mb of the melon genome using a BAC pooling strategy.利用 BAC 池策略对甜瓜基因组的 6.7 Mb 进行测序。
BMC Plant Biol. 2010 Nov 12;10:246. doi: 10.1186/1471-2229-10-246.
7
Development of a highly efficient 50K single nucleotide polymorphism genotyping array for the large and complex genome of Norway spruce (Picea abies L. Karst) by whole genome resequencing and its transferability to other spruce species.利用全基因组重测序开发高效的 50K 单核苷酸多态性基因分型芯片用于挪威云杉(Picea abies L. Karst)这个庞大而复杂的基因组及其向其他云杉物种的可转移性。
Mol Ecol Resour. 2021 Apr;21(3):880-896. doi: 10.1111/1755-0998.13292. Epub 2020 Dec 2.
8
Purge Haplotigs: allelic contig reassignment for third-gen diploid genome assemblies.清除单倍型:三代二倍体基因组组装的等位基因 contig 重新分配。
BMC Bioinformatics. 2018 Nov 29;19(1):460. doi: 10.1186/s12859-018-2485-7.
9
Paired-end sequencing of Fosmid libraries by Illumina.Illumina 对 Fosmid 文库进行双端测序。
Genome Res. 2012 Nov;22(11):2241-9. doi: 10.1101/gr.138925.112. Epub 2012 Jul 16.
10
Assembly of the Complete Sitka Spruce Chloroplast Genome Using 10X Genomics' GemCode Sequencing Data.利用10X基因组学的GemCode测序数据组装完整的西加云杉叶绿体基因组
PLoS One. 2016 Sep 15;11(9):e0163059. doi: 10.1371/journal.pone.0163059. eCollection 2016.

引用本文的文献

1
Deciphering molecular landscape of breast cancer progression and insights from functional genomics and therapeutic explorations followed by in vitro validation.解析乳腺癌进展的分子特征,并通过体外验证对功能基因组学和治疗探索的深入了解。
Sci Rep. 2024 Nov 20;14(1):28794. doi: 10.1038/s41598-024-80455-6.
2
Pharmacogenomics: A Genetic Approach to Drug Development and Therapy.药物基因组学:药物研发与治疗的遗传学方法。
Pharmaceuticals (Basel). 2024 Jul 13;17(7):940. doi: 10.3390/ph17070940.
3
From Pine Cones to Read Clouds: Rescaffolding the Megagenome of Sugar Pine ().

本文引用的文献

1
GAM-NGS: genomic assemblies merger for next generation sequencing.GAM-NGS:用于下一代测序的基因组组装合并。
BMC Bioinformatics. 2013;14 Suppl 7(Suppl 7):S6. doi: 10.1186/1471-2105-14-S7-S6. Epub 2013 Apr 22.
2
The Norway spruce genome sequence and conifer genome evolution.挪威云杉基因组序列与针叶树基因组进化。
Nature. 2013 May 30;497(7451):579-84. doi: 10.1038/nature12211. Epub 2013 May 22.
3
F conjugation: back to the beginning.F 构型:回到起点。
从松果到读云:重新构建糖松的超级基因组()。
G3 (Bethesda). 2017 May 5;7(5):1563-1568. doi: 10.1534/g3.117.040055.
4
Signatures of Crested Ibis MHC Revealed by Recombination Screening and Short-Reads Assembly Strategy.通过重组筛选和短读长组装策略揭示朱鹮主要组织相容性复合体的特征
PLoS One. 2016 Dec 20;11(12):e0168744. doi: 10.1371/journal.pone.0168744. eCollection 2016.
5
The Genome 10K Project: a way forward.基因组 10K 项目:前进之路。
Annu Rev Anim Biosci. 2015;3:57-111. doi: 10.1146/annurev-animal-090414-014900.
Plasmid. 2013 Jul;70(1):18-32. doi: 10.1016/j.plasmid.2013.03.010. Epub 2013 Apr 28.
4
The oyster genome reveals stress adaptation and complexity of shell formation.牡蛎基因组揭示了其对压力的适应能力和贝壳形成的复杂性。
Nature. 2012 Oct 4;490(7418):49-54. doi: 10.1038/nature11413. Epub 2012 Sep 19.
5
Improved gap size estimation for scaffolding algorithms.改进支架算法的缺口大小估计。
Bioinformatics. 2012 Sep 1;28(17):2215-22. doi: 10.1093/bioinformatics/bts441.
6
Paired-end sequencing of Fosmid libraries by Illumina.Illumina 对 Fosmid 文库进行双端测序。
Genome Res. 2012 Nov;22(11):2241-9. doi: 10.1101/gr.138925.112. Epub 2012 Jul 16.
7
Feature-by-feature--evaluating de novo sequence assembly.逐特征评估从头序列组装。
PLoS One. 2012;7(2):e31002. doi: 10.1371/journal.pone.0031002. Epub 2012 Feb 3.
8
Haplotype-resolved genome sequencing of a Gujarati Indian individual.单体型解析的古吉拉特邦印度个体基因组测序。
Nat Biotechnol. 2011 Jan;29(1):59-63. doi: 10.1038/nbt.1740. Epub 2010 Dec 19.
9
The Sequence Alignment/Map format and SAMtools.序列比对/映射格式和 SAMtools。
Bioinformatics. 2009 Aug 15;25(16):2078-9. doi: 10.1093/bioinformatics/btp352. Epub 2009 Jun 8.
10
Clone-based systematic haplotyping (CSH): a procedure for physical haplotyping of whole genomes.基于克隆的系统单倍型分型(CSH):一种对全基因组进行物理单倍型分型的方法。
Genome Res. 2003 Dec;13(12):2717-24. doi: 10.1101/gr.1442303.