• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用第二代高通量测序技术在未测序基因组中进行大规模单核苷酸多态性发现:应用于火鸡。

Large scale single nucleotide polymorphism discovery in unsequenced genomes using second generation high throughput sequencing technology: applied to turkey.

作者信息

Kerstens Hindrik H D, Crooijmans Richard P M A, Veenendaal Albertine, Dibbits Bert W, Chin-A-Woeng Thomas F C, den Dunnen Johan T, Groenen Martien A M

机构信息

Animal Breeding and Genomics Center, Wageningen University, Marijkeweg 40, Wageningen, 6709 PG, the Netherlands.

出版信息

BMC Genomics. 2009 Oct 16;10:479. doi: 10.1186/1471-2164-10-479.

DOI:10.1186/1471-2164-10-479
PMID:19835600
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2772860/
Abstract

BACKGROUND

The development of second generation sequencing methods has enabled large scale DNA variation studies at moderate cost. For the high throughput discovery of single nucleotide polymorphisms (SNPs) in species lacking a sequenced reference genome, we set-up an analysis pipeline based on a short read de novo sequence assembler and a program designed to identify variation within short reads. To illustrate the potential of this technique, we present the results obtained with a randomly sheared, enzymatically generated, 2-3 kbp genome fraction of six pooled Meleagris gallopavo (turkey) individuals.

RESULTS

A total of 100 million 36 bp reads were generated, representing approximately 5-6% (approximately 62 Mbp) of the turkey genome, with an estimated sequence depth of 58. Reads consisting of bases called with less than 1% error probability were selected and assembled into contigs. Subsequently, high throughput discovery of nucleotide variation was performed using sequences with more than 90% reliability by using the assembled contigs that were 50 bp or longer as the reference sequence. We identified more than 7,500 SNPs with a high probability of representing true nucleotide variation in turkeys. Increasing the reference genome by adding publicly available turkey BAC-end sequences increased the number of SNPs to over 11,000. A comparison with the sequenced chicken genome indicated that the assembled turkey contigs were distributed uniformly across the turkey genome. Genotyping of a representative sample of 340 SNPs resulted in a SNP conversion rate of 95%. The correlation of the minor allele count (MAC) and observed minor allele frequency (MAF) for the validated SNPs was 0.69.

CONCLUSION

We provide an efficient and cost-effective approach for the identification of thousands of high quality SNPs in species currently lacking a sequenced genome and applied this to turkey. The methodology addresses a random fraction of the genome, resulting in an even distribution of SNPs across the targeted genome.

摘要

背景

第二代测序方法的发展使得大规模DNA变异研究能够以适中的成本进行。为了在缺乏已测序参考基因组的物种中高通量发现单核苷酸多态性(SNP),我们基于短读长从头序列组装器和一个用于识别短读长内变异的程序建立了一个分析流程。为了说明该技术的潜力,我们展示了对六个混合的火鸡个体随机剪切、酶促产生的2 - 3kbp基因组片段进行分析所获得的结果。

结果

共产生了1亿条36bp的读段,约占火鸡基因组的5 - 6%(约62Mbp),估计序列深度为58。选择错误概率小于1%的碱基组成的读段并组装成重叠群。随后,使用可靠性超过90%的序列,以50bp或更长的组装重叠群作为参考序列,进行核苷酸变异的高通量发现。我们鉴定出超过7500个SNP,它们极有可能代表火鸡中的真实核苷酸变异。通过添加公开可用的火鸡BAC末端序列来增加参考基因组,使SNP数量增加到超过11000个。与已测序的鸡基因组进行比较表明,组装的火鸡重叠群在火鸡基因组中均匀分布。对340个SNP的代表性样本进行基因分型,SNP转化率为95%。验证后的SNP的次要等位基因计数(MAC)与观察到的次要等位基因频率(MAF)的相关性为0.69。

结论

我们提供了一种高效且经济有效的方法,用于在目前缺乏已测序基因组的物种中鉴定数千个高质量SNP,并将其应用于火鸡。该方法针对基因组的随机部分,导致SNP在目标基因组中均匀分布。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c28a/2772860/e92721b69669/1471-2164-10-479-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c28a/2772860/6458d4674fd4/1471-2164-10-479-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c28a/2772860/ca83c0d4cd1b/1471-2164-10-479-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c28a/2772860/e92721b69669/1471-2164-10-479-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c28a/2772860/6458d4674fd4/1471-2164-10-479-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c28a/2772860/ca83c0d4cd1b/1471-2164-10-479-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c28a/2772860/e92721b69669/1471-2164-10-479-3.jpg

相似文献

1
Large scale single nucleotide polymorphism discovery in unsequenced genomes using second generation high throughput sequencing technology: applied to turkey.利用第二代高通量测序技术在未测序基因组中进行大规模单核苷酸多态性发现:应用于火鸡。
BMC Genomics. 2009 Oct 16;10:479. doi: 10.1186/1471-2164-10-479.
2
SNP discovery in swine by reduced representation and high throughput pyrosequencing.通过简化基因组和高通量焦磷酸测序技术在猪中发现单核苷酸多态性
BMC Genet. 2008 Dec 4;9:81. doi: 10.1186/1471-2156-9-81.
3
Whole genome SNP discovery and analysis of genetic diversity in Turkey (Meleagris gallopavo).土耳其火鸡(Meleagris gallopavo)全基因组 SNP 发现与遗传多样性分析。
BMC Genomics. 2012 Aug 14;13:391. doi: 10.1186/1471-2164-13-391.
4
Application of massive parallel sequencing to whole genome SNP discovery in the porcine genome.大规模平行测序在猪基因组全基因组单核苷酸多态性发现中的应用。
BMC Genomics. 2009 Aug 12;10:374. doi: 10.1186/1471-2164-10-374.
5
Survey of a cDNA library from the turkey (Meleagris gallopavo).火鸡(吐绶鸡)cDNA文库的研究
Genome. 2005 Feb;48(1):12-7. doi: 10.1139/g04-088.
6
Genome-wide SNP discovery in walnut with an AGSNP pipeline updated for SNP discovery in allogamous organisms.利用 AGSNP 管道进行全基因组 SNP 发现,该管道经过更新,可用于发现所有杂交生物中的 SNP。
BMC Genomics. 2012 Jul 31;13:354. doi: 10.1186/1471-2164-13-354.
7
A comparative physical map reveals the pattern of chromosomal evolution between the turkey (Meleagris gallopavo) and chicken (Gallus gallus) genomes.比较物理图谱揭示了火鸡(Meleagris gallopavo)和鸡(Gallus gallus)基因组之间的染色体进化模式。
BMC Genomics. 2011 Sep 9;12:447. doi: 10.1186/1471-2164-12-447.
8
The development and characterization of a 60K SNP chip for chicken.鸡的 60K SNP 芯片的开发与特性分析。
BMC Genomics. 2011 May 31;12(1):274. doi: 10.1186/1471-2164-12-274.
9
Development and characterization of expressed sequence tags for the turkey (Meleagris gallopavo) genome and comparative sequence analysis with other birds.火鸡(吐绶鸡)基因组表达序列标签的开发与特性分析以及与其他鸟类的比较序列分析
Anim Genet. 2000 Feb;31(1):62-7. doi: 10.1046/j.1365-2052.2000.00590.x.
10
Annotation-based genome-wide SNP discovery in the large and complex Aegilops tauschii genome using next-generation sequencing without a reference genome sequence.基于注释的全基因组 SNP 发现利用下一代测序技术在没有参考基因组序列的情况下在大型复杂的粗山羊草基因组中
BMC Genomics. 2011 Jan 25;12:59. doi: 10.1186/1471-2164-12-59.

引用本文的文献

1
Genome Variation Map of Domestic Qinghai-Tibet Plateau Yaks by SLAF-Seq Reveals Genetic Footprint during Artificial Selection.基于SLAF-Seq技术的青藏高原家牦牛基因组变异图谱揭示人工选择过程中的遗传印记
Animals (Basel). 2023 Sep 19;13(18):2963. doi: 10.3390/ani13182963.
2
Heterozygous Single Nucleotide Polymorphic Loci in Haploid Gametophytes of (Rhodophyta).红藻单倍体配子体中的杂合单核苷酸多态性位点
Front Genet. 2019 Dec 6;10:1256. doi: 10.3389/fgene.2019.01256. eCollection 2019.
3
Genome-Wide SNP Discovery and Analysis of Genetic Diversity in Farmed Sika Deer () in Northeast China Using Double-Digest Restriction Site-Associated DNA Sequencing.

本文引用的文献

1
Sequence variants at the TERT-CLPTM1L locus associate with many cancer types.端粒酶逆转录酶(TERT)-跨膜蛋白1样蛋白(CLPTM1L)基因座处的序列变异与多种癌症类型相关。
Nat Genet. 2009 Feb;41(2):221-7. doi: 10.1038/ng.296. Epub 2009 Jan 18.
2
Genome-wide association study for early-onset and morbid adult obesity identifies three new risk loci in European populations.针对早发性和病态成人肥胖的全基因组关联研究在欧洲人群中发现了三个新的风险基因座。
Nat Genet. 2009 Feb;41(2):157-9. doi: 10.1038/ng.301. Epub 2009 Jan 18.
3
De novo assembly of the Pseudomonas syringae pv. syringae B728a genome using Illumina/Solexa short sequence reads.
利用双酶切简化基因组测序技术对中国东北养殖梅花鹿进行全基因组SNP发掘及遗传多样性分析
G3 (Bethesda). 2017 Sep 7;7(9):3169-3176. doi: 10.1534/g3.117.300082.
4
Identification of Sex-determining Loci in Pacific White Shrimp Litopeneaus vannamei Using Linkage and Association Analysis.利用连锁和关联分析鉴定凡纳滨对虾性别决定位点
Mar Biotechnol (NY). 2017 Jun;19(3):277-286. doi: 10.1007/s10126-017-9749-5. Epub 2017 May 16.
5
Population-level consequences of complementary sex determination in a solitary parasitoid.一种独居寄生蜂中互补性性别决定的种群水平后果。
BMC Evol Biol. 2015 May 30;15:98. doi: 10.1186/s12862-015-0340-2.
6
Genome-wide candidate regions for selective sweeps revealed through massive parallel sequencing of DNA across ten turkey populations.通过对十个火鸡群体的DNA进行大规模平行测序揭示的全基因组选择性清除候选区域。
BMC Genet. 2014 Nov 25;15:117. doi: 10.1186/s12863-014-0117-4.
7
Single nucleotide polymorphism analysis of Korean native chickens using next generation sequencing data.利用下一代测序数据对韩国本土鸡进行单核苷酸多态性分析。
Mol Biol Rep. 2015 Feb;42(2):471-7. doi: 10.1007/s11033-014-3790-5. Epub 2014 Oct 11.
8
All-Food-Seq (AFS): a quantifiable screen for species in biological samples by deep DNA sequencing.全食物测序(AFS):通过深度DNA测序对生物样本中的物种进行可量化筛选。
BMC Genomics. 2014 Jul 31;15(1):639. doi: 10.1186/1471-2164-15-639.
9
Identification and characterization of nucleotide variations in the genome of Ziziphus jujuba (Rhamnaceae) by next generation sequencing.利用二代测序技术鉴定和表征枣(鼠李科)基因组中的核苷酸变异
Mol Biol Rep. 2014 May;41(5):3219-23. doi: 10.1007/s11033-014-3184-8. Epub 2014 Jan 30.
10
Generation of SNP datasets for orangutan population genomics using improved reduced-representation sequencing and direct comparisons of SNP calling algorithms.利用改良的简化代表性测序和 SNP 调用算法的直接比较,生成猩猩群体基因组学的 SNP 数据集。
BMC Genomics. 2014 Jan 10;15:16. doi: 10.1186/1471-2164-15-16.
利用Illumina/Solexa短序列 reads 对丁香假单胞菌丁香致病变种B728a基因组进行从头组装。
FEMS Microbiol Lett. 2009 Feb;291(1):103-11. doi: 10.1111/j.1574-6968.2008.01441.x. Epub 2008 Dec 9.
4
The YH database: the first Asian diploid genome database.YH数据库:首个亚洲二倍体基因组数据库。
Nucleic Acids Res. 2009 Jan;37(Database issue):D1025-8. doi: 10.1093/nar/gkn966.
5
SNP discovery in swine by reduced representation and high throughput pyrosequencing.通过简化基因组和高通量焦磷酸测序技术在猪中发现单核苷酸多态性
BMC Genet. 2008 Dec 4;9:81. doi: 10.1186/1471-2156-9-81.
6
Mapping short DNA sequencing reads and calling variants using mapping quality scores.使用比对质量分数比对短DNA测序读数并识别变异。
Genome Res. 2008 Nov;18(11):1851-8. doi: 10.1101/gr.078212.108. Epub 2008 Aug 19.
7
The new paradigm of flow cell sequencing.流动槽测序的新范式。
Genome Res. 2008 Jun;18(6):839-46. doi: 10.1101/gr.073262.107.
8
Association and in silico assignment of sequences from turkey BACs.来自火鸡细菌人工染色体(BACs)序列的关联及计算机分析
Anim Biotechnol. 2008;19(2):80-3. doi: 10.1080/10495390701876209.
9
Whole genome comparative studies between chicken and turkey and their implications for avian genome evolution.鸡和火鸡的全基因组比较研究及其对鸟类基因组进化的意义。
BMC Genomics. 2008 Apr 14;9:168. doi: 10.1186/1471-2164-9-168.
10
Velvet: algorithms for de novo short read assembly using de Bruijn graphs.《天鹅绒:使用德布鲁因图进行从头短读长拼接的算法》
Genome Res. 2008 May;18(5):821-9. doi: 10.1101/gr.074492.107. Epub 2008 Mar 18.