• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

调用没有参考序列的 SNPs。

Calling SNPs without a reference sequence.

机构信息

Center for Comparative Genomics and Bioinformatics, Pennsylvania State University, USA.

出版信息

BMC Bioinformatics. 2010 Mar 15;11:130. doi: 10.1186/1471-2105-11-130.

DOI:10.1186/1471-2105-11-130
PMID:20230626
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2851604/
Abstract

BACKGROUND

The most common application for the next-generation sequencing technologies is resequencing, where short reads from the genome of an individual are aligned to a reference genome sequence for the same species. These mappings can then be used to identify genetic differences among individuals in a population, and perhaps ultimately to explain phenotypic variation. Many algorithms capable of aligning short reads to the reference, and determining differences between them have been reported. Much less has been reported on how to use these technologies to determine genetic differences among individuals of a species for which a reference sequence is not available, which drastically limits the number of species that can easily benefit from these new technologies.

RESULTS

We describe a computational pipeline, called DIAL (De novo Identification of Alleles), for identifying single-base substitutions between two closely related genomes without the help of a reference genome. The method works even when the depth of coverage is insufficient for de novo assembly, and it can be extended to determine small insertions/deletions. We evaluate the software's effectiveness using published Roche/454 sequence data from the genome of Dr. James Watson (to detect heterozygous positions) and recent Illumina data from orangutan, in each case comparing our results to those from computational analysis that uses a reference genome assembly. We also illustrate the use of DIAL to identify nucleotide differences among transcriptome sequences.

CONCLUSIONS

DIAL can be used for identification of nucleotide differences in species for which no reference sequence is available. Our main motivation is to use this tool to survey the genetic diversity of endangered species as the identified sequence differences can be used to design genotyping arrays to assist in the species' management. The DIAL source code is freely available at http://www.bx.psu.edu/miller_lab/.

摘要

背景

下一代测序技术最常见的应用是重测序,即将个体基因组的短读段与同一物种的参考基因组序列进行比对。这些比对可以用来识别群体中个体之间的遗传差异,并最终解释表型变异。已经有许多能够将短读段与参考序列进行比对并确定它们之间差异的算法被报道。但关于如何利用这些技术来确定没有参考序列的物种中个体之间的遗传差异的报道却很少,这极大地限制了可以从这些新技术中受益的物种数量。

结果

我们描述了一种名为 DIAL(从头鉴定等位基因)的计算流程,用于在没有参考基因组的情况下识别两个密切相关的基因组之间的单碱基替换。该方法甚至在覆盖深度不足以进行从头组装的情况下也能工作,并且可以扩展用于确定小的插入/缺失。我们使用已发表的 Roche/454 序列数据(来自 Dr. James Watson 的基因组,用于检测杂合位置)和最近的猩猩 Illumina 数据来评估软件的有效性,在每种情况下,我们将结果与使用参考基因组组装的计算分析进行比较。我们还展示了 DIAL 用于鉴定转录组序列中核苷酸差异的用法。

结论

DIAL 可用于鉴定没有参考序列的物种中的核苷酸差异。我们的主要动机是使用此工具来调查濒危物种的遗传多样性,因为所鉴定的序列差异可用于设计基因分型阵列以协助物种管理。DIAL 的源代码可在 http://www.bx.psu.edu/miller_lab/ 上免费获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6856/2851604/94d6d0ea7889/1471-2105-11-130-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6856/2851604/c465c8766d1c/1471-2105-11-130-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6856/2851604/94d6d0ea7889/1471-2105-11-130-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6856/2851604/c465c8766d1c/1471-2105-11-130-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6856/2851604/94d6d0ea7889/1471-2105-11-130-4.jpg

相似文献

1
Calling SNPs without a reference sequence.调用没有参考序列的 SNPs。
BMC Bioinformatics. 2010 Mar 15;11:130. doi: 10.1186/1471-2105-11-130.
2
Coverage-based consensus calling (CbCC) of short sequence reads and comparison of CbCC results to identify SNPs in chickpea (Cicer arietinum; Fabaceae), a crop species without a reference genome.基于覆盖度的短序列读取共识调用(CbCC),并将 CbCC 结果与 SNP 进行比较,以鉴定无参考基因组的作物豌豆(Cicer arietinum;豆科)。
Am J Bot. 2012 Feb;99(2):186-92. doi: 10.3732/ajb.1100419. Epub 2012 Feb 1.
3
High quality SNP calling using Illumina data at shallow coverage.使用 Illumina 数据进行低深度覆盖的高质量 SNP 调用。
Bioinformatics. 2010 Apr 15;26(8):1029-35. doi: 10.1093/bioinformatics/btq092. Epub 2010 Feb 26.
4
Short reads and nonmodel species: exploring the complexities of next-generation sequence assembly and SNP discovery in the absence of a reference genome.短读序列和非模式物种:在缺乏参考基因组的情况下探索下一代序列组装和 SNP 发现的复杂性。
Mol Ecol Resour. 2011 Mar;11 Suppl 1:93-108. doi: 10.1111/j.1755-0998.2010.02969.x.
5
Orthology Guided Assembly in highly heterozygous crops: creating a reference transcriptome to uncover genetic diversity in Lolium perenne.同源基因引导的高度杂合作物组装:创建参考转录组以揭示黑麦草中的遗传多样性。
Plant Biotechnol J. 2013 Jun;11(5):605-17. doi: 10.1111/pbi.12051. Epub 2013 Feb 21.
6
Correction of sequencing errors in a mixed set of reads.纠正混合读取集中的测序错误。
Bioinformatics. 2010 May 15;26(10):1284-90. doi: 10.1093/bioinformatics/btq151. Epub 2010 Apr 8.
7
Exploring single-sample SNP and INDEL calling with whole-genome de novo assembly.利用全基因组从头组装进行单样本 SNP 和 INDEL 调用的探索。
Bioinformatics. 2012 Jul 15;28(14):1838-44. doi: 10.1093/bioinformatics/bts280. Epub 2012 May 7.
8
EagleView: a genome assembly viewer for next-generation sequencing technologies.EagleView:一款用于下一代测序技术的基因组组装查看器。
Genome Res. 2008 Sep;18(9):1538-43. doi: 10.1101/gr.076067.108. Epub 2008 Jun 11.
9
Sequencing of natural strains of Arabidopsis thaliana with short reads.对拟南芥自然菌株进行短读长测序。
Genome Res. 2008 Dec;18(12):2024-33. doi: 10.1101/gr.080200.108. Epub 2008 Sep 25.
10
SNP calling from RNA-seq data without a reference genome: identification, quantification, differential analysis and impact on the protein sequence.无参考基因组情况下从RNA测序数据中进行单核苷酸多态性(SNP)检测:鉴定、定量、差异分析及其对蛋白质序列的影响
Nucleic Acids Res. 2016 Nov 2;44(19):e148. doi: 10.1093/nar/gkw655. Epub 2016 Jul 25.

引用本文的文献

1
A single QTL with large effect is associated with female functional virginity in an asexual parasitoid wasp.一个具有较大效应的单一 QTL 与一种无性寄生蜂的雌性功能性处女有关。
Mol Ecol. 2021 May;30(9):1979-1992. doi: 10.1111/mec.15863. Epub 2021 Mar 15.
2
SNP Mining in Functional Genes from Nonmodel Species by Next-Generation Sequencing: A Case of Flowering, Pre-Harvest Sprouting, and Dehydration Resistant Genes in Wheat.利用下一代测序技术挖掘非模式物种功能基因中的单核苷酸多态性:以小麦开花、收获前发芽及脱水抗性基因为例
Biomed Res Int. 2016;2016:3524908. doi: 10.1155/2016/3524908. Epub 2016 Mar 14.
3
4Pipe4--A 454 data analysis pipeline for SNP detection in datasets with no reference sequence or strain information.

本文引用的文献

1
Optimization methods for selecting founder individuals for captive breeding or reintroduction of endangered species.用于为圈养繁殖或濒危物种重新引入选择奠基个体的优化方法。
Pac Symp Biocomput. 2010:43-53. doi: 10.1142/9789814295291_0006.
2
Large scale single nucleotide polymorphism discovery in unsequenced genomes using second generation high throughput sequencing technology: applied to turkey.利用第二代高通量测序技术在未测序基因组中进行大规模单核苷酸多态性发现:应用于火鸡。
BMC Genomics. 2009 Oct 16;10:479. doi: 10.1186/1471-2164-10-479.
3
Application of massive parallel sequencing to whole genome SNP discovery in the porcine genome.
4Pipe4——一种用于在没有参考序列或菌株信息的数据集中检测单核苷酸多态性的454数据分析流程。
BMC Bioinformatics. 2016 Jan 19;17:41. doi: 10.1186/s12859-016-0892-1.
4
Reliable in silico identification of sequence polymorphisms and their application for extending the genetic map of sugar beet (Beta vulgaris).可靠的序列多态性计算机识别及其在扩展甜菜(Beta vulgaris)遗传图谱中的应用。
PLoS One. 2014 Oct 10;9(10):e110113. doi: 10.1371/journal.pone.0110113. eCollection 2014.
5
Using next-generation sequencing to isolate mutant genes from forward genetic screens.利用下一代测序技术从正向遗传学筛选中分离突变基因。
Nat Rev Genet. 2014 Oct;15(10):662-76. doi: 10.1038/nrg3745. Epub 2014 Aug 20.
6
Brain transcriptome of the violet-eared waxbill Uraeginthus granatina and recent evolution in the songbird genome.紫耳蜂虎的脑转录组和鸣禽基因组的近期进化。
Open Biol. 2013 Sep 4;3(9):130063. doi: 10.1098/rsob.130063.
7
Development of strategies for SNP detection in RNA-seq data: application to lymphoblastoid cell lines and evaluation using 1000 Genomes data.RNA-seq 数据中 SNP 检测策略的开发:在淋巴母细胞系中的应用及使用 1000 基因组数据的评估。
PLoS One. 2013;8(3):e58815. doi: 10.1371/journal.pone.0058815. Epub 2013 Mar 26.
8
Aye-aye population genomic analyses highlight an important center of endemism in northern Madagascar.大眼长尾穿山甲的种群基因组分析突出了马达加斯加北部一个重要的特有中心。
Proc Natl Acad Sci U S A. 2013 Apr 9;110(15):5823-8. doi: 10.1073/pnas.1211990110. Epub 2013 Mar 25.
9
Identification of high-quality single-nucleotide polymorphisms in Glycine latifolia using a heterologous reference genome sequence.利用异源参考基因组序列鉴定甘草中的高质量单核苷酸多态性。
Theor Appl Genet. 2013 Jun;126(6):1627-38. doi: 10.1007/s00122-013-2079-8. Epub 2013 Mar 15.
10
Mutation identification by direct comparison of whole-genome sequencing data from mutant and wild-type individuals using k-mers.使用 k- -mer 通过比较突变体和野生型个体的全基因组测序数据来鉴定突变。
Nat Biotechnol. 2013 Apr;31(4):325-30. doi: 10.1038/nbt.2515. Epub 2013 Mar 10.
大规模平行测序在猪基因组全基因组单核苷酸多态性发现中的应用。
BMC Genomics. 2009 Aug 12;10:374. doi: 10.1186/1471-2164-10-374.
4
The first Korean genome sequence and analysis: full genome sequencing for a socio-ethnic group.首个韩国人基因组序列及分析:针对一个社会族群的全基因组测序
Genome Res. 2009 Sep;19(9):1622-9. doi: 10.1101/gr.092197.109. Epub 2009 May 26.
5
SHRiMP: accurate mapping of short color-space reads.SHRiMP:短颜色空间读数的精确映射
PLoS Comput Biol. 2009 May;5(5):e1000386. doi: 10.1371/journal.pcbi.1000386. Epub 2009 May 22.
6
Reduced heterozygosity impairs sperm quality in endangered mammals.杂合性降低会损害濒危哺乳动物的精子质量。
Biol Lett. 2009 Jun 23;5(3):320-3. doi: 10.1098/rsbl.2008.0734. Epub 2009 Mar 4.
7
Ultrafast and memory-efficient alignment of short DNA sequences to the human genome.短DNA序列与人类基因组的超快速且内存高效比对。
Genome Biol. 2009;10(3):R25. doi: 10.1186/gb-2009-10-3-r25. Epub 2009 Mar 4.
8
Solution hybrid selection with ultra-long oligonucleotides for massively parallel targeted sequencing.用于大规模平行靶向测序的超长寡核苷酸溶液杂交选择法。
Nat Biotechnol. 2009 Feb;27(2):182-9. doi: 10.1038/nbt.1523. Epub 2009 Feb 1.
9
SNP discovery in swine by reduced representation and high throughput pyrosequencing.通过简化基因组和高通量焦磷酸测序技术在猪中发现单核苷酸多态性
BMC Genet. 2008 Dec 4;9:81. doi: 10.1186/1471-2156-9-81.
10
Gene-boosted assembly of a novel bacterial genome from very short reads.基于极短读段的新型细菌基因组的基因增强组装
PLoS Comput Biol. 2008 Sep 26;4(9):e1000186. doi: 10.1371/journal.pcbi.1000186.