文献检索文档翻译深度研究
Suppr Zotero 插件Zotero 插件
邀请有礼套餐&价格历史记录

新学期,新优惠

限时优惠:9月1日-9月22日

30天高级会员仅需29元

1天体验卡首发特惠仅需5.99元

了解详情
不再提醒
插件&应用
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
高级版
套餐订阅购买积分包
AI 工具
文献检索文档翻译深度研究
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2025

调用没有参考序列的 SNPs。

Calling SNPs without a reference sequence.

机构信息

Center for Comparative Genomics and Bioinformatics, Pennsylvania State University, USA.

出版信息

BMC Bioinformatics. 2010 Mar 15;11:130. doi: 10.1186/1471-2105-11-130.


DOI:10.1186/1471-2105-11-130
PMID:20230626
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2851604/
Abstract

BACKGROUND: The most common application for the next-generation sequencing technologies is resequencing, where short reads from the genome of an individual are aligned to a reference genome sequence for the same species. These mappings can then be used to identify genetic differences among individuals in a population, and perhaps ultimately to explain phenotypic variation. Many algorithms capable of aligning short reads to the reference, and determining differences between them have been reported. Much less has been reported on how to use these technologies to determine genetic differences among individuals of a species for which a reference sequence is not available, which drastically limits the number of species that can easily benefit from these new technologies. RESULTS: We describe a computational pipeline, called DIAL (De novo Identification of Alleles), for identifying single-base substitutions between two closely related genomes without the help of a reference genome. The method works even when the depth of coverage is insufficient for de novo assembly, and it can be extended to determine small insertions/deletions. We evaluate the software's effectiveness using published Roche/454 sequence data from the genome of Dr. James Watson (to detect heterozygous positions) and recent Illumina data from orangutan, in each case comparing our results to those from computational analysis that uses a reference genome assembly. We also illustrate the use of DIAL to identify nucleotide differences among transcriptome sequences. CONCLUSIONS: DIAL can be used for identification of nucleotide differences in species for which no reference sequence is available. Our main motivation is to use this tool to survey the genetic diversity of endangered species as the identified sequence differences can be used to design genotyping arrays to assist in the species' management. The DIAL source code is freely available at http://www.bx.psu.edu/miller_lab/.

摘要

背景:下一代测序技术最常见的应用是重测序,即将个体基因组的短读段与同一物种的参考基因组序列进行比对。这些比对可以用来识别群体中个体之间的遗传差异,并最终解释表型变异。已经有许多能够将短读段与参考序列进行比对并确定它们之间差异的算法被报道。但关于如何利用这些技术来确定没有参考序列的物种中个体之间的遗传差异的报道却很少,这极大地限制了可以从这些新技术中受益的物种数量。

结果:我们描述了一种名为 DIAL(从头鉴定等位基因)的计算流程,用于在没有参考基因组的情况下识别两个密切相关的基因组之间的单碱基替换。该方法甚至在覆盖深度不足以进行从头组装的情况下也能工作,并且可以扩展用于确定小的插入/缺失。我们使用已发表的 Roche/454 序列数据(来自 Dr. James Watson 的基因组,用于检测杂合位置)和最近的猩猩 Illumina 数据来评估软件的有效性,在每种情况下,我们将结果与使用参考基因组组装的计算分析进行比较。我们还展示了 DIAL 用于鉴定转录组序列中核苷酸差异的用法。

结论:DIAL 可用于鉴定没有参考序列的物种中的核苷酸差异。我们的主要动机是使用此工具来调查濒危物种的遗传多样性,因为所鉴定的序列差异可用于设计基因分型阵列以协助物种管理。DIAL 的源代码可在 http://www.bx.psu.edu/miller_lab/ 上免费获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6856/2851604/94d6d0ea7889/1471-2105-11-130-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6856/2851604/c465c8766d1c/1471-2105-11-130-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6856/2851604/94d6d0ea7889/1471-2105-11-130-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6856/2851604/c465c8766d1c/1471-2105-11-130-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6856/2851604/94d6d0ea7889/1471-2105-11-130-4.jpg

相似文献

[1]
Calling SNPs without a reference sequence.

BMC Bioinformatics. 2010-3-15

[2]
Coverage-based consensus calling (CbCC) of short sequence reads and comparison of CbCC results to identify SNPs in chickpea (Cicer arietinum; Fabaceae), a crop species without a reference genome.

Am J Bot. 2012-2-1

[3]
High quality SNP calling using Illumina data at shallow coverage.

Bioinformatics. 2010-2-26

[4]
Short reads and nonmodel species: exploring the complexities of next-generation sequence assembly and SNP discovery in the absence of a reference genome.

Mol Ecol Resour. 2011-3

[5]
Orthology Guided Assembly in highly heterozygous crops: creating a reference transcriptome to uncover genetic diversity in Lolium perenne.

Plant Biotechnol J. 2013-2-21

[6]
Correction of sequencing errors in a mixed set of reads.

Bioinformatics. 2010-4-8

[7]
Exploring single-sample SNP and INDEL calling with whole-genome de novo assembly.

Bioinformatics. 2012-5-7

[8]
EagleView: a genome assembly viewer for next-generation sequencing technologies.

Genome Res. 2008-9

[9]
Sequencing of natural strains of Arabidopsis thaliana with short reads.

Genome Res. 2008-12

[10]
SNP calling from RNA-seq data without a reference genome: identification, quantification, differential analysis and impact on the protein sequence.

Nucleic Acids Res. 2016-11-2

引用本文的文献

[1]
A single QTL with large effect is associated with female functional virginity in an asexual parasitoid wasp.

Mol Ecol. 2021-5

[2]
SNP Mining in Functional Genes from Nonmodel Species by Next-Generation Sequencing: A Case of Flowering, Pre-Harvest Sprouting, and Dehydration Resistant Genes in Wheat.

Biomed Res Int. 2016

[3]
4Pipe4--A 454 data analysis pipeline for SNP detection in datasets with no reference sequence or strain information.

BMC Bioinformatics. 2016-1-19

[4]
Reliable in silico identification of sequence polymorphisms and their application for extending the genetic map of sugar beet (Beta vulgaris).

PLoS One. 2014-10-10

[5]
Using next-generation sequencing to isolate mutant genes from forward genetic screens.

Nat Rev Genet. 2014-8-20

[6]
Brain transcriptome of the violet-eared waxbill Uraeginthus granatina and recent evolution in the songbird genome.

Open Biol. 2013-9-4

[7]
Development of strategies for SNP detection in RNA-seq data: application to lymphoblastoid cell lines and evaluation using 1000 Genomes data.

PLoS One. 2013-3-26

[8]
Aye-aye population genomic analyses highlight an important center of endemism in northern Madagascar.

Proc Natl Acad Sci U S A. 2013-3-25

[9]
Identification of high-quality single-nucleotide polymorphisms in Glycine latifolia using a heterologous reference genome sequence.

Theor Appl Genet. 2013-3-15

[10]
Mutation identification by direct comparison of whole-genome sequencing data from mutant and wild-type individuals using k-mers.

Nat Biotechnol. 2013-3-10

本文引用的文献

[1]
Optimization methods for selecting founder individuals for captive breeding or reintroduction of endangered species.

Pac Symp Biocomput. 2010

[2]
Large scale single nucleotide polymorphism discovery in unsequenced genomes using second generation high throughput sequencing technology: applied to turkey.

BMC Genomics. 2009-10-16

[3]
Application of massive parallel sequencing to whole genome SNP discovery in the porcine genome.

BMC Genomics. 2009-8-12

[4]
The first Korean genome sequence and analysis: full genome sequencing for a socio-ethnic group.

Genome Res. 2009-9

[5]
SHRiMP: accurate mapping of short color-space reads.

PLoS Comput Biol. 2009-5

[6]
Reduced heterozygosity impairs sperm quality in endangered mammals.

Biol Lett. 2009-6-23

[7]
Ultrafast and memory-efficient alignment of short DNA sequences to the human genome.

Genome Biol. 2009

[8]
Solution hybrid selection with ultra-long oligonucleotides for massively parallel targeted sequencing.

Nat Biotechnol. 2009-2

[9]
SNP discovery in swine by reduced representation and high throughput pyrosequencing.

BMC Genet. 2008-12-4

[10]
Gene-boosted assembly of a novel bacterial genome from very short reads.

PLoS Comput Biol. 2008-9-26

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

推荐工具

医学文档翻译智能文献检索