• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

VARiD:一种用于颜色空间和字母空间平台的变异检测框架。

VARiD: a variation detection framework for color-space and letter-space platforms.

机构信息

Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, Cambridge, MA, USA.

出版信息

Bioinformatics. 2010 Jun 15;26(12):i343-9. doi: 10.1093/bioinformatics/btq184.

DOI:10.1093/bioinformatics/btq184
PMID:20529926
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2881369/
Abstract

MOTIVATION

High-throughput sequencing (HTS) technologies are transforming the study of genomic variation. The various HTS technologies have different sequencing biases and error rates, and while most HTS technologies sequence the residues of the genome directly, generating base calls for each position, the Applied Biosystem's SOLiD platform generates dibase-coded (color space) sequences. While combining data from the various platforms should increase the accuracy of variation detection, to date there are only a few tools that can identify variants from color space data, and none that can analyze color space and regular (letter space) data together.

RESULTS

We present VARiD--a probabilistic method for variation detection from both letter- and color-space reads simultaneously. VARiD is based on a hidden Markov model and uses the forward-backward algorithm to accurately identify heterozygous, homozygous and tri-allelic SNPs, as well as micro-indels. Our analysis shows that VARiD performs better than the AB SOLiD toolset at detecting variants from color-space data alone, and improves the calls dramatically when letter- and color-space reads are combined.

AVAILABILITY

The toolset is freely available at http://compbio.cs.utoronto.ca/varid.

摘要

动机

高通量测序(HTS)技术正在改变基因组变异的研究方式。各种 HTS 技术具有不同的测序偏差和错误率,虽然大多数 HTS 技术直接对基因组的残基进行测序,为每个位置生成碱基调用,但 Applied Biosystem 的 SOLiD 平台生成双碱基编码(颜色空间)序列。虽然结合来自各种平台的数据应该会提高变异检测的准确性,但迄今为止,只有少数工具可以从颜色空间数据中识别变体,并且没有可以同时分析颜色空间和常规(字母空间)数据的工具。

结果

我们提出了 VARiD-一种同时从字母空间和颜色空间读取数据中进行变异检测的概率方法。VARiD 基于隐马尔可夫模型,并使用前向-后向算法来准确识别杂合子、纯合子和三等位基因 SNP 以及微缺失。我们的分析表明,VARiD 单独从颜色空间数据中检测变体的性能优于 AB SOLiD 工具集,并且当字母空间和颜色空间读取结合使用时,大大改善了调用。

可用性

该工具集可在 http://compbio.cs.utoronto.ca/varid 上免费获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a13/2881369/2d61bf941276/btq184f10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a13/2881369/7e588a79ba00/btq184f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a13/2881369/ba6d607ed015/btq184f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a13/2881369/9417f8e068e1/btq184f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a13/2881369/e00bb4b212fd/btq184f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a13/2881369/7c716b4b05e2/btq184f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a13/2881369/286b4c8d9967/btq184f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a13/2881369/3d597f52291a/btq184f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a13/2881369/7aeb6f952ab9/btq184f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a13/2881369/3e3173604a81/btq184f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a13/2881369/2d61bf941276/btq184f10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a13/2881369/7e588a79ba00/btq184f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a13/2881369/ba6d607ed015/btq184f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a13/2881369/9417f8e068e1/btq184f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a13/2881369/e00bb4b212fd/btq184f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a13/2881369/7c716b4b05e2/btq184f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a13/2881369/286b4c8d9967/btq184f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a13/2881369/3d597f52291a/btq184f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a13/2881369/7aeb6f952ab9/btq184f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a13/2881369/3e3173604a81/btq184f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5a13/2881369/2d61bf941276/btq184f10.jpg

相似文献

1
VARiD: a variation detection framework for color-space and letter-space platforms.VARiD:一种用于颜色空间和字母空间平台的变异检测框架。
Bioinformatics. 2010 Jun 15;26(12):i343-9. doi: 10.1093/bioinformatics/btq184.
2
ComB: SNP calling and mapping analysis for color and nucleotide space platforms.ComB:用于颜色和核苷酸空间平台的单核苷酸多态性(SNP)检测与定位分析
J Comput Biol. 2011 Jun;18(6):795-807. doi: 10.1089/cmb.2011.0027. Epub 2011 May 12.
3
Correction of sequencing errors in a mixed set of reads.纠正混合读取集中的测序错误。
Bioinformatics. 2010 May 15;26(10):1284-90. doi: 10.1093/bioinformatics/btq151. Epub 2010 Apr 8.
4
SHRiMP: accurate mapping of short color-space reads.SHRiMP:短颜色空间读数的精确映射
PLoS Comput Biol. 2009 May;5(5):e1000386. doi: 10.1371/journal.pcbi.1000386. Epub 2009 May 22.
5
A statistical method for the detection of variants from next-generation resequencing of DNA pools.一种用于从 DNA 池的下一代重测序中检测变异的统计方法。
Bioinformatics. 2010 Jun 15;26(12):i318-24. doi: 10.1093/bioinformatics/btq214.
6
SHRiMP2: sensitive yet practical SHort Read Mapping.SHRiMP2:敏感而实用的短读序列比对。
Bioinformatics. 2011 Apr 1;27(7):1011-2. doi: 10.1093/bioinformatics/btr046. Epub 2011 Jan 28.
7
Sensitive and fast mapping of di-base encoded reads.双碱基编码读取的敏感快速映射。
Bioinformatics. 2011 Jul 15;27(14):1915-21. doi: 10.1093/bioinformatics/btr303. Epub 2011 May 17.
8
Apollo: a sequencing-technology-independent, scalable and accurate assembly polishing algorithm.阿波罗:一种与测序技术无关、可扩展且准确的组装后处理算法。
Bioinformatics. 2020 Jun 1;36(12):3669-3679. doi: 10.1093/bioinformatics/btaa179.
9
SNVMix: predicting single nucleotide variants from next-generation sequencing of tumors.SNVMix:从肿瘤的下一代测序中预测单核苷酸变异。
Bioinformatics. 2010 Mar 15;26(6):730-6. doi: 10.1093/bioinformatics/btq040. Epub 2010 Feb 3.
10
A robust framework for detecting structural variations in a genome.一种用于检测基因组结构变异的强大框架。
Bioinformatics. 2008 Jul 1;24(13):i59-67. doi: 10.1093/bioinformatics/btn176.

引用本文的文献

1
Polyploidy can drive rapid adaptation in yeast.多倍体可推动酵母的快速适应性变化。
Nature. 2015 Mar 19;519(7543):349-52. doi: 10.1038/nature14187. Epub 2015 Mar 2.
2
SNVHMM: predicting single nucleotide variants from next generation sequencing.SNVHMM:从下一代测序中预测单核苷酸变异。
BMC Bioinformatics. 2013 Jul 15;14:225. doi: 10.1186/1471-2105-14-225.
3
Using a periclinal chimera to unravel layer-specific gene expression in plants.利用周缘嵌合体解开植物中特定层的基因表达。

本文引用的文献

1
Genome variation discovery with high-throughput sequencing data.高通量测序数据的基因组变异发现。
Brief Bioinform. 2010 Jan;11(1):3-14. doi: 10.1093/bib/bbp058. Epub 2010 Jan 6.
2
Sense from sequence reads: methods for alignment and assembly.从序列读取中获取意义:比对和组装方法
Nat Methods. 2009 Nov;6(11 Suppl):S6-S12. doi: 10.1038/nmeth.1376.
3
Sequence and structural variation in a human genome uncovered by short-read, massively parallel ligation sequencing using two-base encoding.通过使用双碱基编码的短读长、大规模平行连接测序揭示的人类基因组中的序列和结构变异。
Plant J. 2013 Sep;75(6):1039-49. doi: 10.1111/tpj.12250. Epub 2013 Jul 19.
4
Automated cleaning and pre-processing of immunoglobulin gene sequences from high-throughput sequencing.从高通量测序中自动清洗和预处理免疫球蛋白基因序列。
Front Immunol. 2012 Dec 28;3:386. doi: 10.3389/fimmu.2012.00386. eCollection 2012.
5
The role of clonal interference in the evolutionary dynamics of plasmid-host adaptation.克隆干扰在质粒-宿主适应进化动态中的作用。
mBio. 2012 Jul 3;3(4):e00077-12. doi: 10.1128/mBio.00077-12. Print 2012.
Genome Res. 2009 Sep;19(9):1527-41. doi: 10.1101/gr.091868.109. Epub 2009 Jun 22.
4
SOAP2: an improved ultrafast tool for short read alignment.SOAP2:一种用于短读序列比对的改进型超快速工具。
Bioinformatics. 2009 Aug 1;25(15):1966-7. doi: 10.1093/bioinformatics/btp336. Epub 2009 Jun 3.
5
SHRiMP: accurate mapping of short color-space reads.SHRiMP:短颜色空间读数的精确映射
PLoS Comput Biol. 2009 May;5(5):e1000386. doi: 10.1371/journal.pcbi.1000386. Epub 2009 May 22.
6
Fast and accurate short read alignment with Burrows-Wheeler transform.使用Burrows-Wheeler变换进行快速准确的短读比对。
Bioinformatics. 2009 Jul 15;25(14):1754-60. doi: 10.1093/bioinformatics/btp324. Epub 2009 May 18.
7
Evaluation of next generation sequencing platforms for population targeted sequencing studies.用于群体靶向测序研究的新一代测序平台评估
Genome Biol. 2009;10(3):R32. doi: 10.1186/gb-2009-10-3-r32. Epub 2009 Mar 27.
8
Ultrafast and memory-efficient alignment of short DNA sequences to the human genome.短DNA序列与人类基因组的超快速且内存高效比对。
Genome Biol. 2009;10(3):R25. doi: 10.1186/gb-2009-10-3-r25. Epub 2009 Mar 4.
9
PASS: a program to align short sequences.PASS:一个用于比对短序列的程序。
Bioinformatics. 2009 Apr 1;25(7):967-8. doi: 10.1093/bioinformatics/btp087. Epub 2009 Feb 13.
10
Mapping short DNA sequencing reads and calling variants using mapping quality scores.使用比对质量分数比对短DNA测序读数并识别变异。
Genome Res. 2008 Nov;18(11):1851-8. doi: 10.1101/gr.078212.108. Epub 2008 Aug 19.