• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

处理杂交测序中的长靶标和错误。

Handling long targets and errors in sequencing by hybridization.

作者信息

Halperin Eran, Halperin Shay, Hartman Tzvika, Shamir Ron

机构信息

CS Division, Soda Hall, University of California at Berkeley, Berkeley, CA 94720-1776, USA.

出版信息

J Comput Biol. 2003;10(3-4):483-97. doi: 10.1089/10665270360688138.

DOI:10.1089/10665270360688138
PMID:12935339
Abstract

Sequencing by hybridization (SBH) is a DNA sequencing technique, in which the sequence is reconstructed using its k-mer content. This content, which is called the spectrum of the sequence, is obtained by hybridization to a universal DNA array. Standard universal arrays contain all k-mers for some fixed k, typically 8 to 10. Currently, in spite of its promise and elegance, SBH is not competitive with standard gel-based sequencing methods. This is due to two main reasons: lack of tools to handle realistic levels of hybridization errors and an inherent limitation on the length of uniquely reconstructible sequence by standard universal arrays. In this paper, we deal with both problems. We introduce a simple polynomial reconstruction algorithm which can be applied to spectra from standard arrays and has provable performance in the presence of both false negative and false positive errors. We also propose a novel design of chips containing universal bases that differs from the one proposed by Preparata et al. (1999). We give a simple algorithm that uses spectra from such chips to reconstruct with high probability random sequences of length lower only by a squared log factor compared to the information theoretic bound. Our algorithm is very robust to errors and has a provable performance even if there are both false negative and false positive errors. Simulations indicate that its sensitivity to errors is also very small in practice.

摘要

杂交测序(SBH)是一种DNA测序技术,其中序列是利用其k元组含量进行重建的。这种含量,即所谓的序列谱,是通过与通用DNA阵列杂交获得的。标准通用阵列包含针对某个固定k(通常为8到10)的所有k元组。目前,尽管SBH有前景且精妙,但它与基于凝胶的标准测序方法相比并无竞争力。这主要有两个原因:缺乏处理实际杂交错误水平的工具,以及标准通用阵列在唯一可重建序列长度方面存在固有局限性。在本文中,我们处理这两个问题。我们引入一种简单的多项式重建算法,该算法可应用于来自标准阵列的谱,并且在存在假阴性和假阳性错误的情况下具有可证明的性能。我们还提出了一种包含通用碱基的芯片的新颖设计,它不同于Preparata等人(1999年)提出的设计。我们给出一种简单算法,该算法使用来自此类芯片的谱以高概率重建长度仅比信息论界限低一个平方对数因子的随机序列。我们的算法对错误非常鲁棒,即使存在假阴性和假阳性错误也具有可证明的性能。模拟表明,在实际应用中其对错误的敏感度也非常小。

相似文献

1
Handling long targets and errors in sequencing by hybridization.处理杂交测序中的长靶标和错误。
J Comput Biol. 2003;10(3-4):483-97. doi: 10.1089/10665270360688138.
2
Sequencing by hybridization in the presence of hybridization errors.存在杂交错误时的杂交测序法。
Genome Inform Ser Workshop Genome Inform. 2000;11:53-62.
3
Large scale sequencing by hybridization.大规模杂交测序
J Comput Biol. 2002;9(2):413-28. doi: 10.1089/10665270252935548.
4
Optimal reconstruction of a sequence from its probes.根据其探测值对序列进行最优重建。
J Comput Biol. 1999 Fall-Winter;6(3-4):361-8. doi: 10.1089/106652799318328.
5
A computational method for resequencing long DNA targets by universal oligonucleotide arrays.一种通过通用寡核苷酸阵列对长DNA靶标进行重测序的计算方法。
Proc Natl Acad Sci U S A. 2002 Nov 26;99(24):15492-6. doi: 10.1073/pnas.232278299. Epub 2002 Nov 12.
6
Sequencing-by-hybridization revisited: the analog-spectrum proposal.重新审视杂交测序:模拟频谱方案。
IEEE/ACM Trans Comput Biol Bioinform. 2004 Jan-Mar;1(1):46-52. doi: 10.1109/TCBB.2004.12.
7
Sequencing by hybridization by cooperating direct and reverse spectra.
J Comput Biol. 2003;10(3-4):499-508. doi: 10.1089/10665270360688147.
8
Sequencing-by-hybridization at the information-theory bound: an optimal algorithm.信息论极限下的杂交测序:一种最优算法。
J Comput Biol. 2000;7(3-4):621-30. doi: 10.1089/106652700750050970.
9
Reconstruction of DNA sequencing by hybridization.通过杂交进行DNA测序的重建。
Bioinformatics. 2003 Jan;19(1):14-21. doi: 10.1093/bioinformatics/19.1.14.
10
Multiplex sequencing by hybridization.基于杂交的多重测序
J Comput Biol. 2001;8(2):141-9. doi: 10.1089/106652701300312913.

引用本文的文献

1
Robust-LongSAGE (RL-SAGE): a substantially improved LongSAGE method for gene discovery and transcriptome analysis.稳健长链SAGE(RL-SAGE):一种用于基因发现和转录组分析的显著改进的长链SAGE方法。
Plant Physiol. 2004 Mar;134(3):890-7. doi: 10.1104/pp.103.034496.