A divide-and-conquer approach to fragment assembly.

Authors

Otu Hasan H, Sayood Khalid

Affiliation

University of Nebraska-Lincoln, Department of Electrical Engineering, 209N WSEC, 68503, USA.

Publication details

Bioinformatics. 2003 Jan;19(1):22-9. doi: 10.1093/bioinformatics/19.1.22.

DOI:10.1093/bioinformatics/19.1.22
PMID:12499289
Abstract

MOTIVATION

One of the major problems in DNA sequencing is assembling the fragments obtained by shotgun sequencing. Most existing fragment assembly techniques follow the overlap-layout-consensus approach. This framework requires extensive computation in each phase and becomes inefficient with increasing number of fragments.

RESULTS

We propose a new algorithm which solves the overlap, layout, and consensus phases simultaneously. The fragments are clustered with respect to their Average Mutual Information (AMI) profiles using the k-means algorithm. This removes the unnecessary burden of considering the collection of fragments as a whole. Instead, the orientation and overlap detection are solved efficiently, within the clusters. The algorithm has successfully reconstructed both artificial and real data.
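The clustering step described above can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the lag range, the distance measure, or the number of clusters, so `max_lag`, the squared Euclidean distance, `k`, and the helper names `ami_profile` and `kmeans` are all assumptions made for illustration. The AMI at lag k is taken here in its standard form, I(k) = Σ p_k(X,Y) log2( p_k(X,Y) / (p(X) p(Y)) ), where p_k(X,Y) is the observed frequency of base X followed by base Y k positions later.

```python
import math
import random
from collections import Counter


def ami_profile(seq, max_lag=8):
    """Return [I(1), ..., I(max_lag)], the average mutual information (bits)
    between bases that are k positions apart. max_lag is an assumed setting."""
    profile = []
    for k in range(1, max_lag + 1):
        pairs = Counter(zip(seq, seq[k:]))  # counts of (base at i, base at i+k)
        total = sum(pairs.values())
        if total == 0:  # fragment shorter than the lag
            profile.append(0.0)
            continue
        left, right = Counter(), Counter()  # marginal counts of each position
        for (x, y), n in pairs.items():
            left[x] += n
            right[y] += n
        info = 0.0
        for (x, y), n in pairs.items():
            # p(x,y) / (p(x) p(y)) simplifies to n * total / (left[x] * right[y])
            info += (n / total) * math.log2(n * total / (left[x] * right[y]))
        profile.append(info)
    return profile


def kmeans(points, k, iterations=50, seed=0):
    """Plain Lloyd-style k-means on lists of floats; returns one label per point."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]
    labels = [0] * len(points)
    for _ in range(iterations):
        # Assignment step: nearest centroid by squared Euclidean distance.
        labels = [
            min(range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centroids[c])))
            for p in points
        ]
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


# Toy fragments (hypothetical data, not from the paper).
fragments = ["ACGT" * 10, "ACGT" * 12, "A" * 40, "A" * 48]
profiles = [ami_profile(f, max_lag=4) for f in fragments]
labels = kmeans(profiles, k=2)
```

In this sketch, overlap detection would then be run only among fragments that share a label, which is the sense in which the collection no longer has to be considered as a whole.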

AVAILABILITY

Available on request from the authors.

Similar articles

1. A divide-and-conquer approach to fragment assembly.
   Bioinformatics. 2003 Jan;19(1):22-9. doi: 10.1093/bioinformatics/19.1.22.
2. Efficiently detecting polymorphisms during the fragment assembly process.
   Bioinformatics. 2002;18 Suppl 1:S294-302. doi: 10.1093/bioinformatics/18.suppl_1.s294.
3. Restarting and recentering genetic algorithm variations for DNA fragment assembly: The necessity of a multi-strategy approach.
   Biosystems. 2016 Dec;150:35-45. doi: 10.1016/j.biosystems.2016.08.001. Epub 2016 Aug 10.
4. An Eulerian path approach to DNA fragment assembly.
   Proc Natl Acad Sci U S A. 2001 Aug 14;98(17):9748-53. doi: 10.1073/pnas.171285098.
5. A simulated annealing algorithm for finding consensus sequences.
   Bioinformatics. 2002 Nov;18(11):1494-9. doi: 10.1093/bioinformatics/18.11.1494.
6. A graph based algorithm for generating EST consensus sequences.
   Bioinformatics. 2005 Apr 15;21(8):1371-5. doi: 10.1093/bioinformatics/bti184. Epub 2004 Nov 30.
7. Generating consensus sequences from partial order multiple sequence alignment graphs.
   Bioinformatics. 2003 May 22;19(8):999-1008. doi: 10.1093/bioinformatics/btg109.
8. Refinement of optical map assemblies.
   Bioinformatics. 2006 May 15;22(10):1217-24. doi: 10.1093/bioinformatics/btl063. Epub 2006 Feb 24.
9. Efficient filtering methods for clustering cDNAs with spliced sequence alignment.
   Bioinformatics. 2004 Jan 1;20(1):29-39. doi: 10.1093/bioinformatics/btg367.
10. AMASS: a structured pattern matching approach to shotgun sequence assembly.
    J Comput Biol. 1999 Summer;6(2):163-86. doi: 10.1089/cmb.1999.6.163.

Cited by

1. Use of Average Mutual Information and Derived Measures to Find Coding Regions.
   Entropy (Basel). 2021 Oct 11;23(10):1324. doi: 10.3390/e23101324.
2. ARK: Aggregation of Reads by K-Means for Estimation of Bacterial Community Composition.
   PLoS One. 2015 Oct 23;10(10):e0140644. doi: 10.1371/journal.pone.0140644. eCollection 2015.
3. Sequence analysis of origins of replication in the Saccharomyces cerevisiae genomes.
   Front Microbiol. 2014 Nov 18;5:574. doi: 10.3389/fmicb.2014.00574. eCollection 2014.
4. Data Compression Concepts and Algorithms and their Applications to Bioinformatics.
   Entropy (Basel). 2010 Jan 1;12(1):34. doi: 10.3390/e12010034.
5. Use of average mutual information for studying changes in HIV populations.
   Annu Int Conf IEEE Eng Med Biol Soc. 2009;2009:3861-4. doi: 10.1109/IEMBS.2009.5332579.
6. Establishment of a pipeline to analyse non-synonymous SNPs in Bos taurus.
   BMC Genomics. 2006 Nov 26;7:298. doi: 10.1186/1471-2164-7-298.