• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于生物序列比较的启发式方法的自动生成。

Automated generation of heuristics for biological sequence comparison.

作者信息

Slater Guy St C, Birney Ewan

机构信息

The Ensembl Group, EMBL-European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK.

出版信息

BMC Bioinformatics. 2005 Feb 15;6:31. doi: 10.1186/1471-2105-6-31.

DOI:10.1186/1471-2105-6-31
PMID:15713233
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC553969/
Abstract

BACKGROUND

Exhaustive methods of sequence alignment are accurate but slow, whereas heuristic approaches run quickly, but their complexity makes them more difficult to implement. We introduce bounded sparse dynamic programming (BSDP) to allow rapid approximation to exhaustive alignment. This is used within a framework whereby the alignment algorithms are described in terms of their underlying model, to allow automated development of efficient heuristic implementations which may be applied to a general set of sequence comparison problems.

RESULTS

The speed and accuracy of this approach compares favourably with existing methods. Examples of its use in the context of genome annotation are given.

CONCLUSIONS

This system allows rapid implementation of heuristics approximating to many complex alignment models, and has been incorporated into the freely available sequence alignment program, exonerate.

摘要

背景

详尽的序列比对方法准确但速度慢,而启发式方法运行速度快,但其复杂性使其更难实现。我们引入有界稀疏动态规划(BSDP)以实现对详尽比对的快速近似。这在一个框架内使用,在该框架中,比对算法根据其基础模型进行描述,以允许自动开发高效的启发式实现,这些实现可应用于一般的序列比较问题集。

结果

该方法的速度和准确性与现有方法相比具有优势。给出了其在基因组注释背景下的使用示例。

结论

该系统允许快速实现近似许多复杂比对模型的启发式方法,并且已被纳入免费的序列比对程序exonerate中。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/640b/553969/cf85352a3a71/1471-2105-6-31-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/640b/553969/48005ae44e0f/1471-2105-6-31-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/640b/553969/5da738c48f5a/1471-2105-6-31-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/640b/553969/afda4336383b/1471-2105-6-31-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/640b/553969/e83755f77f03/1471-2105-6-31-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/640b/553969/c95c743d5889/1471-2105-6-31-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/640b/553969/cf85352a3a71/1471-2105-6-31-6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/640b/553969/48005ae44e0f/1471-2105-6-31-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/640b/553969/5da738c48f5a/1471-2105-6-31-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/640b/553969/afda4336383b/1471-2105-6-31-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/640b/553969/e83755f77f03/1471-2105-6-31-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/640b/553969/c95c743d5889/1471-2105-6-31-5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/640b/553969/cf85352a3a71/1471-2105-6-31-6.jpg

相似文献

1
Automated generation of heuristics for biological sequence comparison.用于生物序列比较的启发式方法的自动生成。
BMC Bioinformatics. 2005 Feb 15;6:31. doi: 10.1186/1471-2105-6-31.
2
ParAlign: a parallel sequence alignment algorithm for rapid and sensitive database searches.ParAlign:一种用于快速且灵敏的数据库搜索的并行序列比对算法。
Nucleic Acids Res. 2001 Apr 1;29(7):1647-52. doi: 10.1093/nar/29.7.1647.
3
transAlign: using amino acids to facilitate the multiple alignment of protein-coding DNA sequences.transAlign:利用氨基酸促进蛋白质编码DNA序列的多重比对。
BMC Bioinformatics. 2005 Jun 22;6:156. doi: 10.1186/1471-2105-6-156.
4
Heuristic reusable dynamic programming: efficient updates of local sequence alignment.启发式可重用动态规划:局部序列比对的高效更新。
IEEE/ACM Trans Comput Biol Bioinform. 2009 Oct-Dec;6(4):570-82. doi: 10.1109/TCBB.2009.30.
5
Refined repetitive sequence searches utilizing a fast hash function and cross species information retrievals.利用快速哈希函数和跨物种信息检索进行精细的重复序列搜索。
BMC Bioinformatics. 2005 May 3;6:111. doi: 10.1186/1471-2105-6-111.
6
Dynamic programming algorithms for biological sequence comparison.用于生物序列比较的动态规划算法。
Methods Enzymol. 1992;210:575-601. doi: 10.1016/0076-6879(92)10029-d.
7
Workflows in bioinformatics: meta-analysis and prototype implementation of a workflow generator.生物信息学中的工作流程:工作流程生成器的元分析与原型实现
BMC Bioinformatics. 2005 Apr 7;6:87. doi: 10.1186/1471-2105-6-87.
8
DIALIGN-T: an improved algorithm for segment-based multiple sequence alignment.DIALIGN-T:一种改进的基于片段的多序列比对算法。
BMC Bioinformatics. 2005 Mar 22;6:66. doi: 10.1186/1471-2105-6-66.
9
An alignment-free heuristic for fast sequence comparisons with applications to phylogeny reconstruction.一种基于比对的启发式算法,用于快速的序列比对,可应用于系统发育重建。
BMC Bioinformatics. 2020 Nov 18;21(Suppl 6):404. doi: 10.1186/s12859-020-03738-5.
10
BFAST: an alignment tool for large scale genome resequencing.BFAST:用于大规模基因组重测序的比对工具。
PLoS One. 2009 Nov 11;4(11):e7767. doi: 10.1371/journal.pone.0007767.

引用本文的文献

1
Chromosome-scale genome assembly and gene annotation of the hydrothermal vent annelid Alvinella pompejana yield insight into animal evolution in extreme environments.热液喷口环节动物庞贝蠕虫的染色体水平基因组组装和基因注释为极端环境中的动物进化提供了见解。
BMC Biol. 2025 Sep 2;23(1):274. doi: 10.1186/s12915-025-02369-7.
2
Chromosome-scale genome assembly of Helcystogramma triannulella (Lepidoptera: Gelechiidae).三纹潜蛾(鳞翅目:麦蛾科)的染色体水平基因组组装
Sci Data. 2025 Sep 1;12(1):1525. doi: 10.1038/s41597-025-05850-8.
3
Highly contiguous genome of the medicinal plant Sarcandra glabra (Thunb.) Nakai.

本文引用的文献

1
GeneWise and Genomewise.基因比对软件GeneWise和基因组比对软件Genomewise
Genome Res. 2004 May;14(5):988-95. doi: 10.1101/gr.1865504.
2
The Ensembl automatic gene annotation system.Ensembl自动基因注释系统。
Genome Res. 2004 May;14(5):942-50. doi: 10.1101/gr.1858004.
3
The DNA sequence and comparative analysis of human chromosome 20.人类20号染色体的DNA序列及比较分析。
药用植物九节龙(学名:Sarcandra glabra (Thunb.) Nakai)的高度连续基因组。
Sci Data. 2025 Aug 28;12(1):1508. doi: 10.1038/s41597-025-05796-x.
4
Mycobacterium hainanense sp. nov. represents an emerging nontuberculous Mycobacterium associated with chronic pulmonary disease.海南分枝杆菌新种是一种与慢性肺病相关的新出现的非结核分枝杆菌。
Sci Rep. 2025 Aug 28;15(1):31670. doi: 10.1038/s41598-025-16906-5.
5
Evaluating the Assembly Strategy of a Fungal Genome from Metagenomic Data: (Peltigerales, Ascomycota) as a Case Study.从宏基因组数据评估真菌基因组的组装策略:以(子囊菌门,胶膜菌目)为例进行研究
J Fungi (Basel). 2025 Aug 15;11(8):596. doi: 10.3390/jof11080596.
6
The near-complete genome assembly of provides insights into its origin, evolution, and the regulation of flavonoid biosynthesis.[具体物种名称]近乎完整的基因组组装为其起源、进化以及类黄酮生物合成的调控提供了见解。
Front Plant Sci. 2025 Aug 11;16:1580779. doi: 10.3389/fpls.2025.1580779. eCollection 2025.
7
A species-wide inventory of receptor-like kinases in Arabidopsis thaliana.拟南芥中类受体激酶的全物种清单。
BMC Biol. 2025 Aug 26;23(1):266. doi: 10.1186/s12915-025-02364-y.
8
Convergent Evolution in Amblyopsid Cavefishes and the Age of Eastern North American Subterranean Ecosystems.盲视洞穴鱼的趋同进化与北美东部地下生态系统的年代
Mol Biol Evol. 2025 Jul 30;42(8). doi: 10.1093/molbev/msaf185.
9
Convergent evolution through independent rearrangements in the primate amylase locus.灵长类动物淀粉酶基因座通过独立重排的趋同进化。
bioRxiv. 2025 Aug 15:2025.08.14.670395. doi: 10.1101/2025.08.14.670395.
10
Evolutionary Genomics of Gene Families: A Case Study of Insect Gustatory Receptors.基因家族的进化基因组学:以昆虫味觉受体为例的研究
Methods Mol Biol. 2025;2935:179-209. doi: 10.1007/978-1-0716-4583-3_8.
Nature. 2001;414(6866):865-71. doi: 10.1038/414865a.
4
SSAHA: a fast search method for large DNA databases.SSAHA:一种用于大型DNA数据库的快速搜索方法。
Genome Res. 2001 Oct;11(10):1725-9. doi: 10.1101/gr.194201.
5
MPBLAST : improved BLAST performance with multiplexed queries.MPBLAST:通过多路复用查询提高BLAST性能。
Bioinformatics. 2000 Nov;16(11):1052-3. doi: 10.1093/bioinformatics/16.11.1052.
6
The language of RNA: a formal grammar that includes pseudoknots.RNA的语言:一种包含假结的形式语法。
Bioinformatics. 2000 Apr;16(4):334-40. doi: 10.1093/bioinformatics/16.4.334.
7
Comparative analysis of noncoding regions of 77 orthologous mouse and human gene pairs.77对直系同源小鼠和人类基因对非编码区的比较分析。
Genome Res. 1999 Sep;9(9):815-24. doi: 10.1101/gr.9.9.815.
8
Alignment of whole genomes.全基因组比对
Nucleic Acids Res. 1999 Jun 1;27(11):2369-76. doi: 10.1093/nar/27.11.2369.
9
A computer program for aligning a cDNA sequence with a genomic DNA sequence.一种用于将互补DNA(cDNA)序列与基因组DNA序列进行比对的计算机程序。
Genome Res. 1998 Sep;8(9):967-74. doi: 10.1101/gr.8.9.967.
10
Generalized affine gap costs for protein sequence alignment.用于蛋白质序列比对的广义仿射空位罚分
Proteins. 1998 Jul 1;32(1):88-96.