Suppr超能文献

序列景观

Sequence landscapes.

作者信息

Clift B, Haussler D, McConnell R, Schneider T D, Stormo G D

出版信息

Nucleic Acids Res. 1986 Jan 10;14(1):141-58. doi: 10.1093/nar/14.1.141.

Abstract

We describe a method for representing the structure of repeating sequences in nucleic-acids, proteins and other texts. A portion of the sequence is presented at the bottom of a CRT screen. Above the sequence is its landscape, which looks like a mountain range. Each mountain corresponds to a subsequence of the sequence. At the peak of every mountain is written the number of times that the subsequence appears. A data structure called a DAWG, which can be built in time proportional to the length of the sequence, is used to construct the landscape. For the 40 thousand bases of bacteriophage T7, the DAWG can be built in 30 seconds. The time to display any portion of the landscape is less than a second. Using sequence landscapes, one can quickly locate significant repeats.

摘要

我们描述了一种用于表示核酸、蛋白质及其他文本中重复序列结构的方法。序列的一部分显示在阴极射线管(CRT)屏幕底部。序列上方是其景观图,看起来像山脉。每座山对应序列的一个子序列。在每座山的山顶写着该子序列出现的次数。一种名为有向无环字图(DAWG)的数据结构可用于构建景观图,构建时间与序列长度成正比。对于噬菌体T7的4万个碱基,构建DAWG只需30秒。显示景观图任何部分的时间不到一秒。使用序列景观图,人们可以快速定位重要的重复序列。

相似文献

1
Sequence landscapes.序列景观
Nucleic Acids Res. 1986 Jan 10;14(1):141-58. doi: 10.1093/nar/14.1.141.
3
Analysis of the occurrence of promoter-sites in DNA.DNA中启动子位点出现情况的分析。
Nucleic Acids Res. 1986 Jan 10;14(1):109-26. doi: 10.1093/nar/14.1.109.

引用本文的文献

1
Mem-based pangenome indexing for k-mer queries.用于k-mer查询的基于内存的泛基因组索引
Algorithms Mol Biol. 2025 Mar 1;20(1):3. doi: 10.1186/s13015-025-00272-y.
2
MEM-based pangenome indexing for -mer queries.基于MEM的用于k-mer查询的泛基因组索引
bioRxiv. 2024 May 22:2024.05.20.595044. doi: 10.1101/2024.05.20.595044.
7
DNA sequences at a glance.一眼看清 DNA 序列。
PLoS One. 2013 Nov 21;8(11):e79922. doi: 10.1371/journal.pone.0079922. eCollection 2013.
9
Profile of David Haussler.大卫·豪斯勒简介。
Proc Natl Acad Sci U S A. 2008 Sep 23;105(38):14251-3. doi: 10.1073/pnas.0808284105. Epub 2008 Sep 17.

本文引用的文献

2
Adsorption complex of filamentous fd virus.丝状fd病毒的吸附复合物
J Mol Biol. 1981 Mar 15;146(4):621-7. doi: 10.1016/0022-2836(81)90050-4.
4
Delila system tools.德利拉系统工具。
Nucleic Acids Res. 1984 Jan 11;12(1 Pt 1):129-40. doi: 10.1093/nar/12.1part1.129.
8
Nucleotide sequence of bacteriophage fd DNA.噬菌体fd DNA的核苷酸序列。
Nucleic Acids Res. 1978 Dec;5(12):4495-503. doi: 10.1093/nar/5.12.4495.
9
The nucleotide sequence of bacteriophage phiX174.噬菌体φX174的核苷酸序列。
J Mol Biol. 1978 Oct 25;125(2):225-46. doi: 10.1016/0022-2836(78)90346-7.
10
Nucleotide sequence of bacteriophage G4 DNA.噬菌体G4 DNA的核苷酸序列。
Nature. 1978 Nov 16;276(5685):236-47. doi: 10.1038/276236a0.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验