Over- and underrepresentation of short DNA words in herpesvirus genomes.

Authors

Leung M Y, Marsh G M, Speed T P

Affiliation

Division of Mathematics and Statistics, University of Texas at San Antonio 78249, USA.

Publication

J Comput Biol. 1996 Fall;3(3):345-60. doi: 10.1089/cmb.1996.3.345.

Abstract

The relative abundance and rarity of DNA words have been recognized in previous biological studies to have implications for the regulation, repair, and evolutionary mechanisms of a genome. In this paper, we review several different measures of abundance and rarity of DNA words, including z-scores, representation ratios, and cross-ratios, that have appeared in the recent literature, and examine the concordance among them using the human cytomegalovirus genome sequence. We then rank all words of length k = 2, ..., 5 of seven herpesvirus genomes according to their abundance, as measured by one of the z-scores based upon a stationary Markov model of order k-2. Using a simple metric on the ranks of 2-words of the seven herpesvirus sequences, we construct an evolutionary tree. Several 3-words are observed to be consistently over- or underrepresented in all seven herpesviruses. Furthermore, clusters of some of the most over- and underrepresented 4- and 5-words in the genomes are identified with functional sites such as the origins of replication and regulatory signals of individual viruses.
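To make the measures named in the abstract concrete, the following is a minimal sketch of a representation ratio and a z-score for k-words under a Markov model of order k-2. It is not the authors' implementation: the abstract does not say which of the reviewed z-scores is used for the ranking, so the variance below is a commonly used approximation for maximal-order Markov models and is an assumption here. The function names `count_words` and `markov_word_scores` are hypothetical, and the sketch scores a single strand with overlapping counts.

```python
from collections import Counter
from math import sqrt


def count_words(seq, k):
    """Count overlapping occurrences of every k-word in seq."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def markov_word_scores(seq, k):
    """Score every k-word observed in seq against a Markov model of order k-2.

    Returns {word: (observed, expected, ratio, z)}.
    Expected count: N(w1..w_{k-1}) * N(w2..wk) / N(w2..w_{k-1}).
    For k = 2 the inner word is empty, so the sequence length is used
    in the denominator (independent-bases model).
    The variance is an ASSUMED approximation, not taken from the paper;
    z is None when that variance is zero.
    Words that never occur in seq are not scored by this sketch.
    """
    if k < 2:
        raise ValueError("k must be at least 2")
    counts_k = count_words(seq, k)
    counts_km1 = count_words(seq, k - 1)
    counts_km2 = count_words(seq, k - 2) if k > 2 else None

    scores = {}
    for word, observed in counts_k.items():
        n_prefix = counts_km1[word[:-1]]   # count of w1..w_{k-1}
        n_suffix = counts_km1[word[1:]]    # count of w2..wk
        n_core = counts_km2[word[1:-1]] if k > 2 else len(seq)
        expected = n_prefix * n_suffix / n_core
        ratio = observed / expected        # representation ratio
        variance = (expected * (n_core - n_prefix)
                    * (n_core - n_suffix) / n_core ** 2)
        z = (observed - expected) / sqrt(variance) if variance > 0 else None
        scores[word] = (observed, expected, ratio, z)
    return scores


def rank_words(seq, k):
    """Rank observed k-words from most over- to most underrepresented by z."""
    scored = [(w, s[3]) for w, s in markov_word_scores(seq, k).items()
              if s[3] is not None]
    return sorted(scored, key=lambda item: item[1], reverse=True)
```

Calling `rank_words(genome, 2)` on each genome would give per-genome rankings of 2-words. How those rankings are compared to build the tree is not specified in the abstract, so it is left out of this sketch.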
