Suppr超能文献

寻找人类基因组基因间区域和基因区域的关键词。

Finding keywords for intergenic and gene regions for human genome.

作者信息

Qiao Y H, Liu J L, Zhang C G, Zeng Yanjun

机构信息

Biomechanics and Medical Information Institute, Beijing University of Technology, Beijing, China.

出版信息

Nucleosides Nucleotides Nucleic Acids. 2005;24(3):191-8. doi: 10.1081/NCN-55714.

Abstract

The analysis of functionally related sequences for conserved patterns is important for further research of different functional regions. This paper presents an analysis of genes and intergenic sequences from the point of view of linguistics analysis, where gene and intergenic regions are regarded as two different subjects written in the four-letter alphabet [A, C, G, T] and high-frequency simple sequences are taken as keywords. A measurement alpha[l(tau)] was introduced to describe the relative repeat ratio of simple sequences. Cutoff values were found for keywords selection. After eliminating "noise," 87 short sequences were selected as keywords for intergenic regions and 76 for gene regions.

摘要

对功能相关序列的保守模式进行分析对于进一步研究不同功能区域很重要。本文从语言学分析的角度对基因和基因间序列进行了分析,其中基因和基因间区域被视为用四字母字母表[A、C、G、T]书写的两个不同主题,高频简单序列被视为关键词。引入了一个测量值alpha[l(tau)]来描述简单序列的相对重复率。找到了关键词选择的截止值。在消除“噪声”后,选择了87个短序列作为基因间区域的关键词,76个作为基因区域的关键词。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验