Analysis of genomic sequences by Chaos Game Representation.

Authors

Almeida J S, Carriço J A, Maretzek A, Noble P A, Fletcher M

Affiliation

ITQB/Universidade Nova Lisboa, PO Box 127, 2780 Oeiras, Portugal.

Publication

Bioinformatics. 2001 May;17(5):429-37. doi: 10.1093/bioinformatics/17.5.429.

DOI:10.1093/bioinformatics/17.5.429
PMID:11331237
Abstract

MOTIVATION

Chaos Game Representation (CGR) is an iterative mapping technique that processes sequences of units, such as nucleotides in a DNA sequence or amino acids in a protein, to find the coordinates of their positions in a continuous space. This distribution of positions has two properties: it is unique, and the source sequence can be recovered from the coordinates such that the distance between positions measures the similarity between the corresponding sequences. The possibility of using the latter property to identify succession schemes has been entirely overlooked in previous studies, which raises the possibility that CGR may be upgraded from a mere representation technique to a sequence modeling tool.
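
The iterative mapping and the two properties described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the corner assignment in `CORNERS`, the starting point at the centre of the unit square, and the helper names `cgr_points`, `decode` and `chebyshev` are assumptions made for illustration. Exact fractions are used so that the sequence can be recovered without rounding error.

```python
from fractions import Fraction

# Assumed convention: one corner of the unit square per nucleotide.
CORNERS = {"A": (0, 0), "C": (0, 1), "G": (1, 1), "T": (1, 0)}
HALF = Fraction(1, 2)


def cgr_points(seq):
    """Return the CGR coordinate reached after each symbol of seq."""
    x, y = HALF, HALF  # assumed start: centre of the square
    points = []
    for base in seq:
        cx, cy = CORNERS[base]
        # Move halfway from the current point towards the symbol's corner.
        x, y = (x + cx) / 2, (y + cy) / 2
        points.append((x, y))
    return points


def decode(point, length):
    """Recover the last `length` symbols that led to `point`."""
    corner_to_base = {corner: base for base, corner in CORNERS.items()}
    x, y = point
    symbols = []
    for _ in range(length):
        # The quadrant containing the point identifies the last symbol.
        cx, cy = int(x > HALF), int(y > HALF)
        symbols.append(corner_to_base[(cx, cy)])
        # Undo one halving step to reach the previous point.
        x, y = 2 * x - cx, 2 * y - cy
    return "".join(reversed(symbols))


def chebyshev(p, q):
    """Maximum coordinate difference between two CGR points."""
    return max(abs(p[0] - q[0]), abs(p[1] - q[1]))


if __name__ == "__main__":
    last = cgr_points("GATTACA")[-1]
    print(last, decode(last, 7))
    a = cgr_points("ACATTGCA")[-1]
    b = cgr_points("GGGTTGCA")[-1]
    print(chebyshev(a, b))
```

In this sketch, two sequences that share their last k symbols end up less than 2^-k apart in each coordinate, which is one way to read the statement that distance between positions measures sequence similarity.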

RESULTS

The distribution of positions in the CGR plane was shown to be a generalization of Markov chain probability tables that accommodates non-integer orders. Therefore, Markov models are particular cases of CGR models rather than the reverse, as currently accepted. In addition, the CGR generalization has both practical (computational efficiency) and fundamental (scale independence) advantages. These results are illustrated by using Escherichia coli K-12 as a test data set, in particular the genes thrA, thrB and thrC of the threonine operon.
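
One way to see the link between the CGR plane and Markov chain tables is to bin the CGR points on a 2^k by 2^k grid: each cell then collects exactly the positions whose last k symbols form one particular k-mer. The sketch below checks this on a toy sequence. It is an illustration under assumptions, not the paper's procedure: the corner assignment, the centre starting point, the toy sequence, and the helper names (`cell`, `cgr_cell_counts`, `kmer_counts`) are all hypothetical.

```python
from collections import Counter
from fractions import Fraction

# Assumed convention: one corner of the unit square per nucleotide.
CORNERS = {"A": (0, 0), "C": (0, 1), "G": (1, 1), "T": (1, 0)}


def cgr_points(seq):
    """Return the CGR coordinate reached after each symbol of seq."""
    x = y = Fraction(1, 2)
    points = []
    for base in seq:
        cx, cy = CORNERS[base]
        x, y = (x + cx) / 2, (y + cy) / 2
        points.append((x, y))
    return points


def cell(point, k):
    """Index of the grid cell containing `point` on a 2^k by 2^k grid."""
    n = 2 ** k
    return (int(point[0] * n), int(point[1] * n))


def cgr_cell_counts(seq, k):
    """Count CGR points per cell, skipping points with fewer than k symbols."""
    return Counter(cell(p, k) for p in cgr_points(seq)[k - 1:])


def kmer_counts(seq, k):
    """Count overlapping k-mers directly from the string."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


if __name__ == "__main__":
    toy = "ATGCGATTACAGGCATGCGA"  # arbitrary toy sequence, not real data
    k = 3
    direct = kmer_counts(toy, k)
    # Map each k-mer to the cell its own CGR end point falls in.
    by_cell = Counter({cell(cgr_points(w)[-1], k): c for w, c in direct.items()})
    print(by_cell == cgr_cell_counts(toy, k))
```

Dividing the count of a k-mer by the count of its leading (k-1)-mer gives the transition probabilities of an order k-1 Markov chain. Grid resolutions that are not powers of two have no such k-mer counterpart, which is a plausible reading of the non-integer orders mentioned above.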

Similar articles

1
Analysis of genomic sequences by Chaos Game Representation.
Bioinformatics. 2001 May;17(5):429-37. doi: 10.1093/bioinformatics/17.5.429.
2
A probabilistic measure for alignment-free sequence comparison.
Bioinformatics. 2004 Dec 12;20(18):3455-61. doi: 10.1093/bioinformatics/bth426. Epub 2004 Jul 22.
3
Chaos game representation for comparison of whole genomes.
BMC Bioinformatics. 2006 May 5;7:243. doi: 10.1186/1471-2105-7-243.
4
Encoding and Decoding DNA Sequences by Integer Chaos Game Representation.
J Comput Biol. 2019 Feb;26(2):143-151. doi: 10.1089/cmb.2018.0173. Epub 2018 Dec 5.
5
Universal sequence map (USM) of arbitrary discrete sequences.
BMC Bioinformatics. 2002;3:6. doi: 10.1186/1471-2105-3-6. Epub 2002 Feb 5.
6
[Multifractal analysis of genomes sequences' CGR graph].
Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2007 Jun;24(3):522-5.
7
Numerical encoding of DNA sequences by chaos game representation with application in similarity comparison.
Genomics. 2016 Oct;108(3-4):134-142. doi: 10.1016/j.ygeno.2016.08.002. Epub 2016 Aug 15.
8
Identifying anticancer peptides by using a generalized chaos game representation.
J Math Biol. 2019 Jan;78(1-2):441-463. doi: 10.1007/s00285-018-1279-x. Epub 2018 Oct 5.
9
Efficient Boolean implementation of universal sequence maps (bUSM).
BMC Bioinformatics. 2002 Oct 21;3:28. doi: 10.1186/1471-2105-3-28.
10
Similarity analysis for DNA sequences based on chaos game representation. Case study: the albumin.
J Theor Biol. 2010 Dec 21;267(4):513-8. doi: 10.1016/j.jtbi.2010.09.027. Epub 2010 Sep 28.

Cited by

1
WaveSeekerNet: accurate prediction of influenza A virus subtypes and host source using attention-based deep learning.
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf089.
2
PCVR: a pre-trained contextualized visual representation for DNA sequence classification.
BMC Bioinformatics. 2025 May 9;26(1):125. doi: 10.1186/s12859-025-06136-x.
3
Multifractal analysis and support vector machine for the classification of coronaviruses and SARS-CoV-2 variants.
Sci Rep. 2025 Apr 29;15(1):15041. doi: 10.1038/s41598-025-98366-5.
4
Efficient Storage and Analysis of Genomic Data: A k-mer Frequency Mapping and Image Representation Method.
Interdiscip Sci. 2024 Oct 21. doi: 10.1007/s12539-024-00659-2.
5
Exploring geometry of genome space via Grassmann manifolds.
Innovation (Camb). 2024 Jul 22;5(5):100677. doi: 10.1016/j.xinn.2024.100677. eCollection 2024 Sep 9.
6
On leveraging self-supervised learning for accurate HCV genotyping.
Sci Rep. 2024 Jul 5;14(1):15463. doi: 10.1038/s41598-024-64209-y.
7
Multifractal analysis of maize and soybean DNA.
Sci Rep. 2024 May 9;14(1):10687. doi: 10.1038/s41598-024-60722-2.
8
Predicting antimicrobial resistance in with discriminative position fused deep learning classifier.
Comput Struct Biotechnol J. 2023 Dec 29;23:559-565. doi: 10.1016/j.csbj.2023.12.041. eCollection 2024 Dec.
9
Using Chaos-Game-Representation for Analysing the SARS-CoV-2 Lineages, Newly Emerging Strains and Recombinants.
Curr Genomics. 2023 Nov 22;24(3):187-195. doi: 10.2174/0113892029264990231013112156.
10
Polarization- and Chaos-Game-Based Fingerprinting of Molecular Targets of Listeria Monocytogenes Vaccine and Fully Virulent Strains.
Curr Issues Mol Biol. 2023 Dec 13;45(12):10056-10078. doi: 10.3390/cimb45120628.