• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用6个核苷酸长度的单词来研究不同生物体基因序列的复杂性。

Use of 6 Nucleotide Length Words to Study the Complexity of Gene Sequences from Different Organisms.

作者信息

Korotkov Eugene, Zaytsev Konstantin, Fedorov Alexey

机构信息

Institute of Bioengineering, Federal Research Center of Biotechnology of the Russian Academy of Sciences, 119071 Moscow, Russia.

Bach Institute of Biochemistry, Research Center of Biotechnology of the Russian Academy of Sciences, 119071 Moscow, Russia.

出版信息

Entropy (Basel). 2022 Apr 30;24(5):632. doi: 10.3390/e24050632.

DOI:10.3390/e24050632
PMID:35626518
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9141341/
Abstract

In this paper, we attempted to find a relation between bacteria living conditions and their genome algorithmic complexity. We developed a probabilistic mathematical method for the evaluation of k-words (6 bases length) occurrence irregularity in bacterial gene coding sequences. For this, the coding sequences from different bacterial genomes were analyzed and as an index of k-words occurrence irregularity, we used W, which has a distribution similar to normal. The research results for bacterial genomes show that they can be divided into two uneven groups. First, the smaller one has in the interval from 170 to 475, while for the second it is from 475 to 875. Plants, metazoan and virus genomes also have in the same interval as the first bacterial group. We suggested that second bacterial group coding sequences are much less susceptible to evolutionary changes than the first group ones. It is also discussed to use the index as a biological stress value.

摘要

在本文中,我们试图找出细菌生存条件与其基因组算法复杂性之间的关系。我们开发了一种概率数学方法,用于评估细菌基因编码序列中k字(6个碱基长度)出现的不规则性。为此,我们分析了来自不同细菌基因组的编码序列,并使用W作为k字出现不规则性的指标,W具有类似于正态分布的分布。细菌基因组的研究结果表明,它们可以分为两个不均衡的组。第一组较小,W值在170至475之间,而第二组的W值在475至875之间。植物、后生动物和病毒基因组的W值也与第一组细菌处于相同区间。我们认为,第二组细菌的编码序列比第一组细菌的编码序列对进化变化的敏感性要低得多。我们还讨论了将W指标用作生物应激值。

相似文献

1
Use of 6 Nucleotide Length Words to Study the Complexity of Gene Sequences from Different Organisms.使用6个核苷酸长度的单词来研究不同生物体基因序列的复杂性。
Entropy (Basel). 2022 Apr 30;24(5):632. doi: 10.3390/e24050632.
2
MRF: a tool to overcome the barrier of inconsistent genome annotations and perform comparative genomics studies for the largest animal DNA virus.MRF:一种克服不一致的基因组注释障碍并进行最大动物 DNA 病毒比较基因组学研究的工具。
Virol J. 2023 Apr 18;20(1):72. doi: 10.1186/s12985-023-02035-w.
3
Gene transfer and nucleotide sequence evolution by Gossypium cytoplasmic genomes indicates novel evolutionary characteristics.棉属细胞质基因组的基因转移和核苷酸序列进化表明了新的进化特征。
Plant Cell Rep. 2020 Jun;39(6):765-777. doi: 10.1007/s00299-020-02529-9. Epub 2020 Mar 25.
4
[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].[通过新型人类基因的电子克隆和实验验证对NCBI人类基因数据库中出现的模型参考序列的一些错误进行分析、鉴定和校正]
Yi Chuan Xue Bao. 2004 May;31(5):431-43.
5
[Evolution of non-coding nucleotide sequences in Newcastle disease virus genomes ].[新城疫病毒基因组中非编码核苷酸序列的进化]
Wei Sheng Wu Xue Bao. 2014 Sep 4;54(9):1073-81.
6
Occurrence and genetic diversity of prophage sequences identified in the genomes of L. casei group bacteria.在干酪乳杆菌群细菌的基因组中鉴定出噬菌体序列的发生和遗传多样性。
Sci Rep. 2023 May 26;13(1):8603. doi: 10.1038/s41598-023-35823-z.
7
Constraint on di-nucleotides by codon usage bias in bacterial genomes.细菌基因组中密码子使用偏好对二核苷酸的限制。
Gene. 2014 Feb 15;536(1):18-28. doi: 10.1016/j.gene.2013.11.098. Epub 2013 Dec 11.
8
Origin of the Y genome in Elymus and its relationship to other genomes in Triticeae based on evidence from elongation factor G (EF-G) gene sequences.基于伸长因子 G(EF-G)基因序列的证据,探讨了 Y 基因组在披碱草属中的起源及其与小麦族其他基因组的关系。
Mol Phylogenet Evol. 2010 Aug;56(2):727-33. doi: 10.1016/j.ympev.2010.03.037. Epub 2010 Apr 2.
9
Optimization of Mutation Pressure in Relation to Properties of Protein-Coding Sequences in Bacterial Genomes.细菌基因组中与蛋白质编码序列特性相关的突变压力优化
PLoS One. 2015 Jun 29;10(6):e0130411. doi: 10.1371/journal.pone.0130411. eCollection 2015.
10
Compositional correlation studies among the three different codon positions in 12 bacterial genomes.12个细菌基因组中三个不同密码子位置之间的组成相关性研究。
Biochem Biophys Res Commun. 1999 Dec 9;266(1):66-71. doi: 10.1006/bbrc.1999.1774.

引用本文的文献

1
Bioinformatics tools for the sequence complexity estimates.用于序列复杂性估计的生物信息学工具。
Biophys Rev. 2023 Sep 15;15(5):1367-1378. doi: 10.1007/s12551-023-01140-y. eCollection 2023 Oct.

本文引用的文献

1
Next-Generation Sequencing in Newborn Screening: A Review of Current State.新生儿筛查中的下一代测序:现状综述
Front Genet. 2021 May 26;12:662254. doi: 10.3389/fgene.2021.662254. eCollection 2021.
2
Detection of Highly Divergent Tandem Repeats in the Rice Genome.检测水稻基因组中的高度变异串联重复序列。
Genes (Basel). 2021 Mar 25;12(4):473. doi: 10.3390/genes12040473.
3
Multiple Alignment of Promoter Sequences from the L. Genome.从 L. 基因组中启动子序列的多重比对。
Genes (Basel). 2021 Jan 21;12(2):135. doi: 10.3390/genes12020135.
4
Antimicrobial Susceptibility Pattern of and Isolates.[具体细菌名称]和[具体细菌名称]分离株的抗菌药敏模式
Microorganisms. 2020 Jun 25;8(6):957. doi: 10.3390/microorganisms8060957.
5
Genome Sequence of " Nitrosocosmicus franklandus" C13, a Terrestrial Ammonia-Oxidizing Archaeon.陆地氨氧化古菌“弗兰克兰德亚硝化宇宙菌”C13的基因组序列
Microbiol Resour Announc. 2019 Oct 3;8(40):e00435-19. doi: 10.1128/MRA.00435-19.
6
Whole genome sequencing and function prediction of 133 gut anaerobes isolated from chicken caecum in pure cultures.纯培养物中分离的 133 株鸡盲肠肠道厌氧菌的全基因组测序和功能预测。
BMC Genomics. 2018 Jul 31;19(1):561. doi: 10.1186/s12864-018-4959-4.
7
The reconstruction of 2,631 draft metagenome-assembled genomes from the global oceans.重建来自全球海洋的 2631 个宏基因组组装基因组。
Sci Data. 2018 Jan 16;5:170203. doi: 10.1038/sdata.2017.203.
8
Alkaliphilus namsaraevii sp. nov., an alkaliphilic iron- and sulfur-reducing bacterium isolated from a steppe soda lake.纳姆萨拉耶夫嗜碱菌新种,一种从草原苏打湖分离出的嗜碱铁还原和硫还原细菌。
Int J Syst Evol Microbiol. 2017 Jun;67(6):1990-1995. doi: 10.1099/ijsem.0.001904. Epub 2017 Jun 20.
9
Database of Periodic DNA Regions in Major Genomes.主要基因组中周期性DNA区域数据库。
Biomed Res Int. 2017;2017:7949287. doi: 10.1155/2017/7949287. Epub 2017 Jan 15.
10
Evolutionary dynamics of selfish DNA explains the abundance distribution of genomic subsequences.自私DNA的进化动力学解释了基因组子序列的丰度分布。
Sci Rep. 2016 Aug 4;6:30851. doi: 10.1038/srep30851.