• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

应用于葫芦科DNA序列的超统计学

Superstatistics Applied to Cucurbitaceae DNA Sequences.

作者信息

Costa M O, Silva R, de Lima M M F, Anselmo D H A L

机构信息

Departamento de Física, Universidade Federal do Rio Grande do Norte, Natal 59072-970, Brazil.

Departamento de Física, Universidade do Estado do Rio Grande do Norte, Mossoró 59610-210, Brazil.

出版信息

Entropy (Basel). 2024 Sep 25;26(10):819. doi: 10.3390/e26100819.

DOI:10.3390/e26100819
PMID:39451896
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11507824/
Abstract

The short and long statistical correlations are essential in the genomic sequence. Such correlations are long-range for introns, whereas, for exons, these are short. In this study, we employed superstatistics to investigate correlations and fluctuations in the distribution of nucleotide sequence lengths of the family. We established a time series for exon sizes to probe these correlations and fluctuations. We used data from the National Center for Biotechnology Information (NCBI) gene database to extract the temporal evolution of exon sizes, measured in terms of the number of base pairs (bp). To assess the model's viability, we utilized a timescale extraction method to determine the statistical properties of our time series, including the local distribution and fluctuations, which provide the exon size distributions based on the -Gamma and inverse -Gamma distributions. From the Bayesian statistics standpoint, both distributions are excellent for capturing the correlations and fluctuations from the data.

摘要

短程和长程统计相关性在基因组序列中至关重要。对于内含子而言,这种相关性是长程的,而对于外显子来说,这些相关性是短程的。在本研究中,我们采用超统计学来研究该家族核苷酸序列长度分布中的相关性和涨落。我们建立了外显子大小的时间序列以探究这些相关性和涨落。我们使用来自美国国立生物技术信息中心(NCBI)基因数据库的数据来提取以碱基对(bp)数量衡量的外显子大小的时间演化。为了评估模型的可行性,我们利用一种时间尺度提取方法来确定我们时间序列的统计特性,包括局部分布和涨落,这些基于伽马分布和逆伽马分布提供外显子大小分布。从贝叶斯统计学的角度来看,这两种分布都非常适合从数据中捕捉相关性和涨落。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/206d/11507824/e986a1e5604f/entropy-26-00819-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/206d/11507824/706a4b84512b/entropy-26-00819-g0A1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/206d/11507824/cf8ce5f67039/entropy-26-00819-g0A2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/206d/11507824/43d793e9093d/entropy-26-00819-g0A3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/206d/11507824/942a8566d5a9/entropy-26-00819-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/206d/11507824/d0fc3768c3c2/entropy-26-00819-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/206d/11507824/a0c706422555/entropy-26-00819-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/206d/11507824/a7c48da019df/entropy-26-00819-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/206d/11507824/b28ef5a4bd59/entropy-26-00819-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/206d/11507824/e986a1e5604f/entropy-26-00819-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/206d/11507824/706a4b84512b/entropy-26-00819-g0A1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/206d/11507824/cf8ce5f67039/entropy-26-00819-g0A2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/206d/11507824/43d793e9093d/entropy-26-00819-g0A3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/206d/11507824/942a8566d5a9/entropy-26-00819-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/206d/11507824/d0fc3768c3c2/entropy-26-00819-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/206d/11507824/a0c706422555/entropy-26-00819-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/206d/11507824/a7c48da019df/entropy-26-00819-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/206d/11507824/b28ef5a4bd59/entropy-26-00819-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/206d/11507824/e986a1e5604f/entropy-26-00819-g006.jpg

相似文献

1
Superstatistics Applied to Cucurbitaceae DNA Sequences.应用于葫芦科DNA序列的超统计学
Entropy (Basel). 2024 Sep 25;26(10):819. doi: 10.3390/e26100819.
2
Superstatistical and DNA sequence coding of the human genome.人类基因组的超统计和 DNA 序列编码。
Phys Rev E. 2022 Dec;106(6-1):064407. doi: 10.1103/PhysRevE.106.064407.
3
The evolution of proteins from random amino acid sequences: II. Evidence from the statistical distributions of the lengths of modern protein sequences.蛋白质从随机氨基酸序列的进化:II. 来自现代蛋白质序列长度统计分布的证据。
J Mol Evol. 1994 Apr;38(4):383-94. doi: 10.1007/BF00163155.
4
Spectral fluctuations of billiards with mixed dynamics: from time series to superstatistics.具有混合动力学的台球的谱涨落:从时间序列到超统计
Phys Rev E Stat Nonlin Soft Matter Phys. 2008 Apr;77(4 Pt 2):046202. doi: 10.1103/PhysRevE.77.046202. Epub 2008 Apr 3.
5
Log-amplitude statistics for Beck-Cohen superstatistics.贝克-科恩超统计的对数振幅统计
Phys Rev E Stat Nonlin Soft Matter Phys. 2013 May;87(5):052104. doi: 10.1103/PhysRevE.87.052104. Epub 2013 May 6.
6
Nonthermal and suprathermal distributions as a consequence of superstatistics.作为超统计结果的非热分布和超热分布。
Phys Rev E Stat Nonlin Soft Matter Phys. 2015 Jan;91(1):012133. doi: 10.1103/PhysRevE.91.012133. Epub 2015 Jan 20.
7
Structure of vertebrate genes: a statistical analysis implicating selection.脊椎动物基因的结构:一项涉及选择的统计分析
J Mol Evol. 1988;27(1):45-55. doi: 10.1007/BF02099729.
8
Fluctuations of water quality time series in rivers follow superstatistics.河流中水质时间序列的波动遵循超统计规律。
iScience. 2021 Jul 21;24(8):102881. doi: 10.1016/j.isci.2021.102881. eCollection 2021 Aug 20.
9
Determination of eukaryotic protein coding regions using neural networks and information theory.使用神经网络和信息论确定真核生物蛋白质编码区域
J Mol Biol. 1992 Jul 20;226(2):471-9. doi: 10.1016/0022-2836(92)90961-i.
10
Distribution bias of the sequence matching between exons and introns in exon joint and EJC binding region in C. elegans.秀丽隐杆线虫中外显子连接和外显子连接复合体(EJC)结合区域中外显子与内含子序列匹配的分布偏差
J Theor Biol. 2015 Jan 7;364:295-304. doi: 10.1016/j.jtbi.2014.09.009. Epub 2014 Sep 16.

本文引用的文献

1
Superstatistical and DNA sequence coding of the human genome.人类基因组的超统计和 DNA 序列编码。
Phys Rev E. 2022 Dec;106(6-1):064407. doi: 10.1103/PhysRevE.106.064407.
2
A Bayesian Analysis of Plant DNA Length Distribution via -Statistics.通过-统计对植物DNA长度分布进行贝叶斯分析。
Entropy (Basel). 2022 Sep 1;24(9):1225. doi: 10.3390/e24091225.
3
Generalized statistical mechanics of stellar systems.恒星系统的广义统计力学。
Phys Rev E. 2022 Jun;105(6-1):064108. doi: 10.1103/PhysRevE.105.064108.
4
Database resources of the national center for biotechnology information.国家生物技术信息中心数据库资源。
Nucleic Acids Res. 2022 Jan 7;50(D1):D20-D26. doi: 10.1093/nar/gkab1112.
5
Fingerprints of nonequilibrium stationary distributions in dispersion relations.色散关系中非平衡稳态分布的指纹图谱。
Sci Rep. 2021 Jun 8;11(1):12103. doi: 10.1038/s41598-021-91455-1.
6
Superstatistical two-temperature Ising model.超统计双温度伊辛模型。
Phys Rev E. 2021 Mar;103(3-1):032104. doi: 10.1103/PhysRevE.103.032104.
7
Superstatistical modelling of protein diffusion dynamics in bacteria.细菌中蛋白质扩散动力学的超统计建模
J R Soc Interface. 2021 Mar;18(176):20200927. doi: 10.1098/rsif.2020.0927. Epub 2021 Mar 3.
8
Single-particle velocity distributions of collisionless, steady-state plasmas must follow superstatistics.无碰撞、稳态等离子体的单粒子速度分布必须遵循超统计。
Phys Rev E. 2019 Aug;100(2-1):023205. doi: 10.1103/PhysRevE.100.023205.
9
Origin and domestication of Cucurbitaceae crops: insights from phylogenies, genomics and archaeology.葫芦科作物的起源与驯化:系统发育学、基因组学和考古学的见解
New Phytol. 2020 Jun;226(5):1240-1255. doi: 10.1111/nph.16015. Epub 2019 Aug 1.
10
Analysis of human DNA through power-law statistics.通过幂律统计分析人类 DNA。
Phys Rev E. 2019 Feb;99(2-1):022112. doi: 10.1103/PhysRevE.99.022112.