• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一般设计揭示了蛋白质编码和非编码人类 DNA 中的不同代码。

General Designs Reveal Distinct Codes in Protein-Coding and Non-Coding Human DNA.

机构信息

Ronin Institute, 127 Haddon Pl, Montclair, NJ 07043-2314, USA.

出版信息

Genes (Basel). 2022 Oct 28;13(11):1970. doi: 10.3390/genes13111970.

DOI:10.3390/genes13111970
PMID:36360206
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9690640/
Abstract

This study seeks to investigate distinct signatures and codes within different genomic sequence locations of the human genome. The promoter and other non-coding regions contain sites for the binding of biological particles, for processes such as transcription regulation. The specific rules and sequence codes that govern this remain poorly understood. To derive these (codes), the general designs of sequence are investigated. Genomic signatures are a powerful tool for assessing the general designs of sequence, and cross-comparing different genomic regions for their distinct sequence properties. Through these genomic signatures, the relative non-random properties of sequences are also assessed. Furthermore, a binary components analysis is carried out making use of information theory ideas, to study the RY (purine/pyrimidine), WS (weak/strong) and KM (keto/amino) signatures in the sequences. From this comparison, it is possible to identify the relative importance of these properties within the various protein-coding and non-coding genomic locations. The results show that coding DNA has a strongly non-random WS signature, which reflects the genetic code, and the hydrogen-bond base pairing of codon-anti-codon interactions. In contrast, non-coding locations, such as the promoter, contain a distinct genomic signature. A prominent feature throughout non-coding DNA is a highly non-random RY signature, which is very different in nature to coding DNA, and suggests a structural-based RY code. This marks progress towards deciphering the unknown code(s) in non-protein-coding DNA, and a further understanding of the coding DNA. Additionally, it unravels how DNA carries information. These findings have implications for the most fundamental principles of biology, including knowledge of gene regulation, development and disease.

摘要

本研究旨在探究人类基因组不同基因组序列位置中的独特特征和编码。启动子和其他非编码区域包含生物粒子结合的位点,用于转录调控等过程。这些过程的具体规则和序列编码仍知之甚少。为了得出这些(编码),研究了序列的一般设计。基因组特征是评估序列一般设计的有力工具,并对不同基因组区域进行比较,以研究其独特的序列特性。通过这些基因组特征,还评估了序列的相对非随机特性。此外,还利用信息论思想进行了二进制分量分析,以研究序列中的 RY(嘌呤/嘧啶)、WS(弱/强)和 KM(酮/氨基)特征。通过这种比较,可以确定这些特性在各种蛋白质编码和非编码基因组位置中的相对重要性。结果表明,编码 DNA 具有强烈的非随机 WS 特征,反映了遗传密码和密码子-反密码子相互作用的氢键碱基配对。相比之下,非编码位置,如启动子,包含独特的基因组特征。非编码 DNA 的一个突出特征是高度非随机的 RY 特征,其性质与编码 DNA 非常不同,这表明存在基于结构的 RY 编码。这标志着在非蛋白编码 DNA 中解码未知编码方面取得了进展,并进一步了解了编码 DNA。此外,它揭示了 DNA 如何携带信息。这些发现对生物学的最基本原则具有重要意义,包括对基因调控、发育和疾病的认识。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e77/9690640/789c38cd0801/genes-13-01970-g005a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e77/9690640/4975798af553/genes-13-01970-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e77/9690640/5177ad5214cc/genes-13-01970-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e77/9690640/9fa95a15a081/genes-13-01970-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e77/9690640/658b695b6691/genes-13-01970-g004a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e77/9690640/789c38cd0801/genes-13-01970-g005a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e77/9690640/4975798af553/genes-13-01970-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e77/9690640/5177ad5214cc/genes-13-01970-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e77/9690640/9fa95a15a081/genes-13-01970-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e77/9690640/658b695b6691/genes-13-01970-g004a.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4e77/9690640/789c38cd0801/genes-13-01970-g005a.jpg

相似文献

1
General Designs Reveal Distinct Codes in Protein-Coding and Non-Coding Human DNA.一般设计揭示了蛋白质编码和非编码人类 DNA 中的不同代码。
Genes (Basel). 2022 Oct 28;13(11):1970. doi: 10.3390/genes13111970.
2
[Analysis, identification and correction of some errors of model refseqs appeared in NCBI Human Gene Database by in silico cloning and experimental verification of novel human genes].[通过新型人类基因的电子克隆和实验验证对NCBI人类基因数据库中出现的模型参考序列的一些错误进行分析、鉴定和校正]
Yi Chuan Xue Bao. 2004 May;31(5):431-43.
3
The genetic code is nearly optimal for allowing additional information within protein-coding sequences.遗传密码对于在蛋白质编码序列中容纳额外信息而言几乎是最优的。
Genome Res. 2007 Apr;17(4):405-12. doi: 10.1101/gr.5987307. Epub 2007 Feb 9.
4
A computational screen for alternative genetic codes in over 250,000 genomes.对超过 25 万个基因组中的替代遗传密码进行计算筛选。
Elife. 2021 Nov 9;10:e71402. doi: 10.7554/eLife.71402.
5
Mutually symmetric and complementary triplets: differences in their use distinguish systematically between coding and non-coding genomic sequences.相互对称且互补的三联体:它们使用方式的差异在编码和非编码基因组序列之间进行系统区分。
J Theor Biol. 2003 Aug 21;223(4):477-87. doi: 10.1016/s0022-5193(03)00123-1.
6
TRX-LOGOS - a graphical tool to demonstrate DNA information content dependent upon backbone dynamics in addition to base sequence.TRX-LOGOS——一种图形工具,用于展示除碱基序列外还依赖于主链动力学的DNA信息内容。
Source Code Biol Med. 2015 Sep 25;10:10. doi: 10.1186/s13029-015-0040-8. eCollection 2015.
7
Energy mapping of the genetic code and genomic domains: implications for code evolution and molecular Darwinism.遗传密码和基因组域的能量图谱:对密码进化和分子达尔文主义的影响。
Q Rev Biophys. 2020 Nov 4;53:e11. doi: 10.1017/S0033583520000098.
8
Strong short-range correlations and dichotomic codon classes in coding DNA sequences.编码DNA序列中的强短程相关性和二分密码子类别
Phys Rev E Stat Nonlin Soft Matter Phys. 2008 Nov;78(5 Pt 1):051918. doi: 10.1103/PhysRevE.78.051918. Epub 2008 Nov 19.
9
Quantifying the species-specificity in genomic signatures, synonymous codon choice, amino acid usage and G+C content.量化基因组特征、同义密码子选择、氨基酸使用和G+C含量中的物种特异性。
Gene. 2003 Jun 5;311:35-42. doi: 10.1016/s0378-1119(03)00581-x.
10
A content-centric organization of the genetic code.一种以内容为中心的遗传密码组织方式。
Genomics Proteomics Bioinformatics. 2007 Feb;5(1):1-6. doi: 10.1016/S1672-0229(07)60008-4.

引用本文的文献

1
Profound Non-Randomness in Dinucleotide Arrangements within Ultra-Conserved Non-Coding Elements and the Human Genome.超保守非编码元件及人类基因组中双核苷酸排列的深度非随机性
Biology (Basel). 2023 Aug 12;12(8):1125. doi: 10.3390/biology12081125.

本文引用的文献

1
Functional Systems Derived from Nucleobase Self-assembly.基于核碱基自组装的功能系统。
ChemistryOpen. 2020 Apr 1;9(4):409-430. doi: 10.1002/open.201900363. eCollection 2020 Apr.
2
EPD in 2020: enhanced data visualization and extension to ncRNA promoters.EPD 于 2020 年:增强的数据可视化功能,并扩展至非编码 RNA 启动子。
Nucleic Acids Res. 2020 Jan 8;48(D1):D65-D69. doi: 10.1093/nar/gkz1014.
3
Quantifying local randomness in human DNA and RNA sequences using Erdös motifs.使用 Erdős 模式量化人类 DNA 和 RNA 序列中的局部随机性。
J Theor Biol. 2019 Jan 14;461:41-50. doi: 10.1016/j.jtbi.2018.09.031. Epub 2018 Oct 16.
4
Emerging Roles of Non-Coding RNA Transcription.非编码 RNA 转录的新兴作用
Trends Biochem Sci. 2018 Sep;43(9):654-667. doi: 10.1016/j.tibs.2018.06.002. Epub 2018 Jun 28.
5
GeneHancer: genome-wide integration of enhancers and target genes in GeneCards.基因增强子:基因卡片中增强子与靶基因的全基因组整合
Database (Oxford). 2017 Jan 1;2017. doi: 10.1093/database/bax028.
6
Non-coding RNAs as regulators in epigenetics (Review).非编码RNA作为表观遗传学中的调节因子(综述)。
Oncol Rep. 2017 Jan;37(1):3-9. doi: 10.3892/or.2016.5236. Epub 2016 Nov 8.
7
Establishing and validating regulatory regions for variant annotation and expression analysis.建立并验证用于变异注释和表达分析的调控区域。
BMC Genomics. 2016 Jun 23;17 Suppl 2(Suppl 2):393. doi: 10.1186/s12864-016-2724-0.
8
Chromatin remodeling effects on enhancer activity.染色质重塑对增强子活性的影响。
Cell Mol Life Sci. 2016 Aug;73(15):2897-910. doi: 10.1007/s00018-016-2184-3. Epub 2016 Mar 30.
9
DNA structure and function.DNA的结构与功能。
FEBS J. 2015 Jun;282(12):2279-95. doi: 10.1111/febs.13307. Epub 2015 Jun 2.
10
The BioMart community portal: an innovative alternative to large, centralized data repositories.生物信息学集成数据挖掘社区门户:大型集中式数据存储库的创新替代方案。
Nucleic Acids Res. 2015 Jul 1;43(W1):W589-98. doi: 10.1093/nar/gkv350. Epub 2015 Apr 20.