• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

“广义基因组特征”的引入用于量化邻域偏好,从而导致基于分类学和功能的序列区分。

Introduction of 'Generalized Genomic Signatures' for the quantification of neighbour preferences leads to taxonomy- and functionality-based distinction among sequences.

机构信息

Institute of Biosciences and Applications, National Center for Scientific Research "Demokritos", 15310, Athens, Greece.

Genomics England, Charterhouse Square, London, EC1M 6BQ, UK.

出版信息

Sci Rep. 2019 Feb 8;9(1):1700. doi: 10.1038/s41598-018-38157-3.

DOI:10.1038/s41598-018-38157-3
PMID:30737442
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6368578/
Abstract

Analysis of DNA composition at several length scales constitutes the bulk of many early studies aimed at unravelling the complexity of the organization and functionality of genomes. Dinucleotide relative abundances are considered an idiosyncratic feature of genomes, regarded as a 'genomic signature'. Motivated by this finding, we introduce the 'Generalized Genomic Signatures' (GGSs), composed of over- and under-abundances of all oligonucleotides of a given length, thus filtering out compositional trends and neighbour preferences at any shorter range. Previous works on alignment-free genomic comparisons mostly rely on k-mer frequencies and not on distance-dependent neighbour preferences. Therein, nucleotide composition and proximity preferences are combined, while in the present work they are strictly separated, focusing uniquely on neighbour relationships. GGSs retain the potential or even outperform genomic signatures defined at the dinucleotide level in distinguishing between taxonomic subdivisions of bacteria, and can be more effectively implemented in microbial phylogenetic reconstruction. Moreover, we compare DNA sequences from the human genome corresponding to protein coding segments, conserved non-coding elements and non-functional DNA stretches. These classes of sequences have distinctive GGSs according to their genomic role and degree of conservation. Overall, GGSs constitute a trait characteristic of the evolutionary origin and functionality of different genomic segments.

摘要

对多个长度尺度上的 DNA 组成进行分析是许多旨在揭示基因组组织和功能复杂性的早期研究的主要内容。二核苷酸相对丰度被认为是基因组的特有特征,被视为“基因组特征”。受此发现的启发,我们引入了“广义基因组特征”(GGS),它由给定长度的所有寡核苷酸的过丰度和欠丰度组成,从而过滤掉任何更短范围内的组成趋势和相邻偏好。以前关于无比对基因组比较的工作主要依赖于 k-mer 频率,而不是依赖于距离相关的相邻偏好。在这些工作中,核苷酸组成和接近偏好是结合在一起的,而在本工作中,它们是严格分开的,只专注于相邻关系。GGS 在区分细菌的分类细分方面保留了甚至超过在二核苷酸水平定义的基因组特征的潜力,并且可以更有效地用于微生物系统发育重建。此外,我们比较了来自人类基因组的对应于蛋白质编码片段、保守非编码元件和非功能 DNA 片段的 DNA 序列。这些序列类根据其基因组作用和保守程度具有独特的 GGS。总体而言,GGS 构成了不同基因组片段的进化起源和功能的特征。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8087/6368578/7aa936b3ced3/41598_2018_38157_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8087/6368578/f2ebe29d3b5c/41598_2018_38157_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8087/6368578/b6c7ad492a6e/41598_2018_38157_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8087/6368578/a3ba91cfbe83/41598_2018_38157_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8087/6368578/29d45832b799/41598_2018_38157_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8087/6368578/7aa936b3ced3/41598_2018_38157_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8087/6368578/f2ebe29d3b5c/41598_2018_38157_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8087/6368578/b6c7ad492a6e/41598_2018_38157_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8087/6368578/a3ba91cfbe83/41598_2018_38157_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8087/6368578/29d45832b799/41598_2018_38157_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8087/6368578/7aa936b3ced3/41598_2018_38157_Fig5_HTML.jpg

相似文献

1
Introduction of 'Generalized Genomic Signatures' for the quantification of neighbour preferences leads to taxonomy- and functionality-based distinction among sequences.“广义基因组特征”的引入用于量化邻域偏好,从而导致基于分类学和功能的序列区分。
Sci Rep. 2019 Feb 8;9(1):1700. doi: 10.1038/s41598-018-38157-3.
2
Global dinucleotide signatures and analysis of genomic heterogeneity.全球双核苷酸特征及基因组异质性分析
Curr Opin Microbiol. 1998 Oct;1(5):598-610. doi: 10.1016/s1369-5274(98)80095-7.
3
Additive methods for genomic signatures.基因组特征的加法方法。
BMC Bioinformatics. 2016 Aug 22;17(1):313. doi: 10.1186/s12859-016-1157-8.
4
An investigation into inter- and intragenomic variations of graphic genomic signatures.对图形基因组特征的基因组间和基因组内变异的调查。
BMC Bioinformatics. 2015 Aug 7;16:246. doi: 10.1186/s12859-015-0655-4.
5
Quantitatively Partitioning Microbial Genomic Traits among Taxonomic Ranks across the Microbial Tree of Life.定量划分生命之树上的微生物分类等级中的微生物基因组特征。
mSphere. 2019 Aug 28;4(4):e00446-19. doi: 10.1128/mSphere.00446-19.
6
Classification of selectively constrained DNA elements using feature vectors and rule-based classifiers.使用特征向量和基于规则的分类器对选择性受限DNA元件进行分类。
Genomics. 2014 Aug;104(2):79-86. doi: 10.1016/j.ygeno.2014.07.004. Epub 2014 Jul 22.
7
Molecular evolution of herpesviruses: genomic and protein sequence comparisons.疱疹病毒的分子进化:基因组和蛋白质序列比较
J Virol. 1994 Mar;68(3):1886-902. doi: 10.1128/JVI.68.3.1886-1902.1994.
8
Comparisons of eukaryotic genomic sequences.真核生物基因组序列的比较。
Proc Natl Acad Sci U S A. 1994 Dec 20;91(26):12832-6. doi: 10.1073/pnas.91.26.12832.
9
GENSTYLE: exploration and analysis of DNA sequences with genomic signature.基因风格:利用基因组特征对DNA序列进行探索与分析。
Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W512-5. doi: 10.1093/nar/gki489.
10
Phylogenetic analysis using complete signature information of whole genomes and clustered Neighbour-Joining method.使用全基因组的完整特征信息和聚类邻接法进行系统发育分析。
Int J Bioinform Res Appl. 2006;2(3):219-48. doi: 10.1504/IJBRA.2006.010602.

引用本文的文献

1
Discovery of novel treponemes associated with pododermatitis in elk ().发现与麋鹿蹄皮炎相关的新型密螺旋体()。
Appl Environ Microbiol. 2024 Jun 18;90(6):e0010524. doi: 10.1128/aem.00105-24. Epub 2024 May 14.
2
Genomic Signature in Evolutionary Biology: A Review.进化生物学中的基因组特征:综述
Biology (Basel). 2023 Feb 16;12(2):322. doi: 10.3390/biology12020322.

本文引用的文献

1
Conserved non-coding elements: developmental gene regulation meets genome organization.保守非编码元件:发育基因调控与基因组组织相遇
Nucleic Acids Res. 2017 Dec 15;45(22):12611-12624. doi: 10.1093/nar/gkx1074.
2
Alignment-free sequence comparison: benefits, applications, and tools.无比对信息的序列比对:优势、应用和工具。
Genome Biol. 2017 Oct 3;18(1):186. doi: 10.1186/s13059-017-1319-7.
3
A novel skew analysis reveals substitution asymmetries linked to genetic code GC-biases and PolIII a-subunit isoforms.一种新型偏斜分析揭示了与遗传密码GC偏倚和PolIII α亚基异构体相关的替代不对称性。
DNA Res. 2016 Aug;23(4):353-63. doi: 10.1093/dnares/dsw021. Epub 2016 Jun 26.
4
Combinatorial Gene Regulatory Functions Underlie Ultraconserved Elements in Drosophila.组合式基因调控功能是果蝇中极度保守元件的基础。
Mol Biol Evol. 2016 Sep;33(9):2294-306. doi: 10.1093/molbev/msw101. Epub 2016 May 31.
5
An Empirical Overview of the No Free Lunch Theorem and Its Effect on Real-World Machine Learning Classification.无免费午餐定理及其对现实世界机器学习分类影响的实证概述。
Neural Comput. 2016 Jan;28(1):216-28. doi: 10.1162/NECO_a_00793. Epub 2015 Nov 24.
6
Allele frequencies of variants in ultra conserved elements identify selective pressure on transcription factor binding.超保守元件中变异的等位基因频率揭示了对转录因子结合的选择压力。
PLoS One. 2014 Nov 4;9(11):e110692. doi: 10.1371/journal.pone.0110692. eCollection 2014.
7
Classification of selectively constrained DNA elements using feature vectors and rule-based classifiers.使用特征向量和基于规则的分类器对选择性受限DNA元件进行分类。
Genomics. 2014 Aug;104(2):79-86. doi: 10.1016/j.ygeno.2014.07.004. Epub 2014 Jul 22.
8
Conserved noncoding elements follow power-law-like distributions in several genomes as a result of genome dynamics.由于基因组动态变化,保守非编码元件在多个基因组中呈现类似幂律的分布。
PLoS One. 2014 May 2;9(5):e95437. doi: 10.1371/journal.pone.0095437. eCollection 2014.
9
A DNA-centric protein interaction map of ultraconserved elements reveals contribution of transcription factor binding hubs to conservation.以DNA为中心的超保守元件蛋白质相互作用图谱揭示了转录因子结合中心对保守性的贡献。
Cell Rep. 2013 Oct 31;5(2):531-45. doi: 10.1016/j.celrep.2013.09.022. Epub 2013 Oct 17.
10
Alignment-free genetic sequence comparisons: a review of recent approaches by word analysis.基于字分析的无比对基因序列比较:最新方法综述
Brief Bioinform. 2014 Nov;15(6):890-905. doi: 10.1093/bib/bbt052. Epub 2013 Jul 31.