• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于综合特征表示和机器学习算法的核小体定位的比较分析和预测。

Comparative analysis and prediction of nucleosome positioning using integrative feature representation and machine learning algorithms.

机构信息

Department of Mathematics and Computational Science, Xiangtan University, Xiangtan, 411105, Hunan, China.

Key Laboratory of Intelligent Computing and Information Processing of Ministry of Education and Hunan Key Laboratory for Computation and Simulation in Science and Engineering, Xiangtan University, Xiangtan, 411105, Hunan, China.

出版信息

BMC Bioinformatics. 2021 Jun 2;22(Suppl 6):129. doi: 10.1186/s12859-021-04006-w.

DOI:10.1186/s12859-021-04006-w
PMID:34078256
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8170966/
Abstract

BACKGROUND

Nucleosome plays an important role in the process of genome expression, DNA replication, DNA repair and transcription. Therefore, the research of nucleosome positioning has invariably received extensive attention. Considering the diversity of DNA sequence representation methods, we tried to integrate multiple features to analyze its effect in the process of nucleosome positioning analysis. This process can also deepen our understanding of the theoretical analysis of nucleosome positioning.

RESULTS

Here, we not only used frequency chaos game representation (FCGR) to construct DNA sequence features, but also integrated it with other features and adopted the principal component analysis (PCA) algorithm. Simultaneously, support vector machine (SVM), extreme learning machine (ELM), extreme gradient boosting (XGBoost), multilayer perceptron (MLP) and convolutional neural networks (CNN) are used as predictors for nucleosome positioning prediction analysis, respectively. The integrated feature vector prediction quality is significantly superior to a single feature. After using principal component analysis (PCA) to reduce the feature dimension, the prediction quality of H. sapiens dataset has been significantly improved.

CONCLUSIONS

Comparative analysis and prediction on H. sapiens, C. elegans, D. melanogaster and S. cerevisiae datasets, demonstrate that the application of FCGR to nucleosome positioning is feasible, and we also found that integrative feature representation would be better.

摘要

背景

核小体在基因组表达、DNA 复制、DNA 修复和转录过程中起着重要作用。因此,核小体定位的研究一直受到广泛关注。考虑到 DNA 序列表示方法的多样性,我们试图整合多种特征来分析其在核小体定位分析过程中的作用。这一过程也可以加深我们对核小体定位理论分析的理解。

结果

在这里,我们不仅使用了频率混沌游戏表示(FCGR)来构建 DNA 序列特征,还将其与其他特征相结合,并采用主成分分析(PCA)算法。同时,支持向量机(SVM)、极限学习机(ELM)、极端梯度提升(XGBoost)、多层感知机(MLP)和卷积神经网络(CNN)分别作为核小体定位预测分析的预测器。集成特征向量的预测质量明显优于单个特征。在用主成分分析(PCA)降低特征维数后,H. sapiens 数据集的预测质量得到了显著提高。

结论

对 H. sapiens、C. elegans、D. melanogaster 和 S. cerevisiae 数据集的比较分析和预测表明,FCGR 应用于核小体定位是可行的,我们还发现综合特征表示会更好。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2c2/8170966/8cb88424e905/12859_2021_4006_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2c2/8170966/537c0f620a9d/12859_2021_4006_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2c2/8170966/f53036dd910e/12859_2021_4006_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2c2/8170966/cb94107615f7/12859_2021_4006_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2c2/8170966/0d3c331eae29/12859_2021_4006_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2c2/8170966/4b12f563faa2/12859_2021_4006_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2c2/8170966/4e213d6399e7/12859_2021_4006_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2c2/8170966/fbff56d0590b/12859_2021_4006_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2c2/8170966/8cb88424e905/12859_2021_4006_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2c2/8170966/537c0f620a9d/12859_2021_4006_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2c2/8170966/f53036dd910e/12859_2021_4006_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2c2/8170966/cb94107615f7/12859_2021_4006_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2c2/8170966/0d3c331eae29/12859_2021_4006_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2c2/8170966/4b12f563faa2/12859_2021_4006_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2c2/8170966/4e213d6399e7/12859_2021_4006_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2c2/8170966/fbff56d0590b/12859_2021_4006_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e2c2/8170966/8cb88424e905/12859_2021_4006_Fig8_HTML.jpg

相似文献

1
Comparative analysis and prediction of nucleosome positioning using integrative feature representation and machine learning algorithms.基于综合特征表示和机器学习算法的核小体定位的比较分析和预测。
BMC Bioinformatics. 2021 Jun 2;22(Suppl 6):129. doi: 10.1186/s12859-021-04006-w.
2
[Identification of nucleosome positioning using support vector machine method based on comprehensive DNA sequence feature].基于综合DNA序列特征的支持向量机方法鉴定核小体定位
Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2020 Jun 25;37(3):496-501. doi: 10.7507/1001-5515.201911064.
3
LeNup: learning nucleosome positioning from DNA sequences with improved convolutional neural networks.LeNup:利用改进的卷积神经网络从 DNA 序列学习核小体定位。
Bioinformatics. 2018 May 15;34(10):1705-1712. doi: 10.1093/bioinformatics/bty003.
4
NucPosPred: Predicting species-specific genomic nucleosome positioning via four different modes of general PseKNC.NucPosPred:通过四种不同的通用 PseKNC 模式预测种特异性基因组核小体定位
J Theor Biol. 2018 Aug 7;450:15-21. doi: 10.1016/j.jtbi.2018.04.025. Epub 2018 Apr 18.
5
Nucleosome positioning based on DNA sequence embedding and deep learning.基于 DNA 序列嵌入和深度学习的核小体定位。
BMC Genomics. 2022 Apr 13;23(Suppl 1):301. doi: 10.1186/s12864-022-08508-6.
6
An analysis and prediction of nucleosome positioning based on information content.基于信息含量的核小体定位分析与预测。
Chromosome Res. 2013 Mar;21(1):63-74. doi: 10.1007/s10577-013-9338-z. Epub 2013 Feb 22.
7
ZCMM: A Novel Method Using Z-Curve Theory- Based and Position Weight Matrix for Predicting Nucleosome Positioning.ZCMM:一种基于 Z 曲线理论和位置权重矩阵的预测核小体定位的新方法。
Genes (Basel). 2019 Sep 28;10(10):765. doi: 10.3390/genes10100765.
8
iNuc-ext-PseTNC: an efficient ensemble model for identification of nucleosome positioning by extending the concept of Chou's PseAAC to pseudo-tri-nucleotide composition.iNuc-ext-PseTNC:一种通过将 Chou 的 PseAAC 概念扩展到伪三核苷酸组成来有效识别核小体定位的集成模型。
Mol Genet Genomics. 2019 Feb;294(1):199-210. doi: 10.1007/s00438-018-1498-2. Epub 2018 Oct 5.
9
A deformation energy-based model for predicting nucleosome dyads and occupancy.一种基于变形能量的预测核小体二分体和占有率的模型。
Sci Rep. 2016 Apr 7;6:24133. doi: 10.1038/srep24133.
10
Prediction of nucleosome positioning by the incorporation of frequencies and distributions of three different nucleotide segment lengths into a general pseudo k-tuple nucleotide composition.通过将三种不同核苷酸片段长度的频率和分布纳入通用伪k元核苷酸组成来预测核小体定位。
Bioinformatics. 2017 Jan 1;33(1):42-48. doi: 10.1093/bioinformatics/btw562. Epub 2016 Aug 25.

引用本文的文献

1
DNA sequence analysis landscape: a comprehensive review of DNA sequence analysis task types, databases, datasets, word embedding methods, and language models.DNA序列分析全景:对DNA序列分析任务类型、数据库、数据集、词嵌入方法和语言模型的全面综述。
Front Med (Lausanne). 2025 Apr 8;12:1503229. doi: 10.3389/fmed.2025.1503229. eCollection 2025.
2
CGRclust: Chaos Game Representation for twin contrastive clustering of unlabelled DNA sequences.CGRclust:用于未标记DNA序列双对比聚类的混沌游戏表示法
BMC Genomics. 2024 Dec 18;25(1):1214. doi: 10.1186/s12864-024-11135-y.
3
A Machine Learning Framework for Diagnosing and Predicting the Severity of Coronary Artery Disease.

本文引用的文献

1
Gene Expression Value Prediction Based on XGBoost Algorithm.基于XGBoost算法的基因表达值预测
Front Genet. 2019 Nov 12;10:1077. doi: 10.3389/fgene.2019.01077. eCollection 2019.
2
Splice sites detection using chaos game representation and neural network.基于混沌游戏表示和神经网络的剪接位点检测。
Genomics. 2020 Mar;112(2):1847-1852. doi: 10.1016/j.ygeno.2019.10.018. Epub 2019 Nov 5.
3
Fast and accurate genome comparison using genome images: The Extended Natural Vector Method.使用基因组图像进行快速准确的基因组比较:扩展自然向量方法。
一种用于诊断和预测冠状动脉疾病严重程度的机器学习框架。
Rev Cardiovasc Med. 2023 Jun 8;24(6):168. doi: 10.31083/j.rcm2406168. eCollection 2023 Jun.
4
The importance of DNA sequence for nucleosome positioning in transcriptional regulation.DNA 序列在转录调控中核小体定位的重要性。
Life Sci Alliance. 2024 Jun 3;7(8). doi: 10.26508/lsa.202302380. Print 2024 Aug.
5
DeepNup: Prediction of Nucleosome Positioning from DNA Sequences Using Deep Neural Network.DeepNup:使用深度神经网络从 DNA 序列预测核小体定位。
Genes (Basel). 2022 Oct 30;13(11):1983. doi: 10.3390/genes13111983.
6
Chaos game representation and its applications in bioinformatics.混沌游戏表示法及其在生物信息学中的应用。
Comput Struct Biotechnol J. 2021 Nov 10;19:6263-6271. doi: 10.1016/j.csbj.2021.11.008. eCollection 2021.
Mol Phylogenet Evol. 2019 Dec;141:106633. doi: 10.1016/j.ympev.2019.106633. Epub 2019 Sep 26.
4
Deep learning on chaos game representation for proteins.基于混沌游戏表示的蛋白质深度学习。
Bioinformatics. 2020 Jan 1;36(1):272-279. doi: 10.1093/bioinformatics/btz493.
5
Deep learning architectures for prediction of nucleosome positioning from sequences data.深度学习架构用于从序列数据预测核小体定位。
BMC Bioinformatics. 2018 Nov 20;19(Suppl 14):418. doi: 10.1186/s12859-018-2386-9.
6
LeNup: learning nucleosome positioning from DNA sequences with improved convolutional neural networks.LeNup:利用改进的卷积神经网络从 DNA 序列学习核小体定位。
Bioinformatics. 2018 May 15;34(10):1705-1712. doi: 10.1093/bioinformatics/bty003.
7
Nucleosome occupancy as a novel chromatin parameter for replication origin functions.核小体占据率作为复制起点功能的一种新型染色质参数。
Genome Res. 2017 Feb;27(2):269-277. doi: 10.1101/gr.209940.116. Epub 2016 Nov 28.
8
Prediction of nucleosome positioning by the incorporation of frequencies and distributions of three different nucleotide segment lengths into a general pseudo k-tuple nucleotide composition.通过将三种不同核苷酸片段长度的频率和分布纳入通用伪k元核苷酸组成来预测核小体定位。
Bioinformatics. 2017 Jan 1;33(1):42-48. doi: 10.1093/bioinformatics/btw562. Epub 2016 Aug 25.
9
Nucleosome repositioning underlies dynamic gene expression.核小体重新定位是动态基因表达的基础。
Genes Dev. 2016 Mar 15;30(6):660-72. doi: 10.1101/gad.274910.115. Epub 2016 Mar 10.
10
Using deformation energy to analyze nucleosome positioning in genomes.利用变形能分析基因组中的核小体定位。
Genomics. 2016 Mar;107(2-3):69-75. doi: 10.1016/j.ygeno.2015.12.005. Epub 2015 Dec 24.