• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

LeNup:利用改进的卷积神经网络从 DNA 序列学习核小体定位。

LeNup: learning nucleosome positioning from DNA sequences with improved convolutional neural networks.

机构信息

Department of Biomedical Engineering.

Key Laboratory of Convergence Medical Engineering System and Healthcare Technology of the Ministry of Industry and Information Technology, School of Life Science, Beijing Institute of Technology, Beijing 100081, China.

出版信息

Bioinformatics. 2018 May 15;34(10):1705-1712. doi: 10.1093/bioinformatics/bty003.

DOI:10.1093/bioinformatics/bty003
PMID:29329398
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5946947/
Abstract

MOTIVATION

Nucleosome positioning plays significant roles in proper genome packing and its accessibility to execute transcription regulation. Despite a multitude of nucleosome positioning resources available on line including experimental datasets of genome-wide nucleosome occupancy profiles and computational tools to the analysis on these data, the complex language of eukaryotic Nucleosome positioning remains incompletely understood.

RESULTS

Here, we address this challenge using an approach based on a state-of-the-art machine learning method. We present a novel convolutional neural network (CNN) to understand nucleosome positioning. We combined Inception-like networks with a gating mechanism for the response of multiple patterns and long term association in DNA sequences. We developed the open-source package LeNup based on the CNN to predict nucleosome positioning in Homo sapiens, Caenorhabditis elegans, Drosophila melanogaster as well as Saccharomyces cerevisiae genomes. We trained LeNup on four benchmark datasets. LeNup achieved greater predictive accuracy than previously published methods.

AVAILABILITY AND IMPLEMENTATION

LeNup is freely available as Python and Lua script source code under a BSD style license from https://github.com/biomedBit/LeNup.

CONTACT

jhzhang@bit.edu.cn.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

核小体定位在正确的基因组包装及其转录调控的可及性方面发挥着重要作用。尽管在线上有许多核小体定位资源,包括全基因组核小体占有率图谱的实验数据集和用于分析这些数据的计算工具,但真核生物核小体定位的复杂语言仍未被完全理解。

结果

在这里,我们使用基于最先进的机器学习方法的方法来解决这个挑战。我们提出了一种新的卷积神经网络(CNN)来理解核小体定位。我们将类似于 Inception 的网络与门控机制结合起来,以响应 DNA 序列中的多种模式和长期关联。我们基于 CNN 开发了开源软件包 LeNup,用于预测人类、秀丽隐杆线虫、黑腹果蝇和酿酒酵母基因组中的核小体定位。我们在四个基准数据集上训练了 LeNup。LeNup 的预测准确性优于以前发表的方法。

可用性和实现

LeNup 可作为 Python 和 Lua 脚本源代码,根据 BSD 样式许可证从 https://github.com/biomedBit/LeNup 获得。

联系方式

jhzhang@bit.edu.cn。

补充信息

补充数据可在 Bioinformatics 在线获取。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acdb/5946947/7320dc5bff2e/bty003f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acdb/5946947/47ef1d109ced/bty003f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acdb/5946947/8d11b7261bd2/bty003f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acdb/5946947/4dadd4cba20d/bty003f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acdb/5946947/254b42c244c0/bty003f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acdb/5946947/abfe62d600a9/bty003f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acdb/5946947/1e780d0398be/bty003f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acdb/5946947/7320dc5bff2e/bty003f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acdb/5946947/47ef1d109ced/bty003f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acdb/5946947/8d11b7261bd2/bty003f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acdb/5946947/4dadd4cba20d/bty003f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acdb/5946947/254b42c244c0/bty003f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acdb/5946947/abfe62d600a9/bty003f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acdb/5946947/1e780d0398be/bty003f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acdb/5946947/7320dc5bff2e/bty003f7.jpg

相似文献

1
LeNup: learning nucleosome positioning from DNA sequences with improved convolutional neural networks.LeNup:利用改进的卷积神经网络从 DNA 序列学习核小体定位。
Bioinformatics. 2018 May 15;34(10):1705-1712. doi: 10.1093/bioinformatics/bty003.
2
Prediction of nucleosome positioning by the incorporation of frequencies and distributions of three different nucleotide segment lengths into a general pseudo k-tuple nucleotide composition.通过将三种不同核苷酸片段长度的频率和分布纳入通用伪k元核苷酸组成来预测核小体定位。
Bioinformatics. 2017 Jan 1;33(1):42-48. doi: 10.1093/bioinformatics/btw562. Epub 2016 Aug 25.
3
NucPosPred: Predicting species-specific genomic nucleosome positioning via four different modes of general PseKNC.NucPosPred:通过四种不同的通用 PseKNC 模式预测种特异性基因组核小体定位
J Theor Biol. 2018 Aug 7;450:15-21. doi: 10.1016/j.jtbi.2018.04.025. Epub 2018 Apr 18.
4
An analysis and prediction of nucleosome positioning based on information content.基于信息含量的核小体定位分析与预测。
Chromosome Res. 2013 Mar;21(1):63-74. doi: 10.1007/s10577-013-9338-z. Epub 2013 Feb 22.
5
ZCMM: A Novel Method Using Z-Curve Theory- Based and Position Weight Matrix for Predicting Nucleosome Positioning.ZCMM:一种基于 Z 曲线理论和位置权重矩阵的预测核小体定位的新方法。
Genes (Basel). 2019 Sep 28;10(10):765. doi: 10.3390/genes10100765.
6
Comparative analysis and prediction of nucleosome positioning using integrative feature representation and machine learning algorithms.基于综合特征表示和机器学习算法的核小体定位的比较分析和预测。
BMC Bioinformatics. 2021 Jun 2;22(Suppl 6):129. doi: 10.1186/s12859-021-04006-w.
7
iNuc-PseKNC: a sequence-based predictor for predicting nucleosome positioning in genomes with pseudo k-tuple nucleotide composition.iNuc-PseKNC:一种基于序列的预测器,用于预测基因组中具有伪 k-元核苷酸组成的核小体定位。
Bioinformatics. 2014 Jun 1;30(11):1522-9. doi: 10.1093/bioinformatics/btu083. Epub 2014 Feb 6.
8
A deformation energy-based model for predicting nucleosome dyads and occupancy.一种基于变形能量的预测核小体二分体和占有率的模型。
Sci Rep. 2016 Apr 7;6:24133. doi: 10.1038/srep24133.
9
DiNuP: a systematic approach to identify regions of differential nucleosome positioning.DiNuP:一种系统性方法,用于鉴定差异核小体定位区域。
Bioinformatics. 2012 Aug 1;28(15):1965-71. doi: 10.1093/bioinformatics/bts329. Epub 2012 Jun 4.
10
Prediction of nucleosome positioning in genomes: limits and perspectives of physical and bioinformatic approaches.基因组中核小体定位的预测:物理和生物信息学方法的局限性和展望。
J Biomol Struct Dyn. 2010 Jun;27(6):747-64. doi: 10.1080/07391102.2010.10508583.

引用本文的文献

1
Interpretable deep residual network uncovers nucleosome positioning and associated features.可解释深度残差网络揭示核小体定位和相关特征。
Nucleic Acids Res. 2024 Aug 27;52(15):8734-8745. doi: 10.1093/nar/gkae623.
2
The importance of DNA sequence for nucleosome positioning in transcriptional regulation.DNA 序列在转录调控中核小体定位的重要性。
Life Sci Alliance. 2024 Jun 3;7(8). doi: 10.26508/lsa.202302380. Print 2024 Aug.
3
Comprehensive computational analysis of epigenetic descriptors affecting CRISPR-Cas9 off-target activity.

本文引用的文献

1
RPA binds histone H3-H4 and functions in DNA replication-coupled nucleosome assembly.RPA 结合组蛋白 H3-H4,并在 DNA 复制偶联核小体组装中发挥作用。
Science. 2017 Jan 27;355(6323):415-420. doi: 10.1126/science.aah4712.
2
Dermatologist-level classification of skin cancer with deep neural networks.基于深度神经网络的皮肤癌皮肤科医生级分类。
Nature. 2017 Feb 2;542(7639):115-118. doi: 10.1038/nature21056. Epub 2017 Jan 25.
3
Regulated large-scale nucleosome density patterns and precise nucleosome positioning correlate with V(D)J recombination.
综合计算分析影响 CRISPR-Cas9 脱靶活性的表观遗传描述符。
BMC Genomics. 2022 Dec 6;23(1):805. doi: 10.1186/s12864-022-09012-7.
4
DeepNup: Prediction of Nucleosome Positioning from DNA Sequences Using Deep Neural Network.DeepNup:使用深度神经网络从 DNA 序列预测核小体定位。
Genes (Basel). 2022 Oct 30;13(11):1983. doi: 10.3390/genes13111983.
5
Assessment and Optimization of Explainable Machine Learning Models Applied to Transcriptomic Data.评估和优化应用于转录组数据的可解释机器学习模型。
Genomics Proteomics Bioinformatics. 2022 Oct;20(5):899-911. doi: 10.1016/j.gpb.2022.07.003. Epub 2022 Aug 3.
6
Genomics enters the deep learning era.基因组学进入深度学习时代。
PeerJ. 2022 Jun 24;10:e13613. doi: 10.7717/peerj.13613. eCollection 2022.
7
Galaxy Dnpatterntools for Computational Analysis of Nucleosome Positioning Sequence Patterns.星系 Dnpatterntools 用于核小体定位序列模式的计算分析。
Int J Mol Sci. 2022 Apr 28;23(9):4869. doi: 10.3390/ijms23094869.
8
DNAcycP: a deep learning tool for DNA cyclizability prediction.DNAcycP:一种用于 DNA 环化能力预测的深度学习工具。
Nucleic Acids Res. 2022 Apr 8;50(6):3142-3154. doi: 10.1093/nar/gkac162.
9
Comparative analysis and prediction of nucleosome positioning using integrative feature representation and machine learning algorithms.基于综合特征表示和机器学习算法的核小体定位的比较分析和预测。
BMC Bioinformatics. 2021 Jun 2;22(Suppl 6):129. doi: 10.1186/s12859-021-04006-w.
10
BITS2019: the sixteenth annual meeting of the Italian society of bioinformatics.BITS2019:第十六届意大利生物信息学学会年会。
BMC Bioinformatics. 2020 Sep 16;21(Suppl 8):363. doi: 10.1186/s12859-020-03708-x.
受调控的大规模核小体密度模式和精确的核小体定位与V(D)J重组相关。
Proc Natl Acad Sci U S A. 2016 Oct 18;113(42):E6427-E6436. doi: 10.1073/pnas.1605543113. Epub 2016 Oct 3.
4
Nucleosomal signatures impose nucleosome positioning in coding and noncoding sequences in the genome.核小体特征决定了基因组中编码和非编码序列的核小体定位。
Genome Res. 2016 Nov;26(11):1532-1543. doi: 10.1101/gr.207241.116. Epub 2016 Sep 23.
5
Dynamics of Nucleosome Positioning Maturation following Genomic Replication.基因组复制后核小体定位成熟的动力学
Cell Rep. 2016 Sep 6;16(10):2651-2665. doi: 10.1016/j.celrep.2016.07.083. Epub 2016 Aug 25.
6
Prediction of nucleosome positioning by the incorporation of frequencies and distributions of three different nucleotide segment lengths into a general pseudo k-tuple nucleotide composition.通过将三种不同核苷酸片段长度的频率和分布纳入通用伪k元核苷酸组成来预测核小体定位。
Bioinformatics. 2017 Jan 1;33(1):42-48. doi: 10.1093/bioinformatics/btw562. Epub 2016 Aug 25.
7
iNuc-STNC: a sequence-based predictor for identification of nucleosome positioning in genomes by extending the concept of SAAC and Chou's PseAAC.iNuc-STNC:一种基于序列的预测器,通过扩展SAAC和周式伪氨基酸组成的概念来识别基因组中的核小体定位。
Mol Biosyst. 2016 Jul 19;12(8):2587-93. doi: 10.1039/c6mb00221h.
8
Basset: learning the regulatory code of the accessible genome with deep convolutional neural networks.巴塞特:利用深度卷积神经网络学习可及基因组的调控密码。
Genome Res. 2016 Jul;26(7):990-9. doi: 10.1101/gr.200535.115. Epub 2016 May 3.
9
iDHS-EL: identifying DNase I hypersensitive sites by fusing three different modes of pseudo nucleotide composition into an ensemble learning framework.iDHS-EL:通过将三种不同模式的伪核苷酸组成融合到一个集成学习框架中,来识别 DNase I 超敏位点。
Bioinformatics. 2016 Aug 15;32(16):2411-8. doi: 10.1093/bioinformatics/btw186. Epub 2016 Apr 8.
10
Mastering the game of Go with deep neural networks and tree search.用深度神经网络和树搜索掌握围棋游戏。
Nature. 2016 Jan 28;529(7587):484-9. doi: 10.1038/nature16961.