• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过全卷积神经网络定位转录因子结合位点。

Locating transcription factor binding sites by fully convolutional neural network.

机构信息

Translational Medical Center for Stem Cell Therapy and Institute for Regenerative Medicine, Shanghai East Hospital, Tongji University, Shanghai, China.

Computer Science and Technology, Tongji University, China.

出版信息

Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbaa435.

DOI:10.1093/bib/bbaa435
PMID:33498086
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8425303/
Abstract

Transcription factors (TFs) play an important role in regulating gene expression, thus identification of the regions bound by them has become a fundamental step for molecular and cellular biology. In recent years, an increasing number of deep learning (DL) based methods have been proposed for predicting TF binding sites (TFBSs) and achieved impressive prediction performance. However, these methods mainly focus on predicting the sequence specificity of TF-DNA binding, which is equivalent to a sequence-level binary classification task, and fail to identify motifs and TFBSs accurately. In this paper, we developed a fully convolutional network coupled with global average pooling (FCNA), which by contrast is equivalent to a nucleotide-level binary classification task, to roughly locate TFBSs and accurately identify motifs. Experimental results on human ChIP-seq datasets show that FCNA outperforms other competing methods significantly. Besides, we find that the regions located by FCNA can be used by motif discovery tools to further refine the prediction performance. Furthermore, we observe that FCNA can accurately identify TF-DNA binding motifs across different cell lines and infer indirect TF-DNA bindings.

摘要

转录因子 (TFs) 在调节基因表达中起着重要作用,因此鉴定它们结合的区域已成为分子和细胞生物学的基本步骤。近年来,越来越多的基于深度学习 (DL) 的方法被提出用于预测 TF 结合位点 (TFBSs),并取得了令人印象深刻的预测性能。然而,这些方法主要侧重于预测 TF-DNA 结合的序列特异性,这相当于序列级别的二进制分类任务,并且无法准确识别基序和 TFBSs。在本文中,我们开发了一种完全卷积网络与全局平均池化 (FCNA) 相结合的方法,它相当于核苷酸级别的二进制分类任务,用于粗略定位 TFBSs 和准确识别基序。在人类 ChIP-seq 数据集上的实验结果表明,FCNA 明显优于其他竞争方法。此外,我们发现 FCNA 定位的区域可以被基序发现工具用于进一步改进预测性能。此外,我们观察到 FCNA 可以准确识别不同细胞系中的 TF-DNA 结合基序,并推断间接的 TF-DNA 结合。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72d2/8425303/ea9a2923a951/bbaa435f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72d2/8425303/9e8e1822424d/bbaa435f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72d2/8425303/d3eb010cfeda/bbaa435f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72d2/8425303/576938fa4edd/bbaa435f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72d2/8425303/e43ef3326db2/bbaa435f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72d2/8425303/ea9a2923a951/bbaa435f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72d2/8425303/9e8e1822424d/bbaa435f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72d2/8425303/d3eb010cfeda/bbaa435f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72d2/8425303/576938fa4edd/bbaa435f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72d2/8425303/e43ef3326db2/bbaa435f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/72d2/8425303/ea9a2923a951/bbaa435f5.jpg

相似文献

1
Locating transcription factor binding sites by fully convolutional neural network.通过全卷积神经网络定位转录因子结合位点。
Brief Bioinform. 2021 Sep 2;22(5). doi: 10.1093/bib/bbaa435.
2
Base-resolution prediction of transcription factor binding signals by a deep learning framework.基于深度学习框架的转录因子结合信号的碱基分辨率预测。
PLoS Comput Biol. 2022 Mar 9;18(3):e1009941. doi: 10.1371/journal.pcbi.1009941. eCollection 2022 Mar.
3
FCNGRU: Locating Transcription Factor Binding Sites by Combing Fully Convolutional Neural Network With Gated Recurrent Unit.FCNGRU:通过全卷积神经网络与门控循环单元相结合定位转录因子结合位点。
IEEE J Biomed Health Inform. 2022 Apr;26(4):1883-1890. doi: 10.1109/JBHI.2021.3117616. Epub 2022 Apr 14.
4
High-resolution transcription factor binding sites prediction improved performance and interpretability by deep learning method.深度学习方法提高了高分辨率转录因子结合位点预测的性能和可解释性。
Brief Bioinform. 2021 Nov 5;22(6). doi: 10.1093/bib/bbab273.
5
A novel convolution attention model for predicting transcription factor binding sites by combination of sequence and shape.一种新型卷积注意力模型,通过序列和形状的结合来预测转录因子结合位点。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab525.
6
Using Fully Convolutional Network to Locate Transcription Factor Binding Sites Based on DNA Sequence and Conservation Information.基于DNA序列和保守信息,使用全卷积网络定位转录因子结合位点。
IEEE/ACM Trans Comput Biol Bioinform. 2023 Sep-Oct;20(5):2690-2699. doi: 10.1109/TCBB.2022.3219831. Epub 2023 Oct 9.
7
Discovering unknown human and mouse transcription factor binding sites and their characteristics from ChIP-seq data.从 ChIP-seq 数据中发现未知的人和小鼠转录因子结合位点及其特征。
Proc Natl Acad Sci U S A. 2021 May 18;118(20). doi: 10.1073/pnas.2026754118.
8
MMGAT: a graph attention network framework for ATAC-seq motifs finding.MMGAT:一种用于 ATAC-seq 基序发现的图注意力网络框架。
BMC Bioinformatics. 2024 Apr 20;25(1):158. doi: 10.1186/s12859-024-05774-x.
9
DeepSTF: predicting transcription factor binding sites by interpretable deep neural networks combining sequence and shape.DeepSTF:通过结合序列和形状的可解释深度神经网络预测转录因子结合位点。
Brief Bioinform. 2023 Jul 20;24(4). doi: 10.1093/bib/bbad231.
10
MMGraph: a multiple motif predictor based on graph neural network and coexisting probability for ATAC-seq data.MMGraph:基于图神经网络和共存概率的多基序预测器,用于 ATAC-seq 数据。
Bioinformatics. 2022 Sep 30;38(19):4636-4638. doi: 10.1093/bioinformatics/btac572.

引用本文的文献

1
Explainable AI in Genomics: Transcription Factor Binding Site Prediction with Mixture of Experts.基因组学中的可解释人工智能:基于专家混合模型的转录因子结合位点预测
ArXiv. 2025 Jul 18:arXiv:2507.09754v2.
2
MLSNet: a deep learning model for predicting transcription factor binding sites.MLSNet:一种用于预测转录因子结合位点的深度学习模型。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae489.
3
Cross-Species Prediction of Transcription Factor Binding by Adversarial Training of a Novel Nucleotide-Level Deep Neural Network.通过新型核苷酸级别的深度神经网络的对抗训练对转录因子结合进行跨物种预测。

本文引用的文献

1
Multi-Scale Capsule Network for Predicting DNA-Protein Binding Sites.多尺度胶囊网络预测 DNA-蛋白质结合位点
IEEE/ACM Trans Comput Biol Bioinform. 2021 Sep-Oct;18(5):1793-1800. doi: 10.1109/TCBB.2020.3025579. Epub 2021 Oct 7.
2
Predicting in-vitro Transcription Factor Binding Sites Using DNA Sequence + Shape.使用 DNA 序列+形状预测体外转录因子结合位点。
IEEE/ACM Trans Comput Biol Bioinform. 2021 Mar-Apr;18(2):667-676. doi: 10.1109/TCBB.2019.2947461. Epub 2021 Apr 6.
3
Modeling in-vivo protein-DNA binding by combining multiple-instance learning with a hybrid deep neural network.
Adv Sci (Weinh). 2024 Sep;11(36):e2405685. doi: 10.1002/advs.202405685. Epub 2024 Jul 30.
4
Advances in computational and experimental approaches for deciphering transcriptional regulatory networks: Understanding the roles of cis-regulatory elements is essential, and recent research utilizing MPRAs, STARR-seq, CRISPR-Cas9, and machine learning has yielded valuable insights.在解析转录调控网络的计算和实验方法方面的进展:理解顺式调控元件的作用至关重要,最近利用 MPRAs、STARR-seq、CRISPR-Cas9 和机器学习的研究提供了有价值的见解。
Bioessays. 2024 Jul;46(7):e2300210. doi: 10.1002/bies.202300210. Epub 2024 May 8.
5
BERT-TFBS: a novel BERT-based model for predicting transcription factor binding sites by transfer learning.BERT-TFBS:一种基于迁移学习的用于预测转录因子结合位点的新型基于BERT的模型。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae195.
6
Comparative analysis of models in predicting the effects of SNPs on TF-DNA binding using large-scale in vitro and in vivo data.利用大规模的体外和体内数据对 SNP 对 TF-DNA 结合影响的预测模型进行比较分析。
Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae110.
7
WSHNN: A Weakly Supervised Hybrid Neural Network for the Identification of DNA-Protein Binding Sites.WSHNN:一种用于识别DNA-蛋白质结合位点的弱监督混合神经网络。
Curr Comput Aided Drug Des. 2024 Feb 12. doi: 10.2174/0115734099277249240129114123.
8
Nucleotide-level prediction of CircRNA-protein binding based on fully convolutional neural network.基于全卷积神经网络的环状RNA与蛋白质结合的核苷酸水平预测
Front Genet. 2023 Oct 6;14:1283404. doi: 10.3389/fgene.2023.1283404. eCollection 2023.
9
Computational prediction and characterization of cell-type-specific and shared binding sites.计算预测和细胞类型特异性及共享结合位点的特征描述。
Bioinformatics. 2023 Jan 1;39(1). doi: 10.1093/bioinformatics/btac798.
10
DLoopCaller: A deep learning approach for predicting genome-wide chromatin loops by integrating accessible chromatin landscapes.DLoopCaller:一种通过整合可及染色质景观来预测全基因组染色质环的深度学习方法。
PLoS Comput Biol. 2022 Oct 7;18(10):e1010572. doi: 10.1371/journal.pcbi.1010572. eCollection 2022 Oct.
通过将多实例学习与混合深度神经网络相结合来模拟体内蛋白质-DNA 结合。
Sci Rep. 2019 Jun 11;9(1):8484. doi: 10.1038/s41598-019-44966-x.
4
Weakly-Supervised Convolutional Neural Network Architecture for Predicting Protein-DNA Binding.弱监督卷积神经网络结构用于预测蛋白质-DNA 结合。
IEEE/ACM Trans Comput Biol Bioinform. 2020 Mar-Apr;17(2):679-689. doi: 10.1109/TCBB.2018.2864203. Epub 2018 Aug 7.
5
High-Order Convolutional Neural Network Architecture for Predicting DNA-Protein Binding Sites.用于预测 DNA-蛋白质结合位点的高阶卷积神经网络架构。
IEEE/ACM Trans Comput Biol Bioinform. 2019 Jul-Aug;16(4):1184-1192. doi: 10.1109/TCBB.2018.2819660. Epub 2018 Mar 26.
6
HOCOMOCO: towards a complete collection of transcription factor binding models for human and mouse via large-scale ChIP-Seq analysis.HOCOMOCO:通过大规模的 ChIP-Seq 分析,构建人类和小鼠转录因子结合模型的完整集合。
Nucleic Acids Res. 2018 Jan 4;46(D1):D252-D259. doi: 10.1093/nar/gkx1106.
7
SMiLE-seq identifies binding motifs of single and dimeric transcription factors.SMiLE-seq 鉴定了单个和二聚体转录因子的结合基序。
Nat Methods. 2017 Mar;14(3):316-322. doi: 10.1038/nmeth.4143. Epub 2017 Jan 16.
8
DNA Shape Features Improve Transcription Factor Binding Site Predictions In Vivo.DNA 形状特征可改善体内转录因子结合位点预测。
Cell Syst. 2016 Sep 28;3(3):278-286.e4. doi: 10.1016/j.cels.2016.07.001. Epub 2016 Aug 18.
9
Fully Convolutional Networks for Semantic Segmentation.全卷积网络用于语义分割。
IEEE Trans Pattern Anal Mach Intell. 2017 Apr;39(4):640-651. doi: 10.1109/TPAMI.2016.2572683. Epub 2016 May 24.
10
DanQ: a hybrid convolutional and recurrent deep neural network for quantifying the function of DNA sequences.DanQ:一种用于量化DNA序列功能的卷积与循环相结合的深度神经网络。
Nucleic Acids Res. 2016 Jun 20;44(11):e107. doi: 10.1093/nar/gkw226. Epub 2016 Apr 15.