• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于卷积神经网络和对抗网络的跨细胞类型预测 TF 结合位点

Cross-Cell-Type Prediction of TF-Binding Site by Integrating Convolutional Neural Network and Adversarial Network.

机构信息

School of Computer Science and Technology, Harbin Institute of Technology, Shenzhen 518055, China.

Department of Computing, The Hong Kong Polytechnic University, Hong Kong 810005, China.

出版信息

Int J Mol Sci. 2019 Jul 12;20(14):3425. doi: 10.3390/ijms20143425.

DOI:10.3390/ijms20143425
PMID:31336830
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6679139/
Abstract

Transcription factor binding sites (TFBSs) play an important role in gene expression regulation. Many computational methods for TFBS prediction need sufficient labeled data. However, many transcription factors (TFs) lack labeled data in cell types. We propose a novel method, referred to as DANN_TF, for TFBS prediction. DANN_TF consists of a feature extractor, a label predictor, and a domain classifier. The feature extractor and the domain classifier constitute an Adversarial Network, which ensures that learned features are common features across different cell types. DANN_TF is evaluated on five TFs in five cell types with a total of 25 cell-type TF pairs and compared to a baseline method which does not use Adversarial Network. For both data augmentation and cross-cell-type prediction, DANN_TF performs better than the baseline method on most cell-type TF pairs. DANN_TF is further evaluated by an additional 13 TFs in the five cell types with a total of 65 cell-type TF pairs. Results show that DANN_TF achieves significantly higher AUC than the baseline method on 96.9% pairs of the 65 cell-type TF pairs. This is a strong indication that DANN_TF can indeed learn common features for cross-cell-type TFBS prediction.

摘要

转录因子结合位点 (TFBSs) 在基因表达调控中起着重要作用。许多用于 TFBS 预测的计算方法都需要充足的标记数据。然而,许多转录因子 (TFs) 在细胞类型中缺乏标记数据。我们提出了一种新的方法,称为 DANN_TF,用于 TFBS 预测。DANN_TF 由特征提取器、标签预测器和域分类器组成。特征提取器和域分类器构成了一个对抗网络,确保学习到的特征是不同细胞类型共有的特征。在五种细胞类型中的五个 TF 上评估了 DANN_TF,共有 25 对细胞类型 TF,与不使用对抗网络的基线方法进行了比较。对于数据增强和跨细胞类型预测,DANN_TF 在大多数细胞类型 TF 对中都优于基线方法。DANN_TF 还在五种细胞类型中的另外 13 个 TF 上进行了评估,共有 65 对细胞类型 TF。结果表明,在 65 对细胞类型 TF 对中,DANN_TF 在 96.9%的对上的 AUC 显著高于基线方法。这强烈表明 DANN_TF 确实可以学习跨细胞类型 TFBS 预测的共同特征。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ef2/6679139/4bfcb5550b47/ijms-20-03425-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ef2/6679139/6f1f3c0ca04f/ijms-20-03425-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ef2/6679139/eb2521506077/ijms-20-03425-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ef2/6679139/8a35a16fe167/ijms-20-03425-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ef2/6679139/c579a12b3c0a/ijms-20-03425-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ef2/6679139/3c4c96040234/ijms-20-03425-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ef2/6679139/4bfcb5550b47/ijms-20-03425-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ef2/6679139/6f1f3c0ca04f/ijms-20-03425-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ef2/6679139/eb2521506077/ijms-20-03425-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ef2/6679139/8a35a16fe167/ijms-20-03425-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ef2/6679139/c579a12b3c0a/ijms-20-03425-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ef2/6679139/3c4c96040234/ijms-20-03425-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6ef2/6679139/4bfcb5550b47/ijms-20-03425-g006.jpg

相似文献

1
Cross-Cell-Type Prediction of TF-Binding Site by Integrating Convolutional Neural Network and Adversarial Network.基于卷积神经网络和对抗网络的跨细胞类型预测 TF 结合位点
Int J Mol Sci. 2019 Jul 12;20(14):3425. doi: 10.3390/ijms20143425.
2
Identifying cooperative transcription factors in yeast using multiple data sources.利用多种数据源鉴定酵母中的协同转录因子。
BMC Syst Biol. 2014;8 Suppl 5(Suppl 5):S2. doi: 10.1186/1752-0509-8-S5-S2. Epub 2014 Dec 12.
3
MTTFsite: cross-cell type TF binding site prediction by using multi-task learning.MTTFsite:基于多任务学习的跨细胞类型 TF 结合位点预测。
Bioinformatics. 2019 Dec 15;35(24):5067-5077. doi: 10.1093/bioinformatics/btz451.
4
Prediction of TF-Binding Site by Inclusion of Higher Order Position Dependencies.通过包含更高阶位置相关性来预测 TF 结合位点。
IEEE/ACM Trans Comput Biol Bioinform. 2020 Jul-Aug;17(4):1383-1393. doi: 10.1109/TCBB.2019.2892124. Epub 2019 Jan 10.
5
Assessing the model transferability for prediction of transcription factor binding sites based on chromatin accessibility.基于染色质可及性评估预测转录因子结合位点的模型可转移性。
BMC Bioinformatics. 2017 Jul 27;18(1):355. doi: 10.1186/s12859-017-1769-7.
6
BERT-TFBS: a novel BERT-based model for predicting transcription factor binding sites by transfer learning.BERT-TFBS:一种基于迁移学习的用于预测转录因子结合位点的新型基于BERT的模型。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae195.
7
Cross-Species Prediction of Transcription Factor Binding by Adversarial Training of a Novel Nucleotide-Level Deep Neural Network.通过新型核苷酸级别的深度神经网络的对抗训练对转录因子结合进行跨物种预测。
Adv Sci (Weinh). 2024 Sep;11(36):e2405685. doi: 10.1002/advs.202405685. Epub 2024 Jul 30.
8
Prediction of Transcription Factor Binding Sites With an Attention Augmented Convolutional Neural Network.基于注意力增强卷积神经网络的转录因子结合位点预测
IEEE/ACM Trans Comput Biol Bioinform. 2022 Nov-Dec;19(6):3614-3623. doi: 10.1109/TCBB.2021.3126623. Epub 2022 Dec 8.
9
Enhancing the interpretability of transcription factor binding site prediction using attention mechanism.利用注意力机制提高转录因子结合位点预测的可解释性。
Sci Rep. 2020 Aug 7;10(1):13413. doi: 10.1038/s41598-020-70218-4.
10
Properly defining the targets of a transcription factor significantly improves the computational identification of cooperative transcription factor pairs in yeast.正确定义转录因子的靶标可显著提高酵母中协同转录因子对的计算识别能力。
BMC Genomics. 2015;16 Suppl 12(Suppl 12):S10. doi: 10.1186/1471-2164-16-S12-S10. Epub 2015 Dec 9.

引用本文的文献

1
Cross-Species Prediction of Transcription Factor Binding by Adversarial Training of a Novel Nucleotide-Level Deep Neural Network.通过新型核苷酸级别的深度神经网络的对抗训练对转录因子结合进行跨物种预测。
Adv Sci (Weinh). 2024 Sep;11(36):e2405685. doi: 10.1002/advs.202405685. Epub 2024 Jul 30.
2
Transfer Learning Allows Accurate RBP Target Site Prediction with Limited Sample Sizes.迁移学习可在样本量有限的情况下实现准确的RNA结合蛋白靶位点预测。
Biology (Basel). 2023 Sep 25;12(10):1276. doi: 10.3390/biology12101276.
3
Exploring the Role of Enhancer-Mediated Transcriptional Regulation in Precision Biology.

本文引用的文献

1
Prediction of TF-Binding Site by Inclusion of Higher Order Position Dependencies.通过包含更高阶位置相关性来预测 TF 结合位点。
IEEE/ACM Trans Comput Biol Bioinform. 2020 Jul-Aug;17(4):1383-1393. doi: 10.1109/TCBB.2019.2892124. Epub 2019 Jan 10.
2
EL_LSTM: Prediction of DNA-Binding Residue from Protein Sequence by Combining Long Short-Term Memory and Ensemble Learning.EL_LSTM:通过组合长短期记忆和集成学习预测蛋白质序列中的 DNA 结合残基。
IEEE/ACM Trans Comput Biol Bioinform. 2020 Jan-Feb;17(1):124-135. doi: 10.1109/TCBB.2018.2858806. Epub 2018 Jul 23.
3
EL_PSSM-RT: DNA-binding residue prediction by integrating ensemble learning with PSSM Relation Transformation.
探索增强子介导的转录调控在精准生物学中的作用。
Int J Mol Sci. 2023 Jun 29;24(13):10843. doi: 10.3390/ijms241310843.
4
An Evidence Theory and Fuzzy Logic Combined Approach for the Prediction of Potential ARF-Regulated Genes in Quinoa.一种结合证据理论和模糊逻辑的方法用于预测藜麦中潜在的ARF调控基因。
Plants (Basel). 2022 Dec 23;12(1):71. doi: 10.3390/plants12010071.
5
Assessing deep learning methods in cis-regulatory motif finding based on genomic sequencing data.基于基因组测序数据评估 cis-regulatory 基序发现中的深度学习方法。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab374.
6
Biologically relevant transfer learning improves transcription factor binding prediction.生物相关的迁移学习可提高转录因子结合预测。
Genome Biol. 2021 Sep 27;22(1):280. doi: 10.1186/s13059-021-02499-5.
7
Cross-species regulatory sequence activity prediction.跨物种调控序列活性预测。
PLoS Comput Biol. 2020 Jul 20;16(7):e1008050. doi: 10.1371/journal.pcbi.1008050. eCollection 2020 Jul.
EL_PSSM-RT:通过整合集成学习与PSSM关系转换进行DNA结合残基预测
BMC Bioinformatics. 2017 Aug 29;18(1):379. doi: 10.1186/s12859-017-1792-8.
4
Convolutional neural network architectures for predicting DNA-protein binding.用于预测DNA-蛋白质结合的卷积神经网络架构。
Bioinformatics. 2016 Jun 15;32(12):i121-i127. doi: 10.1093/bioinformatics/btw255.
5
PDNAsite: Identification of DNA-binding Site from Protein Sequence by Incorporating Spatial and Sequence Context.PDNAsite:通过整合空间和序列上下文从蛋白质序列中识别 DNA 结合位点。
Sci Rep. 2016 Jun 10;6:27653. doi: 10.1038/srep27653.
6
How motif environment influences transcription factor search dynamics: Finding a needle in a haystack.基序环境如何影响转录因子搜索动力学:大海捞针
Bioessays. 2016 Jul;38(7):605-12. doi: 10.1002/bies.201600005. Epub 2016 May 19.
7
DanQ: a hybrid convolutional and recurrent deep neural network for quantifying the function of DNA sequences.DanQ:一种用于量化DNA序列功能的卷积与循环相结合的深度神经网络。
Nucleic Acids Res. 2016 Jun 20;44(11):e107. doi: 10.1093/nar/gkw226. Epub 2016 Apr 15.
8
Predicting transcription factor site occupancy using DNA sequence intrinsic and cell-type specific chromatin features.利用DNA序列内在特征和细胞类型特异性染色质特征预测转录因子位点占有率。
BMC Bioinformatics. 2016 Jan 11;17 Suppl 1(Suppl 1):4. doi: 10.1186/s12859-015-0846-z.
9
JASPAR 2016: a major expansion and update of the open-access database of transcription factor binding profiles.JASPAR 2016:转录因子结合谱开放获取数据库的重大扩展与更新
Nucleic Acids Res. 2016 Jan 4;44(D1):D110-5. doi: 10.1093/nar/gkv1176. Epub 2015 Nov 3.
10
Predicting effects of noncoding variants with deep learning-based sequence model.使用基于深度学习的序列模型预测非编码变异的影响。
Nat Methods. 2015 Oct;12(10):931-4. doi: 10.1038/nmeth.3547. Epub 2015 Aug 24.