快速解码单细胞类型特异性转录因子结合图谱，达到单核苷酸分辨率。

Fast decoding cell type-specific transcription factor binding landscape at single-nucleotide resolution.

机构信息

Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan 48109, USA.

出版信息

Genome Res. 2021 Apr;31(4):721-731. doi: 10.1101/gr.269613.120. Epub 2021 Mar 19.

DOI:10.1101/gr.269613.120

PMID:33741685

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8015851/

Abstract

Decoding the cell type-specific transcription factor (TF) binding landscape at single-nucleotide resolution is crucial for understanding the regulatory mechanisms underlying many fundamental biological processes and human diseases. However, limits on time and resources restrict the high-resolution experimental measurements of TF binding profiles of all possible TF-cell type combinations. Previous computational approaches either cannot distinguish the cell context-dependent TF binding profiles across diverse cell types or can only provide a relatively low-resolution prediction. Here we present a novel deep learning approach, Leopard, for predicting TF binding sites at single-nucleotide resolution, achieving the average area under receiver operating characteristic curve (AUROC) of 0.982 and the average area under precision recall curve (AUPRC) of 0.208. Our method substantially outperformed the state-of-the-art methods Anchor and FactorNet, improving the predictive AUPRC by 19% and 27%, respectively, when evaluated at 200-bp resolution. Meanwhile, by leveraging a many-to-many neural network architecture, Leopard features a hundredfold to thousandfold speedup compared with current many-to-one machine learning methods.

摘要

解析单细胞转录因子（TF）结合图谱的核苷酸分辨率对于理解许多基本生物学过程和人类疾病的调控机制至关重要。然而，时间和资源的限制限制了所有可能的 TF-细胞类型组合的 TF 结合谱的高分辨率实验测量。以前的计算方法要么不能区分不同细胞类型中依赖于细胞环境的 TF 结合谱，要么只能提供相对较低分辨率的预测。在这里，我们提出了一种新的深度学习方法 Leopard，用于预测单核苷酸分辨率的 TF 结合位点，平均接收者操作特征曲线下面积（AUROC）为 0.982，平均精度召回曲线下面积（AUPRC）为 0.208。我们的方法大大优于最先进的方法 Anchor 和 FactorNet，当评估分辨率为 200 个碱基时，预测 AUPRC 分别提高了 19%和 27%。同时，通过利用多对多神经网络架构，与当前的多对一机器学习方法相比，Leopard 的速度提高了百倍到千倍。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/645f/8015851/0422b9a602e0/721f01.jpg

相似文献

Fast decoding cell type-specific transcription factor binding landscape at single-nucleotide resolution.快速解码单细胞类型特异性转录因子结合图谱，达到单核苷酸分辨率。

Genome Res. 2021 Apr;31(4):721-731. doi: 10.1101/gr.269613.120. Epub 2021 Mar 19.

FactorNet: A deep learning framework for predicting cell type specific transcription factor binding from nucleotide-resolution sequential data.FactorNet：一种从核苷酸分辨率序列数据预测细胞类型特异性转录因子结合的深度学习框架。

Methods. 2019 Aug 15;166:40-47. doi: 10.1016/j.ymeth.2019.03.020. Epub 2019 Mar 26.

Establishment of noninvasive diabetes risk prediction model based on tongue features and machine learning techniques.基于舌象特征和机器学习技术的无创糖尿病风险预测模型的建立。

Int J Med Inform. 2021 May;149:104429. doi: 10.1016/j.ijmedinf.2021.104429. Epub 2021 Feb 22.

NetTIME: a multitask and base-pair resolution framework for improved transcription factor binding site prediction.NetTIME：一个用于提高转录因子结合位点预测的多任务和碱基对分辨率框架。

Bioinformatics. 2022 Oct 14;38(20):4762-4770. doi: 10.1093/bioinformatics/btac569.

Cross-Species Prediction of Transcription Factor Binding by Adversarial Training of a Novel Nucleotide-Level Deep Neural Network.通过新型核苷酸级别的深度神经网络的对抗训练对转录因子结合进行跨物种预测。

Adv Sci (Weinh). 2024 Sep;11(36):e2405685. doi: 10.1002/advs.202405685. Epub 2024 Jul 30.

Cross-Cell-Type Prediction of TF-Binding Site by Integrating Convolutional Neural Network and Adversarial Network.基于卷积神经网络和对抗网络的跨细胞类型预测 TF 结合位点

Int J Mol Sci. 2019 Jul 12;20(14):3425. doi: 10.3390/ijms20143425.

Imputation for transcription factor binding predictions based on deep learning.基于深度学习的转录因子结合预测插补

PLoS Comput Biol. 2017 Feb 24;13(2):e1005403. doi: 10.1371/journal.pcbi.1005403. eCollection 2017 Feb.

Enhancing the interpretability of transcription factor binding site prediction using attention mechanism.利用注意力机制提高转录因子结合位点预测的可解释性。

Sci Rep. 2020 Aug 7;10(1):13413. doi: 10.1038/s41598-020-70218-4.

GNet: An integrated context-aware neural framework for transcription factor binding signal at single nucleotide resolution prediction.GNet：一种用于单核苷酸分辨率预测转录因子结合信号的综合上下文感知神经框架。

Math Biosci Eng. 2023 Jul 31;20(9):15809-15829. doi: 10.3934/mbe.2023704.

HAMPLE: deciphering TF-DNA binding mechanism in different cellular environments by characterizing higher-order nucleotide dependency.通过描述更高阶核苷酸的依赖性来破译不同细胞环境中的 TF-DNA 结合机制。

Bioinformatics. 2023 May 4;39(5). doi: 10.1093/bioinformatics/btad299.

引用本文的文献

Predicting gene expression from DNA sequence using deep learning models.使用深度学习模型从DNA序列预测基因表达。

Nat Rev Genet. 2025 May 13. doi: 10.1038/s41576-025-00841-2.

Predicting cell type-specific epigenomic profiles accounting for distal genetic effects.预测细胞类型特异性表观基因组图谱，同时考虑远端遗传效应。

Nat Commun. 2024 Nov 16;15(1):9951. doi: 10.1038/s41467-024-54441-5.

Approaches for Benchmarking Single-Cell Gene Regulatory Network Methods.单细胞基因调控网络方法的基准测试方法

Bioinform Biol Insights. 2024 Nov 4;18:11779322241287120. doi: 10.1177/11779322241287120. eCollection 2024.

Identifying transcription factors with cell-type specific DNA binding signatures.鉴定具有细胞类型特异性 DNA 结合特征的转录因子。

BMC Genomics. 2024 Oct 14;25(1):957. doi: 10.1186/s12864-024-10859-1.

Adv Sci (Weinh). 2024 Sep;11(36):e2405685. doi: 10.1002/advs.202405685. Epub 2024 Jul 30.

Comparative analysis of models in predicting the effects of SNPs on TF-DNA binding using large-scale in vitro and in vivo data.利用大规模的体外和体内数据对 SNP 对 TF-DNA 结合影响的预测模型进行比较分析。

Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae110.

A generalizable framework to comprehensively predict epigenome, chromatin organization, and transcriptome.一个可推广的框架，全面预测表观基因组、染色质组织和转录组。

Nucleic Acids Res. 2023 Jul 7;51(12):5931-5947. doi: 10.1093/nar/gkad436.

Artificial Intelligence for Dementia Research Methods Optimization.用于痴呆症研究方法优化的人工智能

ArXiv. 2023 Mar 2:arXiv:2303.01949v1.

maxATAC: Genome-scale transcription-factor binding prediction from ATAC-seq with deep neural networks.maxATAC：基于深度神经网络的 ATAC-seq 全基因组转录因子结合预测

PLoS Comput Biol. 2023 Jan 31;19(1):e1010863. doi: 10.1371/journal.pcbi.1010863. eCollection 2023 Jan.

Computational approaches to understand transcription regulation in development.计算方法在发育中理解转录调控。

Biochem Soc Trans. 2023 Feb 27;51(1):1-12. doi: 10.1042/BST20210145.

本文引用的文献

Base-resolution models of transcription-factor binding reveal soft motif syntax.基于分辨率的转录因子结合模型揭示了软基序语法。

Nat Genet. 2021 Mar;53(3):354-366. doi: 10.1038/s41588-021-00782-6. Epub 2021 Feb 18.

DeepSleep convolutional neural network allows accurate and fast detection of sleep arousal.深度睡眠卷积神经网络能够准确快速地检测睡眠觉醒。

Commun Biol. 2021 Jan 4;4(1):18. doi: 10.1038/s42003-020-01542-8.

Cross-species regulatory sequence activity prediction.跨物种调控序列活性预测。

PLoS Comput Biol. 2020 Jul 20;16(7):e1008050. doi: 10.1371/journal.pcbi.1008050. eCollection 2020 Jul.

Whole-genome deep-learning analysis identifies contribution of noncoding mutations to autism risk.全基因组深度学习分析鉴定非编码突变对自闭症风险的贡献。

Nat Genet. 2019 Jun;51(6):973-980. doi: 10.1038/s41588-019-0420-0. Epub 2019 May 27.

Recognizing basal cell carcinoma on smartphone-captured digital histopathology images with a deep neural network.利用深度神经网络在智能手机拍摄的数字组织病理学图像上识别基底细胞癌。

Br J Dermatol. 2020 Mar;182(3):754-762. doi: 10.1111/bjd.18026. Epub 2019 Aug 22.

Deep learning: new computational modelling techniques for genomics.深度学习：基因组学的新计算建模技术。

Nat Rev Genet. 2019 Jul;20(7):389-403. doi: 10.1038/s41576-019-0122-6.

Methods. 2019 Aug 15;166:40-47. doi: 10.1016/j.ymeth.2019.03.020. Epub 2019 Mar 26.

Accurate prediction of cell type-specific transcription factor binding.准确预测细胞类型特异性转录因子结合。

Genome Biol. 2019 Jan 10;20(1):9. doi: 10.1186/s13059-018-1614-y.

Anchor: trans-cell type prediction of transcription factor binding sites.预测转录因子结合位点的跨细胞类型。

Genome Res. 2019 Feb;29(2):281-292. doi: 10.1101/gr.237156.118. Epub 2018 Dec 19.

A primer on deep learning in genomics.深度学习在基因组学中的应用简介。

Nat Genet. 2019 Jan;51(1):12-18. doi: 10.1038/s41588-018-0295-5. Epub 2018 Nov 26.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

快速解码单细胞类型特异性转录因子结合图谱，达到单核苷酸分辨率。

Fast decoding cell type-specific transcription factor binding landscape at single-nucleotide resolution.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献