• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习的条件特异性调控基因预测。

Prediction of condition-specific regulatory genes using machine learning.

机构信息

Graduate program in Genetics, Bioinformatics and Computational Biology. Virginia Tech., Blacksburg, VA 24061, USA.

School of Plant and Environmental Sciences. Virginia Tech., Blacksburg, VA 24061, USA.

出版信息

Nucleic Acids Res. 2020 Jun 19;48(11):e62. doi: 10.1093/nar/gkaa264.

DOI:10.1093/nar/gkaa264
PMID:32329779
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7293043/
Abstract

Recent advances in genomic technologies have generated data on large-scale protein-DNA interactions and open chromatin regions for many eukaryotic species. How to identify condition-specific functions of transcription factors using these data has become a major challenge in genomic research. To solve this problem, we have developed a method called ConSReg, which provides a novel approach to integrate regulatory genomic data into predictive machine learning models of key regulatory genes. Using Arabidopsis as a model system, we tested our approach to identify regulatory genes in data sets from single cell gene expression and from abiotic stress treatments. Our results showed that ConSReg accurately predicted transcription factors that regulate differentially expressed genes with an average auROC of 0.84, which is 23.5-25% better than enrichment-based approaches. To further validate the performance of ConSReg, we analyzed an independent data set related to plant nitrogen responses. ConSReg provided better rankings of the correct transcription factors in 61.7% of cases, which is three times better than other plant tools. We applied ConSReg to Arabidopsis single cell RNA-seq data, successfully identifying candidate regulatory genes that control cell wall formation. Our methods provide a new approach to define candidate regulatory genes using integrated genomic data in plants.

摘要

近年来,基因组技术的发展为许多真核生物物种生成了大规模蛋白质-DNA 相互作用和开放染色质区域的数据。如何利用这些数据识别转录因子的特定条件功能,已成为基因组研究中的一个主要挑战。为了解决这个问题,我们开发了一种名为 ConSReg 的方法,该方法为将调控基因组数据整合到关键调控基因的预测机器学习模型中提供了一种新方法。我们使用拟南芥作为模型系统,测试了我们的方法,以从单细胞基因表达和非生物胁迫处理的数据集中识别调控基因。我们的结果表明,ConSReg 可以准确地预测调控差异表达基因的转录因子,其平均 auROC 为 0.84,比基于富集的方法好 23.5-25%。为了进一步验证 ConSReg 的性能,我们分析了一个与植物氮响应相关的独立数据集。在 61.7%的情况下,ConSReg 对正确转录因子的排名更好,比其他植物工具好 3 倍。我们将 ConSReg 应用于拟南芥单细胞 RNA-seq 数据,成功鉴定出控制细胞壁形成的候选调控基因。我们的方法为使用整合的基因组数据在植物中定义候选调控基因提供了一种新方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce6/7293043/99d12cca48e9/gkaa264fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce6/7293043/8466f39d012b/gkaa264fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce6/7293043/16dfa4a8d5f4/gkaa264fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce6/7293043/a0e4c4e88ab0/gkaa264fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce6/7293043/4ddada5d6895/gkaa264fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce6/7293043/7e049d390860/gkaa264fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce6/7293043/99d12cca48e9/gkaa264fig6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce6/7293043/8466f39d012b/gkaa264fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce6/7293043/16dfa4a8d5f4/gkaa264fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce6/7293043/a0e4c4e88ab0/gkaa264fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce6/7293043/4ddada5d6895/gkaa264fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce6/7293043/7e049d390860/gkaa264fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ce6/7293043/99d12cca48e9/gkaa264fig6.jpg

相似文献

1
Prediction of condition-specific regulatory genes using machine learning.基于机器学习的条件特异性调控基因预测。
Nucleic Acids Res. 2020 Jun 19;48(11):e62. doi: 10.1093/nar/gkaa264.
2
Modeling Plant Transcription Factor Networks Using ConSReg.使用 ConSReg 对植物转录因子网络进行建模。
Methods Mol Biol. 2023;2594:205-215. doi: 10.1007/978-1-0716-2815-7_15.
3
Machine learning-based differential network analysis: a study of stress-responsive transcriptomes in Arabidopsis.基于机器学习的差异网络分析:拟南芥应激转录组研究。
Plant Cell. 2014 Feb;26(2):520-37. doi: 10.1105/tpc.113.121913. Epub 2014 Feb 11.
4
StressGenePred: a twin prediction model architecture for classifying the stress types of samples and discovering stress-related genes in arabidopsis.StressGenePred:一种用于对样本的应激类型进行分类和发现拟南芥中与应激相关基因的双胞胎预测模型架构。
BMC Genomics. 2019 Dec 20;20(Suppl 11):949. doi: 10.1186/s12864-019-6283-z.
5
Characterization of WRKY co-regulatory networks in rice and Arabidopsis.水稻和拟南芥中WRKY共调控网络的特征分析
BMC Plant Biol. 2009 Sep 22;9:120. doi: 10.1186/1471-2229-9-120.
6
Expression-based network biology identifies immune-related functional modules involved in plant defense.基于表达谱的网络生物学鉴定了与植物防御相关的免疫功能模块。
BMC Genomics. 2014 Jun 3;15:421. doi: 10.1186/1471-2164-15-421.
7
Identification of transcription factor's targets using tissue-specific transcriptomic data in Arabidopsis thaliana.利用拟南芥组织特异性转录组数据鉴定转录因子的靶标
BMC Syst Biol. 2010 Sep 13;4 Suppl 2(Suppl 2):S2. doi: 10.1186/1752-0509-4-S2-S2.
8
An Arabidopsis expression predictor enables inference of transcriptional regulators for gene modules.拟南芥表达预测器可用于推断基因模块的转录调控因子。
Plant J. 2021 Jul;107(2):597-612. doi: 10.1111/tpj.15315. Epub 2021 Jun 8.
9
Prediction of regulatory interactions in Arabidopsis using gene-expression data and support vector machines.使用基因表达数据和支持向量机预测拟南芥中的调控相互作用。
Plant Physiol Biochem. 2011 Mar;49(3):280-3. doi: 10.1016/j.plaphy.2011.01.002. Epub 2011 Jan 12.
10
Genome-wide identification of potential plant E2F target genes.全基因组范围内潜在植物E2F靶基因的鉴定。
Plant Physiol. 2005 Sep;139(1):316-28. doi: 10.1104/pp.105.066290. Epub 2005 Aug 26.

引用本文的文献

1
Identification of Key Differentially Expressed Genes in Under Short- and Long-Term High Light Stress.短期和长期高光胁迫下关键差异表达基因的鉴定
Int J Mol Sci. 2025 Aug 12;26(16):7790. doi: 10.3390/ijms26167790.
2
SaGP: identifying plant saline-alkali tolerance genes based on machine learning techniques.SaGP:基于机器学习技术鉴定植物耐盐碱基因
Front Plant Sci. 2025 Jul 16;16:1629794. doi: 10.3389/fpls.2025.1629794. eCollection 2025.
3
Using supervised machine-learning approaches to understand abiotic stress tolerance and design resilient crops.

本文引用的文献

1
Widespread long-range cis-regulatory elements in the maize genome.玉米基因组中广泛存在的长程顺式调控元件。
Nat Plants. 2019 Dec;5(12):1237-1249. doi: 10.1038/s41477-019-0547-0. Epub 2019 Nov 18.
2
Enhanced Maps of Transcription Factor Binding Sites Improve Regulatory Networks Learned from Accessible Chromatin Data.转录因子结合位点增强图谱可改善从可及染色质数据中学习到的调控网络。
Plant Physiol. 2019 Oct;181(2):412-425. doi: 10.1104/pp.19.00605. Epub 2019 Jul 25.
3
High-Throughput Single-Cell Transcriptome Profiling of Plant Cell Types.
利用监督式机器学习方法来理解非生物胁迫耐受性并设计抗逆作物。
Philos Trans R Soc Lond B Biol Sci. 2025 May 29;380(1927):20240252. doi: 10.1098/rstb.2024.0252.
4
Accelerating crop improvement via integration of transcriptome-based network biology and genome editing.通过整合基于转录组的网络生物学和基因组编辑加速作物改良。
Planta. 2025 Mar 17;261(4):92. doi: 10.1007/s00425-025-04666-5.
5
Benchmarking short-read metagenomics tools for removing host contamination.用于去除宿主污染的短读长宏基因组学工具的基准测试
Gigascience. 2025 Jan 6;14. doi: 10.1093/gigascience/giaf004.
6
Loss of cold tolerance is conferred by absence of the WRKY34 promoter fragment during tomato evolution.在番茄进化过程中,由于缺失 WRKY34 启动子片段而导致对寒冷耐受性的丧失。
Nat Commun. 2024 Aug 6;15(1):6667. doi: 10.1038/s41467-024-51036-y.
7
Predicting and optimizing reactive oxygen species metabolism in Punica granatum L. through machine learning: role of exogenous GABA on antioxidant enzyme activity under drought and salinity stress.通过机器学习预测和优化石榴中活性氧代谢:外源 GABA 在干旱和盐胁迫下对抗氧化酶活性的作用。
BMC Plant Biol. 2024 Jan 23;24(1):65. doi: 10.1186/s12870-024-04740-2.
8
FUN-PROSE: A deep learning approach to predict condition-specific gene expression in fungi.FUN-PROSE:一种用于预测真菌中条件特异性基因表达的深度学习方法。
PLoS Comput Biol. 2023 Nov 16;19(11):e1011563. doi: 10.1371/journal.pcbi.1011563. eCollection 2023 Nov.
9
Gene Regulatory Network Modeling Using Single-Cell Multi-Omics in Plants.基于植物单细胞多组学的基因调控网络建模
Methods Mol Biol. 2023;2698:259-275. doi: 10.1007/978-1-0716-3354-0_16.
10
Characterization of Gene Regulatory Networks in Plants Using New Methods and Data Types.利用新方法和数据类型对植物基因调控网络进行特征描述。
Methods Mol Biol. 2023;2698:1-11. doi: 10.1007/978-1-0716-3354-0_1.
高通量单细胞转录组分析植物细胞类型。
Cell Rep. 2019 May 14;27(7):2241-2247.e4. doi: 10.1016/j.celrep.2019.04.054.
4
A Single-Cell RNA Sequencing Profiles the Developmental Landscape of Arabidopsis Root.基于单细胞 RNA 测序解析拟南芥根的发育全景。
Mol Plant. 2019 May 6;12(5):648-660. doi: 10.1016/j.molp.2019.04.004. Epub 2019 Apr 17.
5
Network Walking charts transcriptional dynamics of nitrogen signaling by integrating validated and predicted genome-wide interactions.网络漫步图通过整合经过验证和预测的全基因组相互作用来描绘氮信号转导的转录动态。
Nat Commun. 2019 Apr 5;10(1):1569. doi: 10.1038/s41467-019-09522-1.
6
Dynamics of Gene Expression in Single Root Cells of .单细胞根细胞中基因表达的动力学。
Plant Cell. 2019 May;31(5):993-1011. doi: 10.1105/tpc.18.00785. Epub 2019 Mar 28.
7
Spatiotemporal Developmental Trajectories in the Arabidopsis Root Revealed Using High-Throughput Single-Cell RNA Sequencing.利用高通量单细胞 RNA 测序揭示拟南芥根的时空发育轨迹。
Dev Cell. 2019 Mar 25;48(6):840-852.e5. doi: 10.1016/j.devcel.2019.02.022.
8
Single-Cell RNA Sequencing Resolves Molecular Relationships Among Individual Plant Cells.单细胞 RNA 测序解析单个植物细胞间的分子关系。
Plant Physiol. 2019 Apr;179(4):1444-1456. doi: 10.1104/pp.18.01482. Epub 2019 Feb 4.
9
Regulatory chromatin landscape in roots uncovered by coupling INTACT and ATAC-seq.通过结合INTACT和ATAC-seq揭示的根中的调控染色质景观。
Plant Methods. 2018 Dec 20;14:113. doi: 10.1186/s13007-018-0381-9. eCollection 2018.
10
PlantPAN3.0: a new and updated resource for reconstructing transcriptional regulatory networks from ChIP-seq experiments in plants.PlantPAN3.0:一个新的和更新的资源,用于从植物的 ChIP-seq 实验中重建转录调控网络。
Nucleic Acids Res. 2019 Jan 8;47(D1):D1155-D1163. doi: 10.1093/nar/gky1081.