• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于构建调控生物途径的多层层次基因调控网络的递归随机森林算法。

Recursive random forest algorithm for constructing multilayered hierarchical gene regulatory networks that govern biological pathways.

作者信息

Deng Wenping, Zhang Kui, Busov Victor, Wei Hairong

机构信息

School of Forest Resources and Environmental Science, Michigan Technological University, Houghton, MI, United States of America.

Department of Mathematical Sciences Michigan Technological University, Houghton, MI, United States of America.

出版信息

PLoS One. 2017 Feb 3;12(2):e0171532. doi: 10.1371/journal.pone.0171532. eCollection 2017.

DOI:10.1371/journal.pone.0171532
PMID:28158291
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5291523/
Abstract

BACKGROUND

Present knowledge indicates a multilayered hierarchical gene regulatory network (ML-hGRN) often operates above a biological pathway. Although the ML-hGRN is very important for understanding how a pathway is regulated, there is almost no computational algorithm for directly constructing ML-hGRNs.

RESULTS

A backward elimination random forest (BWERF) algorithm was developed for constructing the ML-hGRN operating above a biological pathway. For each pathway gene, the BWERF used a random forest model to calculate the importance values of all transcription factors (TFs) to this pathway gene recursively with a portion (e.g. 1/10) of least important TFs being excluded in each round of modeling, during which, the importance values of all TFs to the pathway gene were updated and ranked until only one TF was remained in the list. The above procedure, termed BWERF. After that, the importance values of a TF to all pathway genes were aggregated and fitted to a Gaussian mixture model to determine the TF retention for the regulatory layer immediately above the pathway layer. The acquired TFs at the secondary layer were then set to be the new bottom layer to infer the next upper layer, and this process was repeated until a ML-hGRN with the expected layers was obtained.

CONCLUSIONS

BWERF improved the accuracy for constructing ML-hGRNs because it used backward elimination to exclude the noise genes, and aggregated the individual importance values for determining the TFs retention. We validated the BWERF by using it for constructing ML-hGRNs operating above mouse pluripotency maintenance pathway and Arabidopsis lignocellulosic pathway. Compared to GENIE3, BWERF showed an improvement in recognizing authentic TFs regulating a pathway. Compared to the bottom-up Gaussian graphical model algorithm we developed for constructing ML-hGRNs, the BWERF can construct ML-hGRNs with significantly reduced edges that enable biologists to choose the implicit edges for experimental validation.

摘要

背景

现有知识表明,多层级基因调控网络(ML-hGRN)通常在生物途径之上运行。尽管ML-hGRN对于理解途径如何被调控非常重要,但几乎没有直接构建ML-hGRN的计算算法。

结果

开发了一种反向消除随机森林(BWERF)算法,用于构建在生物途径之上运行的ML-hGRN。对于每个途径基因,BWERF使用随机森林模型递归计算所有转录因子(TF)对该途径基因的重要性值,在每一轮建模中排除一部分(例如1/10)最不重要的TF,在此期间,更新并排列所有TF对途径基因的重要性值,直到列表中只剩下一个TF。上述过程称为BWERF。之后,将一个TF对所有途径基因的重要性值进行汇总,并拟合到高斯混合模型中,以确定途径层之上紧邻调控层的TF保留情况。然后将在第二层获得的TF设置为新的底层,以推断下一个上层,重复此过程,直到获得具有预期层数的ML-hGRN。

结论

BWERF提高了构建ML-hGRN的准确性,因为它使用反向消除来排除噪声基因,并汇总个体重要性值以确定TF保留情况。我们通过使用BWERF构建在小鼠多能性维持途径和拟南芥木质纤维素途径之上运行的ML-hGRN来验证它。与GENIE3相比,BWERF在识别调控途径的真实TF方面有所改进。与我们为构建ML-hGRN而开发的自底向上高斯图形模型算法相比,BWERF可以构建边显著减少的ML-hGRN,使生物学家能够选择隐含边进行实验验证。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ed/5291523/973d538598f5/pone.0171532.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ed/5291523/c5f7ca22c2ce/pone.0171532.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ed/5291523/5593c0f0f275/pone.0171532.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ed/5291523/4df8830a4ce8/pone.0171532.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ed/5291523/973d538598f5/pone.0171532.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ed/5291523/c5f7ca22c2ce/pone.0171532.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ed/5291523/5593c0f0f275/pone.0171532.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ed/5291523/4df8830a4ce8/pone.0171532.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/79ed/5291523/973d538598f5/pone.0171532.g004.jpg

相似文献

1
Recursive random forest algorithm for constructing multilayered hierarchical gene regulatory networks that govern biological pathways.用于构建调控生物途径的多层层次基因调控网络的递归随机森林算法。
PLoS One. 2017 Feb 3;12(2):e0171532. doi: 10.1371/journal.pone.0171532. eCollection 2017.
2
Bottom-up GGM algorithm for constructing multilayered hierarchical gene regulatory networks that govern biological pathways or processes.用于构建调控生物途径或过程的多层层次基因调控网络的自下而上的GGM算法。
BMC Bioinformatics. 2016 Mar 18;17:132. doi: 10.1186/s12859-016-0981-1.
3
Construction of a hierarchical gene regulatory network centered around a transcription factor.构建以转录因子为中心的层次基因调控网络。
Brief Bioinform. 2019 May 21;20(3):1021-1031. doi: 10.1093/bib/bbx152.
4
Transcription factors involved in the regulatory networks governing the Calvin-Benson-Bassham cycle.调控卡尔文-本森-巴斯汉姆循环的调控网络中涉及的转录因子。
Tree Physiol. 2019 Jul 18;39(7):1159-1172. doi: 10.1093/treephys/tpz025.
5
SND1 transcription factor-directed quantitative functional hierarchical genetic regulatory network in wood formation in Populus trichocarpa.毛白杨木质部形成过程中 SND1 转录因子定向的定量功能层次遗传调控网络。
Plant Cell. 2013 Nov;25(11):4324-41. doi: 10.1105/tpc.113.117697. Epub 2013 Nov 26.
6
Construction of two regulatory networks related to salt stress and lignocellulosic synthesis under salt stress based on a Populus davidiana × P. bolleana transcriptome analysis.基于毛白杨 × 欧美杨转录组分析构建两个与盐胁迫及盐胁迫下木质素合成相关的调控网络。
Plant Mol Biol. 2022 Aug;109(6):689-702. doi: 10.1007/s11103-022-01267-8. Epub 2022 Apr 29.
7
TF-Cluster: a pipeline for identifying functionally coordinated transcription factors via network decomposition of the shared coexpression connectivity matrix (SCCM).TF-Cluster:一种通过共享共表达连接矩阵(SCCM)的网络分解来识别功能协调转录因子的流程。
BMC Syst Biol. 2011 Apr 15;5:53. doi: 10.1186/1752-0509-5-53.
8
TF-finder: a software package for identifying transcription factors involved in biological processes using microarray data and existing knowledge base.TF-finder:一款软件包,用于使用微阵列数据和现有的知识库识别参与生物过程的转录因子。
BMC Bioinformatics. 2010 Aug 12;11:425. doi: 10.1186/1471-2105-11-425.
9
Trimming of mammalian transcriptional networks using network component analysis.使用网络组件分析修剪哺乳动物转录网络。
BMC Bioinformatics. 2010 Oct 13;11:511. doi: 10.1186/1471-2105-11-511.
10
TGMI: an efficient algorithm for identifying pathway regulators through evaluation of triple-gene mutual interaction.TGMI:一种通过评估三基因相互作用来识别途径调节剂的有效算法。
Nucleic Acids Res. 2018 Jun 20;46(11):e67. doi: 10.1093/nar/gky210.

引用本文的文献

1
Integrating multi-omics to unveil PSMD12 as a critical gene in promoting brain metastases of lung adenocarcinoma.整合多组学技术揭示PSMD12作为促进肺腺癌脑转移的关键基因。
J Transl Med. 2025 Jun 17;23(1):668. doi: 10.1186/s12967-025-06680-3.
2
Current status and trends in forest genomics.森林基因组学的现状与趋势
For Res (Fayettev). 2022 Aug 31;2:11. doi: 10.48130/FR-2022-0011. eCollection 2022.
3
HB-PLS: A statistical method for identifying biological process or pathway regulators by integrating Huber loss and Berhu penalty with partial least squares regression.

本文引用的文献

1
Drought resistance - is it really a complex trait?抗旱性——它真的是一个复杂性状吗?
Funct Plant Biol. 2011 Oct;38(10):753-757. doi: 10.1071/FP11101.
2
Bottom-up GGM algorithm for constructing multilayered hierarchical gene regulatory networks that govern biological pathways or processes.用于构建调控生物途径或过程的多层层次基因调控网络的自下而上的GGM算法。
BMC Bioinformatics. 2016 Mar 18;17:132. doi: 10.1186/s12859-016-0981-1.
3
Genetic networks involved in poplar root response to low nitrogen.参与杨树根系响应低氮的遗传网络。
HB-PLS:一种通过将Huber损失和Berhu惩罚与偏最小二乘回归相结合来识别生物过程或通路调节因子的统计方法。
For Res (Fayettev). 2021 Mar 30;1:6. doi: 10.48130/FR-2021-0006. eCollection 2021.
4
Genome-wide identification and gene expression networks of LBD transcription factors in Populus trichocarpa.杨属 LBD 转录因子的全基因组鉴定和基因表达网络。
BMC Genomics. 2024 Oct 3;25(1):920. doi: 10.1186/s12864-024-10848-4.
5
TGPred: efficient methods for predicting target genes of a transcription factor by integrating statistics, machine learning and optimization.TGPred:通过整合统计学、机器学习和优化技术来预测转录因子靶基因的有效方法。
NAR Genom Bioinform. 2023 Sep 13;5(3):lqad083. doi: 10.1093/nargab/lqad083. eCollection 2023 Sep.
6
Conserved hierarchical gene regulatory networks for drought and cold stress response in .植物中干旱和寒冷胁迫响应的保守层次基因调控网络 (原文句子不完整,根据常见语境补充“植物中”使句子完整通顺)
Front Plant Sci. 2023 Apr 14;14:1155504. doi: 10.3389/fpls.2023.1155504. eCollection 2023.
7
Construction of a Hierarchical Gene Regulatory Network to Reveal the Drought Tolerance Mechanism of Shanxin Poplar.构建层次化基因调控网络揭示山新杨的耐旱机制。
Int J Mol Sci. 2022 Dec 26;24(1):384. doi: 10.3390/ijms24010384.
8
Growth-regulating factor 15-mediated gene regulatory network enhances salt tolerance in poplar.生长调节因子 15 介导的基因调控网络增强杨树的耐盐性。
Plant Physiol. 2023 Apr 3;191(4):2367-2384. doi: 10.1093/plphys/kiac600.
9
Genome-Wide Identification of WRKY Family Genes and the Expression Profiles in Response to Nitrogen Deficiency in Poplar.杨树响应氮缺乏的全基因组 WRKY 家族基因鉴定及表达谱分析
Genes (Basel). 2022 Dec 10;13(12):2324. doi: 10.3390/genes13122324.
10
Machine Learning for Causal Inference in Biological Networks: Perspectives of This Challenge.用于生物网络因果推断的机器学习:这一挑战的视角
Front Bioinform. 2021 Sep 22;1:746712. doi: 10.3389/fbinf.2021.746712. eCollection 2021.
Plant Signal Behav. 2013 Nov;8(11):e27211. doi: 10.4161/psb.27211. Epub 2013 Dec 3.
4
SND1 transcription factor-directed quantitative functional hierarchical genetic regulatory network in wood formation in Populus trichocarpa.毛白杨木质部形成过程中 SND1 转录因子定向的定量功能层次遗传调控网络。
Plant Cell. 2013 Nov;25(11):4324-41. doi: 10.1105/tpc.113.117697. Epub 2013 Nov 26.
5
Nitrogen deprivation promotes Populus root growth through global transcriptome reprogramming and activation of hierarchical genetic networks.氮饥饿通过全局转录组重编程和分层遗传网络的激活促进杨树根系生长。
New Phytol. 2013 Oct;200(2):483-497. doi: 10.1111/nph.12375. Epub 2013 Jun 25.
6
Ptr-miR397a is a negative regulator of laccase genes affecting lignin content in Populus trichocarpa.Ptr-miR397a 是漆酶基因的负调控因子,影响毛白杨中的木质素含量。
Proc Natl Acad Sci U S A. 2013 Jun 25;110(26):10848-53. doi: 10.1073/pnas.1308936110. Epub 2013 Jun 10.
7
A hierarchical gene regulatory network for adaptive multirobot pattern formation.用于自适应多机器人模式形成的分层基因调控网络。
IEEE Trans Syst Man Cybern B Cybern. 2012 Jun;42(3):805-16. doi: 10.1109/TSMCB.2011.2178021. Epub 2012 Jan 31.
8
Rewiring of transcriptional regulatory networks: hierarchy, rather than connectivity, better reflects the importance of regulators.转录调控网络的重布线:层次结构,而不是连接性,更好地反映了调控因子的重要性。
Sci Signal. 2010 Nov 2;3(146):ra79. doi: 10.1126/scisignal.2001014.
9
Inferring regulatory networks from expression data using tree-based methods.基于树的方法从表达数据推断调控网络。
PLoS One. 2010 Sep 28;5(9):e12776. doi: 10.1371/journal.pone.0012776.
10
Inferring the conservative causal core of gene regulatory networks.推断基因调控网络的保守因果核心。
BMC Syst Biol. 2010 Sep 28;4:132. doi: 10.1186/1752-0509-4-132.