• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于机器学习的全细胞转录反应预测。

Prediction of whole-cell transcriptional response with machine learning.

机构信息

Data Science, Netrias, LLC, Annapolis, MD 21409, USA.

Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA.

出版信息

Bioinformatics. 2022 Jan 3;38(2):404-409. doi: 10.1093/bioinformatics/btab676.

DOI:10.1093/bioinformatics/btab676
PMID:34570169
Abstract

MOTIVATION

Applications in synthetic and systems biology can benefit from measuring whole-cell response to biochemical perturbations. Execution of experiments to cover all possible combinations of perturbations is infeasible. In this paper, we present the host response model (HRM), a machine learning approach that maps response of single perturbations to transcriptional response of the combination of perturbations.

RESULTS

The HRM combines high-throughput sequencing with machine learning to infer links between experimental context, prior knowledge of cell regulatory networks, and RNASeq data to predict a gene's dysregulation. We find that the HRM can predict the directionality of dysregulation to a combination of inducers with an accuracy of >90% using data from single inducers. We further find that the use of prior, known cell regulatory networks doubles the predictive performance of the HRM (an R2 from 0.3 to 0.65). The model was validated in two organisms, Escherichia coli and Bacillus subtilis, using new experiments conducted after training. Finally, while the HRM is trained with gene expression data, the direct prediction of differential expression makes it possible to also conduct enrichment analyses using its predictions. We show that the HRM can accurately classify >95% of the pathway regulations. The HRM reduces the number of RNASeq experiments needed as responses can be tested in silico prior to the experiment.

AVAILABILITY AND IMPLEMENTATION

The HRM software and tutorial are available at https://github.com/sd2e/CDM and the configurable differential expression analysis tools and tutorials are available at https://github.com/SD2E/omics_tools.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.

摘要

动机

合成和系统生物学中的应用可以受益于测量整个细胞对生化扰动的反应。执行实验以涵盖所有可能的扰动组合是不可行的。在本文中,我们提出了宿主反应模型(HRM),这是一种机器学习方法,它将单个扰动的反应映射到扰动组合的转录反应。

结果

HRM 将高通量测序与机器学习相结合,以推断实验背景、细胞调控网络的先验知识和 RNASeq 数据之间的联系,从而预测基因的失调。我们发现,HRM 可以使用来自单个诱导剂的数据,以 >90%的准确率预测组合诱导剂的失调方向。我们进一步发现,使用先验的、已知的细胞调控网络可以使 HRM 的预测性能提高一倍(从 0.3 到 0.65 的 R2)。该模型在两种生物,大肠杆菌和枯草芽孢杆菌中进行了验证,使用了在训练后进行的新实验。最后,虽然 HRM 是使用基因表达数据进行训练的,但通过其预测进行差异表达的直接预测使其能够使用其预测进行富集分析。我们表明,HRM 可以准确地对 >95%的途径调控进行分类。HRM 减少了所需的 RNASeq 实验数量,因为可以在实验之前在计算机上测试反应。

可用性和实施

HRM 软件和教程可在 https://github.com/sd2e/CDM 上获得,可配置的差异表达分析工具和教程可在 https://github.com/SD2E/omics_tools 上获得。

补充信息

补充数据可在生物信息学在线获得。

相似文献

1
Prediction of whole-cell transcriptional response with machine learning.基于机器学习的全细胞转录反应预测。
Bioinformatics. 2022 Jan 3;38(2):404-409. doi: 10.1093/bioinformatics/btab676.
2
Dr.VAE: improving drug response prediction via modeling of drug perturbation effects.VAE 博士:通过建模药物干扰效应来改善药物反应预测。
Bioinformatics. 2019 Oct 1;35(19):3743-3751. doi: 10.1093/bioinformatics/btz158.
3
Simulation-assisted machine learning.模拟辅助机器学习。
Bioinformatics. 2019 Oct 15;35(20):4072-4080. doi: 10.1093/bioinformatics/btz199.
4
Machine learning in computational biology to accelerate high-throughput protein expression.计算生物学中的机器学习以加速高通量蛋白质表达。
Bioinformatics. 2017 Aug 15;33(16):2487-2495. doi: 10.1093/bioinformatics/btx207.
5
ABEILLE: a novel method for ABerrant Expression Identification empLoying machine LEarning from RNA-sequencing data.ABEILLE:一种基于机器学习的 RNA-seq 数据的异常表达识别新方法。
Bioinformatics. 2022 Oct 14;38(20):4754-4761. doi: 10.1093/bioinformatics/btac603.
6
Effectidor: an automated machine-learning-based web server for the prediction of type-III secretion system effectors.效应物预测器:一种基于自动化机器学习的网络服务器,用于预测 III 型分泌系统效应物。
Bioinformatics. 2022 Apr 12;38(8):2341-2343. doi: 10.1093/bioinformatics/btac087.
7
OpenBioLink: a benchmarking framework for large-scale biomedical link prediction.OpenBioLink:大规模生物医学链接预测的基准测试框架。
Bioinformatics. 2020 Jul 1;36(13):4097-4098. doi: 10.1093/bioinformatics/btaa274.
8
BioKEEN: a library for learning and evaluating biological knowledge graph embeddings.BioKEEN:用于学习和评估生物知识图嵌入的库。
Bioinformatics. 2019 Sep 15;35(18):3538-3540. doi: 10.1093/bioinformatics/btz117.
9
DBFE: distribution-based feature extraction from structural variants in whole-genome data.DBFE:从全基因组数据中的结构变异进行基于分布的特征提取。
Bioinformatics. 2022 Sep 30;38(19):4466-4473. doi: 10.1093/bioinformatics/btac513.
10
DIRECTION: a machine learning framework for predicting and characterizing DNA methylation and hydroxymethylation in mammalian genomes.一种用于预测和描述哺乳动物基因组中 DNA 甲基化和羟甲基化的机器学习框架。
Bioinformatics. 2017 Oct 1;33(19):2986-2994. doi: 10.1093/bioinformatics/btx316.

引用本文的文献

1
Cognitive Nests: Nested Data-Driven Decision Support System in Regenerative Design from Biology to Ecology.认知巢:从生物学到生态学的再生设计中基于嵌套数据驱动的决策支持系统
3D Print Addit Manuf. 2025 Apr 14;12(2):192-198. doi: 10.1089/3dp.2023.0331. eCollection 2025 Apr.
2
Predicting the effects of cultivation condition on gene regulation in by using deep learning.利用深度学习预测培养条件对[具体对象]基因调控的影响。 (注:原文中“in”后面缺少具体内容)
Comput Struct Biotechnol J. 2023 Apr 13;21:2613-2620. doi: 10.1016/j.csbj.2023.04.010. eCollection 2023.
3
Israel and the global synthetic biology ecosystem.
以色列与全球合成生物学生态系统。
Eng Biol. 2023 Oct 14;7(1-4):18-28. doi: 10.1049/enb2.12027. eCollection 2023 Dec.
4
Investigating and Modeling the Factors That Affect Genetic Circuit Performance.研究和建模影响遗传电路性能的因素。
ACS Synth Biol. 2023 Nov 17;12(11):3189-3204. doi: 10.1021/acssynbio.3c00151. Epub 2023 Nov 2.
5
Robustness and reproducibility of simple and complex synthetic logic circuit designs using a DBTL loop.使用DBTL循环的简单和复杂合成逻辑电路设计的稳健性和可重复性。
Synth Biol (Oxf). 2023 Mar 28;8(1):ysad005. doi: 10.1093/synbio/ysad005. eCollection 2023.
6
A toolkit for enhanced reproducibility of RNASeq analysis for synthetic biologists.合成生物学家用于提高RNA测序分析可重复性的工具包。
Synth Biol (Oxf). 2022 Aug 23;7(1):ysac012. doi: 10.1093/synbio/ysac012. eCollection 2022.
7
Data-driven prediction in dynamical systems: recent developments.数据驱动的动态系统预测:最新进展。
Philos Trans A Math Phys Eng Sci. 2022 Aug 8;380(2229):20210213. doi: 10.1098/rsta.2021.0213. Epub 2022 Jun 20.