• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

独立成分分析可从不同数据集恢复一致的调控信号。

Independent component analysis recovers consistent regulatory signals from disparate datasets.

机构信息

Department of Bioengineering, University of California San Diego, La Jolla, California, United States of America.

Novo Nordisk Foundation Center for Biosustainability, Technical University of Denmark, Lyngby, Denmark.

出版信息

PLoS Comput Biol. 2021 Feb 2;17(2):e1008647. doi: 10.1371/journal.pcbi.1008647. eCollection 2021 Feb.

DOI:10.1371/journal.pcbi.1008647
PMID:33529205
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7888660/
Abstract

The availability of bacterial transcriptomes has dramatically increased in recent years. This data deluge could result in detailed inference of underlying regulatory networks, but the diversity of experimental platforms and protocols introduces critical biases that could hinder scalable analysis of existing data. Here, we show that the underlying structure of the E. coli transcriptome, as determined by Independent Component Analysis (ICA), is conserved across multiple independent datasets, including both RNA-seq and microarray datasets. We subsequently combined five transcriptomics datasets into a large compendium containing over 800 expression profiles and discovered that its underlying ICA-based structure was still comparable to that of the individual datasets. With this understanding, we expanded our analysis to over 3,000 E. coli expression profiles and predicted three high-impact regulons that respond to oxidative stress, anaerobiosis, and antibiotic treatment. ICA thus enables deep analysis of disparate data to uncover new insights that were not visible in the individual datasets.

摘要

近年来,细菌转录组的可用性显著增加。这一数据洪流可能会导致对潜在调控网络的详细推断,但实验平台和协议的多样性引入了关键的偏差,这可能会阻碍对现有数据的可扩展分析。在这里,我们表明,通过独立成分分析(ICA)确定的大肠杆菌转录组的基础结构在多个独立数据集之间是保守的,包括 RNA-seq 和微阵列数据集。随后,我们将五个转录组数据集合并到一个大型纲要中,其中包含超过 800 个表达谱,并发现其基于 ICA 的基础结构仍然与单个数据集相当。有了这种理解,我们将分析扩展到超过 3000 个大肠杆菌表达谱,并预测了三个对氧化应激、厌氧和抗生素处理有反应的高影响调控子。因此,ICA 能够深入分析不同的数据,揭示单个数据集无法看到的新见解。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abfc/7888660/7ba6d20e27f2/pcbi.1008647.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abfc/7888660/1c0529b73b17/pcbi.1008647.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abfc/7888660/a661196b9052/pcbi.1008647.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abfc/7888660/576c616b32e7/pcbi.1008647.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abfc/7888660/d8a71ed78a5c/pcbi.1008647.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abfc/7888660/c1f8ca7dc748/pcbi.1008647.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abfc/7888660/7ba6d20e27f2/pcbi.1008647.g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abfc/7888660/1c0529b73b17/pcbi.1008647.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abfc/7888660/a661196b9052/pcbi.1008647.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abfc/7888660/576c616b32e7/pcbi.1008647.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abfc/7888660/d8a71ed78a5c/pcbi.1008647.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abfc/7888660/c1f8ca7dc748/pcbi.1008647.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/abfc/7888660/7ba6d20e27f2/pcbi.1008647.g006.jpg

相似文献

1
Independent component analysis recovers consistent regulatory signals from disparate datasets.独立成分分析可从不同数据集恢复一致的调控信号。
PLoS Comput Biol. 2021 Feb 2;17(2):e1008647. doi: 10.1371/journal.pcbi.1008647. eCollection 2021 Feb.
2
Query large scale microarray compendium datasets using a model-based bayesian approach with variable selection.使用基于模型的贝叶斯方法和变量选择来查询大规模微阵列综合数据集。
PLoS One. 2009;4(2):e4495. doi: 10.1371/journal.pone.0004495. Epub 2009 Feb 13.
3
Boolean implication networks derived from large scale, whole genome microarray datasets.从大规模、全基因组微阵列数据集得出的布尔蕴涵网络。
Genome Biol. 2008 Oct 30;9(10):R157. doi: 10.1186/gb-2008-9-10-r157.
4
Gene expression data classification using locally linear discriminant embedding.基于局部线性判别嵌入的基因表达数据分类。
Comput Biol Med. 2010 Oct;40(10):802-10. doi: 10.1016/j.compbiomed.2010.08.003. Epub 2010 Sep 22.
5
transcriptome assembly from a compendium of RNA-seq data sets.基于 RNA-seq 数据集文库的转录组组装。
RNA Biol. 2023 Jan;20(1):77-84. doi: 10.1080/15476286.2023.2189331.
6
iTAP: integrated transcriptomics and phenotype database for stress response of Escherichia coli and Saccharomyces cerevisiae.iTAP:大肠杆菌和酿酒酵母应激反应的综合转录组学与表型数据库
BMC Res Notes. 2015 Dec 12;8:771. doi: 10.1186/s13104-015-1759-7.
7
Deconvolution of transcriptomes and miRNomes by independent component analysis provides insights into biological processes and clinical outcomes of melanoma patients.通过独立成分分析对转录组和 miRNA 组进行反卷积,为黑色素瘤患者的生物学过程和临床结果提供了深入了解。
BMC Med Genomics. 2019 Sep 18;12(1):132. doi: 10.1186/s12920-019-0578-4.
8
Principal components analysis and the reported low intrinsic dimensionality of gene expression microarray data.主成分分析与基因表达微阵列数据所报道的低内在维度
Sci Rep. 2016 Jun 2;6:25696. doi: 10.1038/srep25696.
9
Inferring large-scale gene regulatory networks using a low-order constraint-based algorithm.使用基于低阶约束的算法推断大规模基因调控网络。
Mol Biosyst. 2010 Jun;6(6):988-98. doi: 10.1039/b917571g. Epub 2010 Feb 19.
10
Scalable analysis of cell-type composition from single-cell transcriptomics using deep recurrent learning.利用深度循环神经网络对单细胞转录组学数据进行可扩展的细胞类型组成分析。
Nat Methods. 2019 Apr;16(4):311-314. doi: 10.1038/s41592-019-0353-7. Epub 2019 Mar 18.

引用本文的文献

1
Minimal Gene Signatures Enable High-Accuracy Prediction of Antibiotic Resistance in .最小基因特征可实现对……抗生素耐药性的高精度预测 。 (你提供的原文似乎不完整,最后的“in.”后面应该还有具体内容)
bioRxiv. 2025 May 3:2025.04.29.651273. doi: 10.1101/2025.04.29.651273.
2
iModulonMiner and PyModulon: Software for unsupervised mining of gene expression compendia.iModulonMiner 和 PyModulon:用于非监督挖掘基因表达编目的软件。
PLoS Comput Biol. 2024 Oct 23;20(10):e1012546. doi: 10.1371/journal.pcbi.1012546. eCollection 2024 Oct.
3
CyuR is a dual regulator for L-cysteine dependent antimicrobial resistance in Escherichia coli.

本文引用的文献

1
Machine learning uncovers independently regulated modules in the Bacillus subtilis transcriptome.机器学习揭示枯草芽孢杆菌转录组中独立调控的模块。
Nat Commun. 2020 Dec 11;11(1):6338. doi: 10.1038/s41467-020-20153-9.
2
Revealing 29 sets of independently modulated genes in , their regulators, and role in key physiological response.揭示了其中29组独立调控的基因、它们的调控因子以及在关键生理反应中的作用。
Proc Natl Acad Sci U S A. 2020 Jul 21;117(29):17228-17239. doi: 10.1073/pnas.2008413117. Epub 2020 Jul 2.
3
The Escherichia coli transcriptome mostly consists of independently regulated modules.
CyuR 是大肠杆菌中 L-半胱氨酸依赖型抗微生物抗性的双重调节因子。
Commun Biol. 2024 Sep 17;7(1):1160. doi: 10.1038/s42003-024-06831-0.
4
Modulating bacterial function utilizing A knowledge base of transcriptional regulatory modules.利用转录调控模块知识库调节细菌功能。
Nucleic Acids Res. 2024 Oct 14;52(18):11362-11377. doi: 10.1093/nar/gkae742.
5
Proteome allocation is linked to transcriptional regulation through a modularized transcriptome.蛋白质组分配与转录调节通过模块化转录组相关联。
Nat Commun. 2024 Jun 19;15(1):5234. doi: 10.1038/s41467-024-49231-y.
6
Machine learning analysis of RB-TnSeq fitness data predicts functional gene modules in KT2440.基于 RB-TnSeq 适应度数据的机器学习分析预测了 KT2440 中的功能基因模块。
mSystems. 2024 Mar 19;9(3):e0094223. doi: 10.1128/msystems.00942-23. Epub 2024 Feb 7.
7
High-resolution temporal profiling of E. coli transcriptional response.大肠杆菌转录反应的高分辨率时间剖析。
Nat Commun. 2023 Nov 22;14(1):7606. doi: 10.1038/s41467-023-43173-7.
8
Genome-wide promoter responses to CRISPR perturbations of regulators reveal regulatory networks in Escherichia coli.全基因组启动子对 CRISPR 调控因子扰动的响应揭示了大肠杆菌中的调控网络。
Nat Commun. 2023 Sep 16;14(1):5757. doi: 10.1038/s41467-023-41572-4.
9
A multi-scale expression and regulation knowledge base for Escherichia coli.大肠杆菌多尺度表达和调控知识库。
Nucleic Acids Res. 2023 Oct 27;51(19):10176-10193. doi: 10.1093/nar/gkad750.
10
Machine learning uncovers the transcriptome in microbial communities and during infection.机器学习揭示了微生物群落和感染过程中的转录组。
mSystems. 2023 Oct 26;8(5):e0043723. doi: 10.1128/msystems.00437-23. Epub 2023 Aug 28.
大肠杆菌转录组主要由独立调控的模块组成。
Nat Commun. 2019 Dec 4;10(1):5536. doi: 10.1038/s41467-019-13483-w.
4
Adaptive evolution reveals a tradeoff between growth rate and oxidative stress during naphthoquinone-based aerobic respiration.适应进化揭示了基于萘醌的需氧呼吸过程中生长速率和氧化应激之间的权衡。
Proc Natl Acad Sci U S A. 2019 Dec 10;116(50):25287-25292. doi: 10.1073/pnas.1909987116. Epub 2019 Nov 25.
5
OxyR Is a Convergent Target for Mutations Acquired during Adaptation to Oxidative Stress-Prone Metabolic States.OxyR 是在适应氧化应激易感性代谢状态过程中获得的突变的汇聚靶点。
Mol Biol Evol. 2020 Mar 1;37(3):660-667. doi: 10.1093/molbev/msz251.
6
Independent Component Analysis for Unraveling the Complexity of Cancer Omics Datasets.独立成分分析在癌症组学数据集复杂性研究中的应用
Int J Mol Sci. 2019 Sep 7;20(18):4414. doi: 10.3390/ijms20184414.
7
Extracting Pathway-level Signatures from Proteogenomic Data in Breast Cancer Using Independent Component Analysis.基于独立成分分析从乳腺癌的蛋白质基因组数据中提取通路水平特征。
Mol Cell Proteomics. 2019 Aug 9;18(8 suppl 1):S169-S182. doi: 10.1074/mcp.TIR119.001442. Epub 2019 Jun 18.
8
Genome-wide effects on transcription from ppGpp binding to its two sites on RNA polymerase.全基因组水平上 ppGpp 与 RNA 聚合酶两个结合位点对转录的影响。
Proc Natl Acad Sci U S A. 2019 Apr 23;116(17):8310-8319. doi: 10.1073/pnas.1819682116. Epub 2019 Apr 10.
9
Assessing reproducibility of matrix factorization methods in independent transcriptomes.评估基质分解方法在独立转录组中的可重复性。
Bioinformatics. 2019 Nov 1;35(21):4307-4313. doi: 10.1093/bioinformatics/btz225.
10
The y-ome defines the 35% of Escherichia coli genes that lack experimental evidence of function.y 组定义了大肠杆菌中缺乏功能实验证据的 35%的基因。
Nucleic Acids Res. 2019 Mar 18;47(5):2446-2454. doi: 10.1093/nar/gkz030.