Inferring TF activities and activity regulators from gene expression data with constraints from TF perturbation data.

Author information

Center for Genome Sciences and Systems Biology, Washington University School of Medicine, St. Louis, MO 63110, USA.

Department of Computer Science and Engineering, Washington University, St. Louis, MO 63130, USA.

Publication information

Bioinformatics. 2021 Jun 9;37(9):1234-1245. doi: 10.1093/bioinformatics/btaa947.

Abstract

MOTIVATION

The activity of a transcription factor (TF) in a sample of cells is the extent to which it is exerting its regulatory potential. Many methods of inferring TF activity from gene expression data have been described, but due to the lack of appropriate large-scale datasets, systematic and objective validation has not been possible until now.

RESULTS

We systematically evaluate and optimize the approach to TF activity inference in which a gene expression matrix is factored into a condition-independent matrix of control strengths and a condition-dependent matrix of TF activity levels. We find that expression data in which the activities of individual TFs have been perturbed are both necessary and sufficient for obtaining good performance. To a considerable extent, control strengths inferred using expression data from one growth condition carry over to other conditions, so the control strength matrices derived here can be used by others. Finally, we apply these methods to gain insight into the upstream factors that regulate the activities of yeast TFs Gcr2, Gln3, Gcn4 and Msn2.
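The factorization described above can be illustrated with a minimal sketch. It assumes a plain alternating least-squares scheme with a fixed binary mask of allowed TF-gene edges; this is an illustration of the general idea, not the authors' implementation. The function name `factor_expression`, the `mask` argument, and the synthetic data are all hypothetical.

```python
import numpy as np


def factor_expression(E, mask, n_iter=100, seed=0):
    """Approximate E (genes x samples) as C @ A by alternating least squares.

    C (genes x TFs) holds condition-independent control strengths and is
    forced to zero wherever `mask` is False. A (TFs x samples) holds
    condition-dependent TF activity levels.
    """
    rng = np.random.default_rng(seed)
    n_genes = mask.shape[0]
    # Random starting control strengths on the allowed edges only.
    C = np.where(mask, rng.normal(size=mask.shape), 0.0)
    for _ in range(n_iter):
        # Hold C fixed and solve for the activities A.
        A, *_ = np.linalg.lstsq(C, E, rcond=None)
        # Hold A fixed and refit each gene over its allowed TFs only.
        for g in range(n_genes):
            tfs = np.flatnonzero(mask[g])
            if tfs.size:
                coef, *_ = np.linalg.lstsq(A[tfs].T, E[g], rcond=None)
                C[g, tfs] = coef
    # Final activity update so that A matches the returned C.
    A, *_ = np.linalg.lstsq(C, E, rcond=None)
    return C, A


# Synthetic demonstration data (not from the paper): 30 genes, 3 TFs, 20 samples.
rng = np.random.default_rng(1)
mask = rng.random((30, 3)) < 0.4
mask[np.arange(30), rng.integers(0, 3, size=30)] = True  # every gene has a TF
C_true = np.where(mask, rng.normal(size=mask.shape), 0.0)
A_true = rng.normal(size=(3, 20))
E = C_true @ A_true

C, A = factor_expression(E, mask)
rel_err = np.linalg.norm(E - C @ A) / np.linalg.norm(E)
```

A factorization of this form is determined only up to a rescaling of each TF (multiplying a column of C by a constant and dividing the matching row of A by the same constant leaves the product unchanged), so the inferred activities need some normalization or external constraint before they can be compared across TFs.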

AVAILABILITY AND IMPLEMENTATION

Evaluation code and data are available at https://doi.org/10.5281/zenodo.4050573.

SUPPLEMENTARY INFORMATION

Supplementary data are available at Bioinformatics online.
