Suppr超能文献

基于多元稀疏融合的遗传调控标志物指导异质性分析。

Biomarker-guided heterogeneity analysis of genetic regulations via multivariate sparse fusion.

机构信息

School of Mathematical Sciences, and Key Laboratory of Big Data Mining and Knowledge Management, Chinese Academy of Science, Beijing, China.

Center for Applied Statistics, School of Statistics, Renmin University of China, Beijing, China.

出版信息

Stat Med. 2021 Jul 30;40(17):3915-3936. doi: 10.1002/sim.9006. Epub 2021 Apr 27.

Abstract

Heterogeneity is a hallmark of many complex diseases. There are multiple ways of defining heterogeneity, among which the heterogeneity in genetic regulations, for example, gene expressions (GEs) by copy number variations (CNVs), and methylation, has been suggested but little investigated. Heterogeneity in genetic regulations can be linked with disease severity, progression, and other traits and is biologically important. However, the analysis can be very challenging with the high dimensionality of both sides of regulation as well as sparse and weak signals. In this article, we consider the scenario where subjects form unknown subgroups, and each subgroup has unique genetic regulation relationships. Further, such heterogeneity is "guided" by a known biomarker. We develop a multivariate sparse fusion (MSF) approach, which innovatively applies the penalized fusion technique to simultaneously determine the number and structure of subgroups and regulation relationships within each subgroup. An effective computational algorithm is developed, and extensive simulations are conducted. The analysis of heterogeneity in the GE-CNV regulations in melanoma and GE-methylation regulations in stomach cancer using the TCGA data leads to interesting findings.

摘要

异质性是许多复杂疾病的标志。有多种定义异质性的方法,其中遗传调控的异质性,例如通过拷贝数变异(CNVs)和甲基化的基因表达(GEs)已经被提出但研究甚少。遗传调控的异质性可以与疾病的严重程度、进展和其他特征相关联,并且具有重要的生物学意义。然而,由于调控的双方具有高维度以及稀疏和微弱的信号,因此分析可能非常具有挑战性。在本文中,我们考虑了这样一种情况,即受试者形成未知的亚组,并且每个亚组都具有独特的遗传调控关系。此外,这种异质性是由一个已知的生物标志物“引导”的。我们开发了一种多元稀疏融合(MSF)方法,该方法创新性地应用了惩罚融合技术,以同时确定每个亚组内的亚组数量和结构以及调控关系。开发了一种有效的计算算法,并进行了广泛的模拟。使用 TCGA 数据对黑色素瘤的 GE-CNV 调控异质性和胃癌的 GE-甲基化调控异质性进行分析,得出了有趣的发现。

相似文献

5
Assisted gene expression-based clustering with AWNCut.基于辅助基因表达的聚类分析与 AWNCut。
Stat Med. 2018 Dec 20;37(29):4386-4403. doi: 10.1002/sim.7928. Epub 2018 Aug 9.
9
Robust analysis of cancer heterogeneity for high-dimensional data.高维数据中癌症异质性的稳健分析。
Stat Med. 2022 Nov 30;41(27):5448-5462. doi: 10.1002/sim.9578. Epub 2022 Sep 18.
10
Tissue-Specific eQTL in Zebrafish.斑马鱼中的组织特异性表达数量性状基因座
Methods Mol Biol. 2020;2082:239-249. doi: 10.1007/978-1-0716-0026-9_17.

本文引用的文献

1
An integrative sparse boosting analysis of cancer genomic commonality and difference.癌症基因组共性与差异的整合稀疏增强分析
Stat Methods Med Res. 2020 May;29(5):1325-1337. doi: 10.1177/0962280219859026. Epub 2019 Jul 7.
9
Immune Gene Expression Is Associated with Genomic Aberrations in Breast Cancer.免疫基因表达与乳腺癌中的基因组异常相关。
Cancer Res. 2017 Jun 15;77(12):3317-3324. doi: 10.1158/0008-5472.CAN-16-3478. Epub 2017 Apr 20.
10
Transcriptional Addiction in Cancer.癌症中的转录成瘾
Cell. 2017 Feb 9;168(4):629-643. doi: 10.1016/j.cell.2016.12.013.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验