• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

K-模块算法:改进 WGCNA 共表达网络聚类结果的附加步骤。

K-Module Algorithm: An Additional Step to Improve the Clustering Results of WGCNA Co-Expression Networks.

机构信息

College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin 150001, China.

出版信息

Genes (Basel). 2021 Jan 12;12(1):87. doi: 10.3390/genes12010087.

DOI:10.3390/genes12010087
PMID:33445666
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7828115/
Abstract

Among biological networks, co-expression networks have been widely studied. One of the most commonly used pipelines for the construction of co-expression networks is weighted gene co-expression network analysis (WGCNA), which can identify highly co-expressed clusters of genes (modules). WGCNA identifies gene modules using hierarchical clustering. The major drawback of hierarchical clustering is that once two objects are clustered together, it cannot be reversed; thus, re-adjustment of the unbefitting decision is impossible. In this paper, we calculate the similarity matrix with the distance correlation for WGCNA to construct a gene co-expression network, and present a new approach called the k-module algorithm to improve the WGCNA clustering results. This method can assign all genes to the module with the highest mean connectivity with these genes. This algorithm re-adjusts the results of hierarchical clustering while retaining the advantages of the dynamic tree cut method. The validity of the algorithm is verified using six datasets from microarray and RNA-seq data. The k-module algorithm has fewer iterations, which leads to lower complexity. We verify that the gene modules obtained by the k-module algorithm have high enrichment scores and strong stability. Our method improves upon hierarchical clustering, and can be applied to general clustering algorithms based on the similarity matrix, not limited to gene co-expression network analysis.

摘要

在生物网络中,共表达网络得到了广泛的研究。构建共表达网络最常用的方法之一是加权基因共表达网络分析(WGCNA),它可以识别高度共表达的基因簇(模块)。WGCNA 使用层次聚类来识别基因模块。层次聚类的主要缺点是,一旦两个对象被聚类在一起,就无法逆转;因此,不可能对不恰当的决策进行重新调整。在本文中,我们使用距离相关系数计算相似性矩阵来构建基因共表达网络,并提出了一种新的方法,称为 k-模块算法,以改进 WGCNA 的聚类结果。该方法可以将所有基因分配到与这些基因具有最高平均连接性的模块中。该算法在保留动态树切分方法优点的同时,重新调整了层次聚类的结果。该算法的有效性已通过微阵列和 RNA-seq 数据的六个数据集得到验证。k-模块算法的迭代次数较少,因此复杂度较低。我们验证了 k-模块算法得到的基因模块具有较高的富集分数和较强的稳定性。我们的方法改进了层次聚类,并且可以应用于基于相似性矩阵的一般聚类算法,而不限于基因共表达网络分析。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b78/7828115/d7bc17b5f864/genes-12-00087-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b78/7828115/6e8b6cd11a39/genes-12-00087-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b78/7828115/33685608bd8f/genes-12-00087-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b78/7828115/8645c92d9541/genes-12-00087-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b78/7828115/03de89cd0a5a/genes-12-00087-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b78/7828115/d7bc17b5f864/genes-12-00087-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b78/7828115/6e8b6cd11a39/genes-12-00087-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b78/7828115/33685608bd8f/genes-12-00087-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b78/7828115/8645c92d9541/genes-12-00087-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b78/7828115/03de89cd0a5a/genes-12-00087-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b78/7828115/d7bc17b5f864/genes-12-00087-g005.jpg

相似文献

1
K-Module Algorithm: An Additional Step to Improve the Clustering Results of WGCNA Co-Expression Networks.K-模块算法:改进 WGCNA 共表达网络聚类结果的附加步骤。
Genes (Basel). 2021 Jan 12;12(1):87. doi: 10.3390/genes12010087.
2
KISL: knowledge-injected semi-supervised learning for biological co-expression network modules.KISL:用于生物共表达网络模块的知识注入半监督学习
Front Genet. 2023 May 2;14:1151962. doi: 10.3389/fgene.2023.1151962. eCollection 2023.
3
TPSC: a module detection method based on topology potential and spectral clustering in weighted networks and its application in gene co-expression module discovery.TPSC:一种基于加权网络中拓扑势和谱聚类的模块检测方法及其在基因共表达模块发现中的应用
BMC Bioinformatics. 2021 Oct 25;22(Suppl 4):111. doi: 10.1186/s12859-021-03964-5.
4
Use of a graph neural network to the weighted gene co-expression network analysis of Korean native cattle.利用图神经网络对韩国本土牛进行加权基因共表达网络分析。
Sci Rep. 2022 Jun 14;12(1):9854. doi: 10.1038/s41598-022-13796-9.
5
FGMD: A novel approach for functional gene module detection in cancer.FGMD:一种用于癌症中功能基因模块检测的新方法。
PLoS One. 2017 Dec 15;12(12):e0188900. doi: 10.1371/journal.pone.0188900. eCollection 2017.
6
An additional k-means clustering step improves the biological features of WGCNA gene co-expression networks.额外的k均值聚类步骤改善了WGCNA基因共表达网络的生物学特征。
BMC Syst Biol. 2017 Apr 12;11(1):47. doi: 10.1186/s12918-017-0420-6.
7
Module representatives for refining gene co-expression modules.用于精炼基因共表达模块的模块代表。
Phys Biol. 2023 May 4;20(4). doi: 10.1088/1478-3975/acce8d.
8
Co-expression modules construction by WGCNA and identify potential prognostic markers of uveal melanoma.通过 WGCNA 构建共表达模块并鉴定葡萄膜黑色素瘤的潜在预后标志物。
Exp Eye Res. 2018 Jan;166:13-20. doi: 10.1016/j.exer.2017.10.007. Epub 2017 Oct 12.
9
Efficient identification of multiple pathways: RNA-Seq analysis of livers from Fe ion irradiated mice.高效识别多种途径:Fe 离子辐照小鼠肝脏的 RNA-Seq 分析。
BMC Bioinformatics. 2020 Mar 20;21(1):118. doi: 10.1186/s12859-020-3446-5.
10
A gene module identification algorithm and its applications to identify gene modules and key genes of hepatocellular carcinoma.基因模块识别算法及其在肝细胞癌基因模块和关键基因识别中的应用。
Sci Rep. 2021 Mar 9;11(1):5517. doi: 10.1038/s41598-021-84837-y.

引用本文的文献

1
Reconciling multiple connectivity-based systems biology methods for drug repurposing.协调多种基于连通性的系统生物学方法用于药物再利用。
Brief Bioinform. 2025 Jul 2;26(4). doi: 10.1093/bib/bbaf387.
2
Integrated transcriptomic and metabolomic analyses reveal tissue-specific accumulation and expression patterns of monoterpene glycosides, gallaglycosides, and flavonoids in Paeonia Lactiflora Pall.综合转录组学和代谢组学分析揭示了芍药中萜类糖苷、没食子酰糖苷和黄酮类化合物的组织特异性积累和表达模式。
BMC Genomics. 2025 Jun 4;26(1):561. doi: 10.1186/s12864-025-11750-3.
3
Isolation and identification of core-genes in barley roots under low phosphorus stress.

本文引用的文献

1
Weighted gene co-expression network analysis identifies CCNA2 as a treatment target of prostate cancer through inhibiting cell cycle.加权基因共表达网络分析通过抑制细胞周期确定CCNA2为前列腺癌的治疗靶点。
J Cancer. 2020 Jan 1;11(5):1203-1211. doi: 10.7150/jca.38173. eCollection 2020.
2
A Genetic Algorithm to Optimize Weighted Gene Co-Expression Network Analysis.一种用于优化加权基因共表达网络分析的遗传算法。
J Comput Biol. 2019 Dec;26(12):1349-1366. doi: 10.1089/cmb.2019.0221. Epub 2019 Jul 30.
3
Network-Based Approaches to Explore Complex Biological Systems towards Network Medicine.
低磷胁迫下大麦根中核心基因的分离与鉴定
Mol Biol Rep. 2025 May 16;52(1):463. doi: 10.1007/s11033-025-10567-3.
4
Hypergraph-based analysis of weighted gene co-expression hypernetwork.基于超图的加权基因共表达超网络分析
Front Genet. 2025 Apr 4;16:1560841. doi: 10.3389/fgene.2025.1560841. eCollection 2025.
5
SGCP: a spectral self-learning method for clustering genes in co-expression networks.SGCP:一种用于共表达网络中基因聚类的光谱自学习方法。
BMC Bioinformatics. 2024 Jul 2;25(1):230. doi: 10.1186/s12859-024-05848-w.
6
Network medicine approaches for identification of novel prognostic systems biomarkers and drug candidates for papillary thyroid carcinoma.网络医学方法在甲状腺乳头状癌新型预后系统生物标志物和药物靶标的鉴定中的应用。
J Cell Mol Med. 2023 Dec;27(24):4171-4180. doi: 10.1111/jcmm.18002. Epub 2023 Oct 19.
7
KISL: knowledge-injected semi-supervised learning for biological co-expression network modules.KISL:用于生物共表达网络模块的知识注入半监督学习
Front Genet. 2023 May 2;14:1151962. doi: 10.3389/fgene.2023.1151962. eCollection 2023.
8
Construction of a prognostic prediction model for renal clear cell carcinoma combining clinical traits.构建一个结合临床特征的肾透明细胞癌预后预测模型。
Sci Rep. 2023 Feb 27;13(1):3358. doi: 10.1038/s41598-023-30020-4.
9
Integrated bioinformatic analyses investigate macrophage-M1-related biomarkers and tuberculosis therapeutic drugs.综合生物信息学分析研究巨噬细胞M1相关生物标志物和结核病治疗药物。
Front Genet. 2023 Feb 8;14:1041892. doi: 10.3389/fgene.2023.1041892. eCollection 2023.
10
A functional gene module identification algorithm in gene expression data based on genetic algorithm and gene ontology.基于遗传算法和基因本体论的基因表达数据中功能基因模块识别算法。
BMC Genomics. 2023 Feb 17;24(1):76. doi: 10.1186/s12864-023-09157-z.
基于网络的方法探索复杂生物系统以迈向网络医学。
Genes (Basel). 2018 Aug 31;9(9):437. doi: 10.3390/genes9090437.
4
Divisive hierarchical maximum likelihood clustering.分裂式层次最大似然聚类。
BMC Bioinformatics. 2017 Dec 28;18(Suppl 16):546. doi: 10.1186/s12859-017-1965-5.
5
MPIGeneNet: Parallel Calculation of Gene Co-Expression Networks on Multicore Clusters.MPIGeneNet:多核集群上基因共表达网络的并行计算。
IEEE/ACM Trans Comput Biol Bioinform. 2018 Sep-Oct;15(5):1732-1737. doi: 10.1109/TCBB.2017.2761340. Epub 2017 Oct 10.
6
Natural variation of macrophage activation as disease-relevant phenotype predictive of inflammation and cancer survival.巨噬细胞激活的自然变异作为疾病相关表型,可预测炎症和癌症的生存情况。
Nat Commun. 2017 Jul 24;8:16041. doi: 10.1038/ncomms16041.
7
An additional k-means clustering step improves the biological features of WGCNA gene co-expression networks.额外的k均值聚类步骤改善了WGCNA基因共表达网络的生物学特征。
BMC Syst Biol. 2017 Apr 12;11(1):47. doi: 10.1186/s12918-017-0420-6.
8
Preservation Analysis of Macrophage Gene Coexpression Between Human and Mouse Identifies PARK2 as a Genetically Controlled Master Regulator of Oxidative Phosphorylation in Humans.人类与小鼠巨噬细胞基因共表达的保存分析确定PARK2是人类氧化磷酸化的基因控制主调节因子。
G3 (Bethesda). 2016 Oct 13;6(10):3361-3371. doi: 10.1534/g3.116.033894.
9
Normalized lmQCM: An Algorithm for Detecting Weak Quasi-Cliques in Weighted Graph with Applications in Gene Co-Expression Module Discovery in Cancers.归一化lmQCM:一种用于检测加权图中弱准团的算法及其在癌症基因共表达模块发现中的应用
Cancer Inform. 2016 Jul 24;13(Suppl 3):137-46. doi: 10.4137/CIN.S14021. eCollection 2014.
10
Genetic regulation of mouse liver metabolite levels.小鼠肝脏代谢物水平的遗传调控。
Mol Syst Biol. 2014 May 23;10(5):730. doi: 10.15252/msb.20135004.