基于进化机制的乳腺癌基因组保守基因表达双聚类模块分析

Evolutionary Mechanism Based Conserved Gene Expression Biclustering Module Analysis for Breast Cancer Genomics.

作者信息

Yuan Wei, Li Yaming, Han Zhengpan, Chen Yu, Xie Jinnan, Chen Jianguo, Bi Zhisheng, Xi Jianing

机构信息

School of Biomedical Engineering, Guangzhou Medical University, Guangzhou 511436, China.

出版信息

Biomedicines. 2024 Sep 12;12(9):2086. doi: 10.3390/biomedicines12092086.

DOI:10.3390/biomedicines12092086

PMID:39335599

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11428256/

Abstract

The identification of significant gene biclusters with particular expression patterns and the elucidation of functionally related genes within gene expression data has become a critical concern due to the vast amount of gene expression data generated by RNA sequencing technology. In this paper, a Conserved Gene Expression Module based on Genetic Algorithm (CGEMGA) is proposed. Breast cancer data from the TCGA database is used as the subject of this study. The -values from Fisher's exact test are used as evaluation metrics to demonstrate the significance of different algorithms, including the Cheng and Church algorithm, CGEM algorithm, etc. In addition, the F-test is used to investigate the difference between our method and the CGEM algorithm. The computational cost of the different algorithms is further investigated by calculating the running time of each algorithm. Finally, the established driver genes and cancer-related pathways are used to validate the process. The results of 10 independent runs demonstrate that CGEMGA has a superior average -value of 1.54 × 10 ± 3.06 × 10 compared to all other algorithms. Furthermore, our approach exhibits consistent performance across all methods. The F-test yields a -value of 0.039, indicating a significant difference between our approach and the CGEM. Computational cost statistics also demonstrate that our approach has a significantly shorter average runtime of 5.22 × 10 ± 1.65 × 10 s compared to the other algorithms. Enrichment analysis indicates that the genes in our approach are significantly enriched for driver genes. Our algorithm is fast and robust, efficiently extracting co-expressed genes and associated co-expression condition biclusters from RNA-seq data.

摘要

由于RNA测序技术产生了大量的基因表达数据，识别具有特定表达模式的重要基因双聚类以及阐明基因表达数据中功能相关的基因已成为一个关键问题。本文提出了一种基于遗传算法的保守基因表达模块（CGEMGA）。使用来自TCGA数据库的乳腺癌数据作为本研究的对象。将Fisher精确检验的p值用作评估指标，以证明包括Cheng和Church算法、CGEM算法等不同算法的显著性。此外，使用F检验来研究我们的方法与CGEM算法之间的差异。通过计算每种算法的运行时间，进一步研究不同算法的计算成本。最后，使用已建立的驱动基因和癌症相关通路来验证该过程。10次独立运行的结果表明，与所有其他算法相比，CGEMGA的平均p值更高，为1.54×10 ± 3.06×10 。此外，我们的方法在所有方法中表现出一致的性能。F检验得出的p值为0.039，表明我们的方法与CGEM之间存在显著差异。计算成本统计还表明，与其他算法相比，我们的方法平均运行时间明显更短，为5.22×10 ± 1.65×10 秒。富集分析表明，我们方法中的基因在驱动基因方面显著富集。我们的算法快速且稳健，能够从RNA-seq数据中高效提取共表达基因和相关的共表达条件双聚类。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bcc7/11428256/0a4adad9402c/biomedicines-12-02086-g001.jpg

相似文献

Evolutionary Mechanism Based Conserved Gene Expression Biclustering Module Analysis for Breast Cancer Genomics.基于进化机制的乳腺癌基因组保守基因表达双聚类模块分析

Biomedicines. 2024 Sep 12;12(9):2086. doi: 10.3390/biomedicines12092086.

Bi-EB: Empirical Bayesian Biclustering for Multi-Omics Data Integration Pattern Identification among Species.双 EB：用于物种间多组学数据整合模式识别的经验贝叶斯双聚类

Genes (Basel). 2022 Oct 30;13(11):1982. doi: 10.3390/genes13111982.

Integrating biological knowledge based on functional annotations for biclustering of gene expression data.基于功能注释整合生物学知识以进行基因表达数据的双聚类分析。

Comput Methods Programs Biomed. 2015 May;119(3):163-80. doi: 10.1016/j.cmpb.2015.02.010. Epub 2015 Mar 18.

Cancer-specific functional profiling in microsatellite-unstable (MSI) colon and endometrial cancers using combined differentially expressed genes and biclustering analysis.使用联合差异表达基因和双聚类分析对微卫星不稳定（MSI）结肠癌和子宫内膜癌进行癌症特异性功能分析。

Medicine (Baltimore). 2023 May 12;102(19):e33647. doi: 10.1097/MD.0000000000033647.

Discovery of error-tolerant biclusters from noisy gene expression data.从嘈杂的基因表达数据中发现容错双聚类。

BMC Bioinformatics. 2011 Nov 24;12 Suppl 12(Suppl 12):S1. doi: 10.1186/1471-2105-12-S12-S1.

Identification of coherent patterns in gene expression data using an efficient biclustering algorithm and parallel coordinate visualization.使用高效双聚类算法和并行坐标可视化技术识别基因表达数据中的连贯模式。

BMC Bioinformatics. 2008 Apr 23;9:210. doi: 10.1186/1471-2105-9-210.

Robust biclustering by sparse singular value decomposition incorporating stability selection.基于稀疏奇异值分解和稳定性选择的稳健双聚类。

Bioinformatics. 2011 Aug 1;27(15):2089-97. doi: 10.1093/bioinformatics/btr322. Epub 2011 Jun 2.

KMeans greedy search hybrid algorithm for biclustering gene expression data.用于基因表达数据的分聚类的 KMeans 贪婪搜索混合算法。

Adv Exp Med Biol. 2010;680:181-8. doi: 10.1007/978-1-4419-5913-3_21.

A novel biclustering approach with iterative optimization to analyze gene expression data.一种用于分析基因表达数据的具有迭代优化的新型双聚类方法。

Adv Appl Bioinform Chem. 2012;5:23-59. doi: 10.2147/AABC.S32622. Epub 2012 Sep 7.

A New Binary Biclustering Algorithm Based on Weight Adjacency Difference Matrix for Analyzing Gene Expression Data.基于权重邻接差矩阵的新型二元分簇算法在基因表达数据分析中的应用。

IEEE/ACM Trans Comput Biol Bioinform. 2023 Sep-Oct;20(5):2802-2809. doi: 10.1109/TCBB.2023.3283801. Epub 2023 Oct 9.

本文引用的文献

A parameter free relative density based biclustering method for identifying non-linear feature relations.一种基于无参数相对密度的双聚类方法，用于识别非线性特征关系。

Heliyon. 2024 Jul 20;10(15):e34736. doi: 10.1016/j.heliyon.2024.e34736. eCollection 2024 Aug 15.

An advanced nomogram model using deep learning radiomics and clinical data for predicting occult lymph node metastasis in lung adenocarcinoma.一种使用深度学习影像组学和临床数据预测肺腺癌隐匿性淋巴结转移的高级列线图模型。

Transl Oncol. 2024 Jun;44:101922. doi: 10.1016/j.tranon.2024.101922. Epub 2024 Mar 29.

The application of targeted RNA sequencing for the analysis of fusion genes, gene mutations, IKZF1 intragenic deletion, and CRLF2 overexpression in acute lymphoblastic leukemia.靶向 RNA 测序在急性淋巴细胞白血病中融合基因、基因突变、IKZF1 基因内缺失和 CRLF2 过表达分析中的应用。

Int J Lab Hematol. 2024 Aug;46(4):670-677. doi: 10.1111/ijlh.14269. Epub 2024 Mar 29.

Analyzing entropy features in time-series data for pattern recognition in neurological conditions.分析时间序列数据中的熵特征，以识别神经状况中的模式。

Artif Intell Med. 2024 Apr;150:102821. doi: 10.1016/j.artmed.2024.102821. Epub 2024 Feb 22.

CTEC: a cross-tabulation ensemble clustering approach for single-cell RNA sequencing data analysis.CTEC：一种用于单细胞 RNA 测序数据分析的交叉制表集成聚类方法。

Bioinformatics. 2024 Mar 29;40(4). doi: 10.1093/bioinformatics/btae130.

Predicting mechanism of immune response in microsatellite instability colorectal cancer.微卫星不稳定型结直肠癌免疫反应的预测机制

Heliyon. 2024 Mar 16;10(6):e28120. doi: 10.1016/j.heliyon.2024.e28120. eCollection 2024 Mar 30.

Comparison of RNA-Seq and microarray in the prediction of protein expression and survival prediction.RNA测序与微阵列在蛋白质表达预测和生存预测中的比较。

Front Genet. 2024 Feb 23;15:1342021. doi: 10.3389/fgene.2024.1342021. eCollection 2024.

The entanglement of DNA damage and pattern recognition receptor signaling.DNA 损伤与模式识别受体信号的纠缠。

DNA Repair (Amst). 2024 Jan;133:103595. doi: 10.1016/j.dnarep.2023.103595. Epub 2023 Nov 15.

(-)-Epicatechin Inhibits Metastatic-Associated Proliferation, Migration, and Invasion of Murine Breast Cancer Cells In Vitro.(-)-表儿茶素抑制体外小鼠乳腺癌细胞转移相关增殖、迁移和侵袭。

Molecules. 2023 Aug 24;28(17):6229. doi: 10.3390/molecules28176229.

Frequency of genetic alterations differs in advanced breast cancer between metastatic sites.晚期乳腺癌转移部位的基因改变频率存在差异。

Genes Chromosomes Cancer. 2024 Jan;63(1):e23199. doi: 10.1002/gcc.23199. Epub 2023 Sep 6.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

基于进化机制的乳腺癌基因组保守基因表达双聚类模块分析

Evolutionary Mechanism Based Conserved Gene Expression Biclustering Module Analysis for Breast Cancer Genomics.

作者信息

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献