Suppr超能文献

PWSC:一种基于多项式权重调整稀疏聚类的新型聚类方法,用于稀疏生物医学数据及其在癌症亚型分析中的应用。

PWSC: a novel clustering method based on polynomial weight-adjusted sparse clustering for sparse biomedical data and its application in cancer subtyping.

机构信息

Department of Nephrology, Tongji Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430030, Hubei Province, China.

School of Mathematics and Statistics, Wuhan University, Wuhan, 430070, Hubei Province, China.

出版信息

BMC Bioinformatics. 2023 Dec 21;24(1):490. doi: 10.1186/s12859-023-05595-4.

Abstract

BACKGROUND

Clustering analysis is widely used to interpret biomedical data and uncover new knowledge and patterns. However, conventional clustering methods are not effective when dealing with sparse biomedical data. To overcome this limitation, we propose a hierarchical clustering method called polynomial weight-adjusted sparse clustering (PWSC).

RESULTS

The PWSC algorithm adjusts feature weights using a polynomial function, redefines the distances between samples, and performs hierarchical clustering analysis based on these adjusted distances. Additionally, we incorporate a consensus clustering approach to determine the optimal number of classifications. This consensus approach utilizes relative change in the cumulative distribution function to identify the best number of clusters, resulting in more stable clustering results. Leveraging the PWSC algorithm, we successfully classified a cohort of gastric cancer patients, enabling categorization of patients carrying different types of altered genes. Further evaluation using Entropy showed a significant improvement (p = 2.905e-05), while using the Calinski-Harabasz index demonstrates a remarkable 100% improvement in the quality of the best classification compared to conventional algorithms. Similarly, significantly increased entropy (p = 0.0336) and comparable CHI, were observed when classifying another colorectal cancer cohort with microbial abundance. The above attempts in cancer subtyping demonstrate that PWSC is highly applicable to different types of biomedical data. To facilitate its application, we have developed a user-friendly tool that implements the PWSC algorithm, which canbe accessed at http://pwsc.aiyimed.com/ .

CONCLUSIONS

PWSC addresses the limitations of conventional approaches when clustering sparse biomedical data. By adjusting feature weights and employing consensus clustering, we achieve improved clustering results compared to conventional methods. The PWSC algorithm provides a valuable tool for researchers in the field, enabling more accurate and stable clustering analysis. Its application can enhance our understanding of complex biological systems and contribute to advancements in various biomedical disciplines.

摘要

背景

聚类分析被广泛用于解释生物医学数据,以揭示新知识和模式。然而,传统的聚类方法在处理稀疏的生物医学数据时效果不佳。为了克服这一限制,我们提出了一种层次聚类方法,称为多项式加权稀疏聚类(PWSC)。

结果

PWSC 算法使用多项式函数调整特征权重,重新定义样本之间的距离,并根据这些调整后的距离进行层次聚类分析。此外,我们还采用共识聚类方法来确定最佳分类数。该共识方法利用累积分布函数的相对变化来确定最佳聚类数,从而得到更稳定的聚类结果。利用 PWSC 算法,我们成功地对一批胃癌患者进行了分类,能够对携带不同类型基因突变的患者进行分类。使用 Entropy 进行进一步评估显示出显著的改善(p=2.905e-05),而使用 Calinski-Harabasz 指数则显示与传统算法相比,最佳分类质量的改善达到了惊人的 100%。同样,在对另一个含有微生物丰度的结直肠癌队列进行分类时,观察到显著增加的熵(p=0.0336)和可比的 CHI。以上在癌症亚分类中的尝试表明 PWSC 非常适用于不同类型的生物医学数据。为了方便其应用,我们开发了一个用户友好的工具,实现了 PWSC 算法,可在 http://pwsc.aiyimed.com/ 访问。

结论

PWSC 解决了传统方法在聚类稀疏生物医学数据时的局限性。通过调整特征权重和采用共识聚类,我们与传统方法相比,获得了更好的聚类结果。PWSC 算法为该领域的研究人员提供了一个有价值的工具,使更准确和稳定的聚类分析成为可能。它的应用可以增强我们对复杂生物系统的理解,并有助于推动各个生物医学领域的发展。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/07da/10740247/ac6bdeec5abd/12859_2023_5595_Figa_HTML.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验