Biomarker detection using corrected degree of domesticity in hybrid social network feature selection for improving classifier performance.

Affiliation

Department of Biostatistics, Hacettepe University Faculty of Medicine, Sıhhiye, 06230, Ankara, Türkiye.

Publication information

BMC Bioinformatics. 2023 Oct 30;24(1):407. doi: 10.1186/s12859-023-05540-5.

DOI:10.1186/s12859-023-05540-5
PMID:37904081
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10617059/
Abstract

BACKGROUND

Dimension reduction, especially feature selection, is an important step in improving classification performance for high-dimensional data. Particularly in cancer research, when reducing the number of features, i.e., genes, it is important to select the most informative features/potential biomarkers that could affect the diagnostic accuracy. Therefore, researchers continuously try to explore more efficient ways to reduce the large number of features/genes to a small but informative subset before the classification task. Hybrid methods have been extensively investigated for this purpose, and research to find the optimal approach is ongoing. Social network analysis is used as a part of a hybrid method, although there are several issues that have arisen when using social network tools, such as using a single environment for computing, constructing an adjacency matrix or computing network measures. Therefore, in our study, we apply a hybrid feature selection method consisting of several machine learning algorithms in addition to social network analysis with our proposed network metric, called the corrected degree of domesticity, in a single environment, R, to improve the support vector machine classifier's performance. In addition, we evaluate and compare the performances of several combinations used in the different steps of the method with a simulation experiment.

RESULTS

The proposed method improves the classifier's performance compared to using the whole feature set in all the cases we investigate. Additionally, in terms of the area under the receiver operating characteristic (ROC) curve, our approach improves classification performance compared to several approaches in the literature.
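As a reminder of what the comparison metric measures, the area under the ROC curve can be computed as the probability that a randomly chosen positive sample receives a higher score than a randomly chosen negative one. The sketch below is a minimal pure-Python version of that definition. The labels and the two score vectors are invented for illustration only; they are not results from the paper, and the names `scores_all_features` and `scores_selected_subset` are hypothetical.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via pairwise comparison.

    Counts, over all (positive, negative) pairs, how often the positive
    sample has the higher score; ties count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative sample")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Invented toy data (NOT from the paper): 3 positive and 3 negative samples.
labels = [1, 1, 1, 0, 0, 0]
scores_all_features = [0.9, 0.4, 0.6, 0.5, 0.3, 0.7]     # hypothetical classifier scores
scores_selected_subset = [0.9, 0.8, 0.6, 0.5, 0.3, 0.2]  # hypothetical classifier scores

auc_all = roc_auc(labels, scores_all_features)
auc_selected = roc_auc(labels, scores_selected_subset)
print(f"AUC, all features:    {auc_all:.3f}")
print(f"AUC, selected subset: {auc_selected:.3f}")
```

The quadratic pairwise loop is chosen for readability; for large samples a rank-based formulation gives the same value more efficiently.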

CONCLUSION

When using the corrected degree of domesticity as a network degree centrality measure, it is important to use our correction to compare nodes/features with no connection outside of their community since it provides a more accurate ranking among the features. Due to the nature of the hybrid method, which includes social network analysis, it is necessary to investigate possible combinations to provide an optimal solution for the microarray data used in the research.
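To make the situation described above concrete, the following sketch counts, for each node, its links inside and outside its own community and lists the nodes that have no outside connection. It assumes that the uncorrected degree of domesticity is the number of links a node has within its own community; this is an assumption, since the abstract does not define the metric. The correction proposed by the authors is not stated in the abstract and is not implemented here. The gene names, edges, and community labels are invented for illustration.

```python
from collections import defaultdict


def community_degrees(edges, community):
    """Return {node: (internal_degree, external_degree)}.

    internal_degree counts links to nodes in the same community,
    external_degree counts links to nodes in other communities.
    """
    internal = defaultdict(int)
    external = defaultdict(int)
    for u, v in edges:
        if community[u] == community[v]:
            internal[u] += 1
            internal[v] += 1
        else:
            external[u] += 1
            external[v] += 1
    return {node: (internal[node], external[node]) for node in community}


# Invented toy network (NOT from the paper): five features in two communities.
edges = [("g1", "g2"), ("g1", "g3"), ("g2", "g3"), ("g3", "g4"), ("g4", "g5")]
community = {"g1": "A", "g2": "A", "g3": "A", "g4": "B", "g5": "B"}

degrees = community_degrees(edges, community)

# Nodes with no connection outside their community: the case the
# conclusion says needs the correction for a fair ranking.
isolated_within_community = sorted(
    node for node, (_, ext) in degrees.items() if ext == 0
)

for node in sorted(degrees):
    print(node, degrees[node])
print("no outside links:", isolated_within_community)
```

In this toy network, `g1`, `g2`, and `g5` have only within-community links, so a ranking based purely on internal and external link counts would need some additional rule to compare them with nodes such as `g3` and `g4`.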

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/acfc/10617059/8ff0e734129b/12859_2023_5540_Fig1_HTML.jpg

Similar articles

1. Biomarker detection using corrected degree of domesticity in hybrid social network feature selection for improving classifier performance.
   BMC Bioinformatics. 2023 Oct 30;24(1):407. doi: 10.1186/s12859-023-05540-5.
2. Machine Learning Hybrid Model for the Prediction of Chronic Kidney Disease.
   Comput Intell Neurosci. 2023 Mar 14;2023:9266889. doi: 10.1155/2023/9266889. eCollection 2023.
3. Robust biomarker screening from gene expression data by stable machine learning-recursive feature elimination methods.
   Comput Biol Chem. 2022 Oct;100:107747. doi: 10.1016/j.compbiolchem.2022.107747. Epub 2022 Jul 29.
4. Breast cancer prediction with transcriptome profiling using feature selection and machine learning methods.
   BMC Bioinformatics. 2022 Oct 1;23(1):410. doi: 10.1186/s12859-022-04965-8.
5. Hybrid Feature-Learning-Based PSO-PCA Feature Engineering Approach for Blood Cancer Classification.
   Diagnostics (Basel). 2023 Aug 14;13(16):2672. doi: 10.3390/diagnostics13162672.
6. Resting-State Functional Network Scale Effects and Statistical Significance-Based Feature Selection in Machine Learning Classification.
   Comput Math Methods Med. 2019 Nov 4;2019:9108108. doi: 10.1155/2019/9108108. eCollection 2019.
7. Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers.
   Med Phys. 2018 Jul;45(7):3449-3459. doi: 10.1002/mp.12967. Epub 2018 Jun 13.
8. Incorporating feature ranking and evolutionary methods for the classification of high-dimensional DNA microarray gene expression data.
   Australas Med J. 2013 May 30;6(5):272-9. doi: 10.4066/AMJ.2013.1641. Print 2013.
9. Upper-Limb Motion Recognition Based on Hybrid Feature Selection: Algorithm Development and Validation.
   JMIR Mhealth Uhealth. 2021 Sep 2;9(9):e24402. doi: 10.2196/24402.
10. A novel biomarker selection method combining graph neural network and gene relationships applied to microarray data.
    BMC Bioinformatics. 2022 Jul 26;23(1):303. doi: 10.1186/s12859-022-04848-y.
