• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于特征区分能力和网络影响的新特征选择方法。

A new feature selection method based on feature distinguishing ability and network influence.

机构信息

School of Computer Science and Technology, Dalian University of Technology, Dalian 116024, Liaoning, China.

School of Computer Science and Technology, Dalian University of Technology, Dalian 116024, Liaoning, China.

出版信息

J Biomed Inform. 2022 Apr;128:104048. doi: 10.1016/j.jbi.2022.104048. Epub 2022 Mar 3.

DOI:10.1016/j.jbi.2022.104048
PMID:35248795
Abstract

The occurrence and development of diseases are related to the dysfunction of biomolecules (genes, metabolites, etc.) and the changes of molecule interactions. Identifying the key molecules related to the physiological and pathological changes of organisms from omics data is of great significance for disease diagnosis, early warning and drug-target prediction, etc. A novel feature selection algorithm based on the feature individual distinguishing ability and feature influence in the biological network (FS-DANI) is proposed for defining important biomolecules (features) to discriminate different disease conditions. The feature individual distinguishing ability is evaluated based on the overlapping area of the feature effective ranges in different classes. FS-DANI measures the feature network influence based on the module importance in the correlation network and the feature centrality in the modules. The feature comprehensive weight is obtained by combining the feature individual distinguishing ability and feature influence in the network. Then crucial feature subset is determined by the sequential forward search (SFS) on the feature list sorted according to the comprehensive weights of features. FS-DANI is compared with the six efficient feature selection methods on ten public omics datasets. The ablation experiment is also conducted. Experimental results show that FS-DANI is better than the compared algorithms in accuracy, sensitivity and specificity on the whole. On analyzing the gastric cancer miRNA expression data, FS-DANI identified two miRNAs (hsa-miR-18a* and hsa-miR-381), whose AUCs for distinguishing gastric cancer samples and normal samples are 0.959 and 0.879 in the discovery set and an independent validation set, respectively. Hence, evaluating biomolecules from the molecular level and network level is helpful for identifying the potential disease biomarkers of high performance.

摘要

疾病的发生和发展与生物分子(基因、代谢物等)的功能障碍以及分子相互作用的变化有关。从组学数据中识别与生物体生理和病理变化相关的关键分子,对于疾病诊断、预警和药物靶点预测等具有重要意义。为了定义重要的生物分子(特征)以区分不同的疾病状态,提出了一种基于生物网络中特征个体区分能力和特征影响的新特征选择算法(FS-DANI)。特征个体区分能力基于不同类别中特征有效范围的重叠区域进行评估。FS-DANI 根据相关网络中的模块重要性和模块中的特征中心性来衡量特征网络影响。通过结合网络中特征个体区分能力和特征影响,得到特征综合权重。然后通过对根据特征综合权重排序的特征列表进行顺序向前搜索(SFS),确定关键特征子集。FS-DANI 在十个公共组学数据集上与六种高效特征选择方法进行了比较。还进行了消融实验。实验结果表明,FS-DANI 在整体准确性、敏感性和特异性方面均优于比较算法。在分析胃癌 miRNA 表达数据时,FS-DANI 鉴定了两个 miRNA(hsa-miR-18a* 和 hsa-miR-381),在发现集中区分胃癌样本和正常样本的 AUC 分别为 0.959 和 0.879,在独立验证集中。因此,从分子水平和网络水平评估生物分子有助于识别高性能的潜在疾病生物标志物。

相似文献

1
A new feature selection method based on feature distinguishing ability and network influence.基于特征区分能力和网络影响的新特征选择方法。
J Biomed Inform. 2022 Apr;128:104048. doi: 10.1016/j.jbi.2022.104048. Epub 2022 Mar 3.
2
Analyzing omics data based on sample network.基于样本网络分析组学数据。
J Bioinform Comput Biol. 2024 Feb;22(1):2450002. doi: 10.1142/S0219720024500021. Epub 2024 Mar 25.
3
A novel method for feature selection based on molecular interactive effect network.一种基于分子相互作用网络的特征选择新方法。
J Pharm Biomed Anal. 2022 Sep 5;218:114873. doi: 10.1016/j.jpba.2022.114873. Epub 2022 Jun 6.
4
A Novel Rank Aggregation-Based Hybrid Multifilter Wrapper Feature Selection Method in Software Defect Prediction.一种新颖的基于排序聚合的混合多过滤器包装特征选择方法在软件缺陷预测中。
Comput Intell Neurosci. 2021 Nov 24;2021:5069016. doi: 10.1155/2021/5069016. eCollection 2021.
5
A new feature selection algorithm based on relevance, redundancy and complementarity.一种基于相关性、冗余性和互补性的新特征选择算法。
Comput Biol Med. 2020 Apr;119:103667. doi: 10.1016/j.compbiomed.2020.103667. Epub 2020 Feb 19.
6
An omics data analysis method based on feature linear relationship and graph convolutional network.基于特征线性关系和图卷积网络的组学数据分析方法。
J Biomed Inform. 2023 Sep;145:104479. doi: 10.1016/j.jbi.2023.104479. Epub 2023 Aug 25.
7
Guilt-by-association feature selection: identifying biomarkers from proteomic profiles.基于关联的特征选择:从蛋白质组学图谱中识别生物标志物。
J Biomed Inform. 2008 Feb;41(1):124-36. doi: 10.1016/j.jbi.2007.04.003. Epub 2007 Apr 14.
8
Evaluation of Feature Selection Methods for Mammographic Breast Cancer Diagnosis in a Unified Framework.在统一框架下评估用于乳腺 X 线摄影乳腺癌诊断的特征选择方法。
Biomed Res Int. 2021 Oct 4;2021:6079163. doi: 10.1155/2021/6079163. eCollection 2021.
9
A new feature selection method based on symmetrical uncertainty and interaction gain.一种基于对称不确定性和交互增益的新特征选择方法。
Comput Biol Chem. 2019 Dec;83:107149. doi: 10.1016/j.compbiolchem.2019.107149. Epub 2019 Nov 6.
10
Cancer survival classification using integrated data sets and intermediate information.基于整合数据集和中间信息的癌症生存分类。
Artif Intell Med. 2014 Sep;62(1):23-31. doi: 10.1016/j.artmed.2014.06.003. Epub 2014 Jun 21.

引用本文的文献

1
A new feature selection approach with binary exponential henry gas solubility optimization and hybrid data transformation methods.一种采用二元指数亨利气体溶解度优化和混合数据变换方法的新特征选择方法。
MethodsX. 2024 May 20;12:102770. doi: 10.1016/j.mex.2024.102770. eCollection 2024 Jun.
2
Physiological Status Prediction Based on a Novel Hybrid Intelligent Scheme.基于新型混合智能方案的生理状态预测。
Comput Intell Neurosci. 2022 Dec 15;2022:4610747. doi: 10.1155/2022/4610747. eCollection 2022.