• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于图的枢纽基因选择技术利用蛋白质相互作用信息:在样本分类中的应用。

Graph-Based Hub Gene Selection Technique Using Protein Interaction Information: Application to Sample Classification.

出版信息

IEEE J Biomed Health Inform. 2019 Nov;23(6):2670-2676. doi: 10.1109/JBHI.2019.2894374. Epub 2019 Jan 21.

DOI:10.1109/JBHI.2019.2894374
PMID:30676987
Abstract

Classification of samples of gene expression profile plays a significant role in prediction and diagnosis of diseases. In the task of sample classification, a robust feature selection algorithm is very much essential to identify the important genes from the high dimensional gene expression data. This paper explores the information of protein-protein interaction with a graph mining technique for finding a proper subset of features (genes), which further takes part in sample classification. Here, our contribution for feature selection is three-fold: first, all the genes are grouped into different clusters based on the integrated information of the gene expression values and their protein interactions using a multi-objective optimization based clustering approach. Second, the confidence scores of the protein interactions are incorporated in a popular graph mining algorithm namely Goldberg algorithm to find out the relevant features. These features are the topologically and functionally significant genes, named as hub genes. Finally, these hub genes are identified varying the degrees of the nodes, and those are utilized for the sample classification task. Different machine learning classifiers are exploited for this purpose, and the classification performance is measured with respect to various performance metrics namely accuracy, sensitivity, specificity, precision, F-measure, and Mathews coefficient correlation. Comparative analysis with respect to two baselines and several existing approaches proves the efficiency of the proposed approach. Furthermore, the robustness of the identified hub-gene modules is endorsed using some strong biological significance analysis.

摘要

基因表达谱样本的分类在疾病的预测和诊断中起着重要作用。在样本分类任务中,稳健的特征选择算法对于从高维基因表达数据中识别重要基因非常重要。本文利用图挖掘技术探索蛋白质-蛋白质相互作用的信息,以找到适当的特征(基因)子集,进一步参与样本分类。在这里,我们的特征选择贡献有三方面:首先,使用基于多目标优化的聚类方法,根据基因表达值及其蛋白质相互作用的综合信息将所有基因分为不同的簇。其次,将蛋白质相互作用的置信分数纳入流行的图挖掘算法 Goldberg 算法中,以找出相关特征。这些特征是拓扑和功能上重要的基因,称为枢纽基因。最后,通过改变节点的度数来识别这些枢纽基因,并将其用于样本分类任务。为此目的利用了不同的机器学习分类器,并根据各种性能指标(即准确性、敏感性、特异性、精度、F 度量和 Matthews 系数相关性)来衡量分类性能。与两个基线和几个现有方法的比较分析证明了所提出方法的效率。此外,还使用一些强大的生物学意义分析来证明所识别的枢纽基因模块的稳健性。

相似文献

1
Graph-Based Hub Gene Selection Technique Using Protein Interaction Information: Application to Sample Classification.基于图的枢纽基因选择技术利用蛋白质相互作用信息:在样本分类中的应用。
IEEE J Biomed Health Inform. 2019 Nov;23(6):2670-2676. doi: 10.1109/JBHI.2019.2894374. Epub 2019 Jan 21.
2
A consensus multi-view multi-objective gene selection approach for improved sample classification.一种共识多视角多目标基因选择方法,用于提高样本分类。
BMC Bioinformatics. 2020 Sep 17;21(Suppl 13):386. doi: 10.1186/s12859-020-03681-5.
3
A graph-based gene selection method for medical diagnosis problems using a many-objective PSO algorithm.基于图的基因选择方法,用于使用多目标 PSO 算法解决医学诊断问题。
BMC Med Inform Decis Mak. 2021 Nov 27;21(1):333. doi: 10.1186/s12911-021-01696-3.
4
Application of Genetic Algorithm-Based Support Vector Machine in Identification of Gene Expression Signatures for Psoriasis Classification: A Hybrid Model.基于遗传算法的支持向量机在鉴定银屑病分类基因表达特征中的应用:一种混合模型。
Biomed Res Int. 2021 Sep 8;2021:5520710. doi: 10.1155/2021/5520710. eCollection 2021.
5
Unsupervised gene selection using biological knowledge : application in sample clustering.利用生物学知识进行无监督基因选择:在样本聚类中的应用
BMC Bioinformatics. 2017 Nov 22;18(1):513. doi: 10.1186/s12859-017-1933-0.
6
Attribute clustering for grouping, selection, and classification of gene expression data.用于基因表达数据分组、选择和分类的属性聚类
IEEE/ACM Trans Comput Biol Bioinform. 2005 Apr-Jun;2(2):83-101. doi: 10.1109/TCBB.2005.17.
7
Ensembling of Gene Clusters Utilizing Deep Learning and Protein-Protein Interaction Information.利用深度学习和蛋白质-蛋白质相互作用信息进行基因簇组装。
IEEE/ACM Trans Comput Biol Bioinform. 2020 Nov-Dec;17(6):2005-2016. doi: 10.1109/TCBB.2019.2918523. Epub 2020 Dec 8.
8
A nine-hub-gene signature of metabolic syndrome identified using machine learning algorithms and integrated bioinformatics.基于机器学习算法和整合生物信息学的代谢综合征九基因特征签名。
Bioengineered. 2021 Dec;12(1):5727-5738. doi: 10.1080/21655979.2021.1968249.
9
An efficient statistical feature selection approach for classification of gene expression data.一种用于基因表达数据分类的高效统计特征选择方法。
J Biomed Inform. 2011 Aug;44(4):529-35. doi: 10.1016/j.jbi.2011.01.001. Epub 2011 Jan 15.
10
Fusion of expression values and protein interaction information using multi-objective optimization for improving gene clustering.使用多目标优化融合表达值和蛋白质相互作用信息以改进基因聚类。
Comput Biol Med. 2017 Oct 1;89:31-43. doi: 10.1016/j.compbiomed.2017.07.015. Epub 2017 Aug 1.

引用本文的文献

1
Determination of biomarkers from microarray data using graph neural network and spectral clustering.基于图神经网络和谱聚类的基因表达谱数据中生物标志物的确定。
Sci Rep. 2021 Dec 13;11(1):23828. doi: 10.1038/s41598-021-03316-6.
2
R-JaunLab: Automatic Multi-Class Recognition of Jaundice on Photos of Subjects with Region Annotation Networks.R-JaunLab:基于区域标注网络的带注释受试者照片黄疸的自动多类别识别。
J Digit Imaging. 2021 Apr;34(2):337-350. doi: 10.1007/s10278-021-00432-7. Epub 2021 Feb 25.
3
A Protein Interaction Information-based Generative Model for Enhancing Gene Clustering.
基于蛋白质相互作用信息的基因聚类增强生成模型。
Sci Rep. 2020 Jan 20;10(1):665. doi: 10.1038/s41598-020-57437-5.
4
Weighted gene coexpression network analysis identifies a new biomarker of CENPF for prediction disease prognosis and progression in nonmuscle invasive bladder cancer.加权基因共表达网络分析确定了 CENPF 作为非肌肉浸润性膀胱癌疾病预后和进展的新生物标志物。
Mol Genet Genomic Med. 2019 Nov;7(11):e982. doi: 10.1002/mgg3.982. Epub 2019 Sep 30.