• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用GPNet并结合新颖的P值计算方法进行基于分类的通路分析。

Classification-based pathway analysis using GPNet with novel P-value computation.

作者信息

Lu Hao, Rezapour Mostafa, Baha Haseebullah, Khalid Khan Niazi Muhammad, Narayanan Aarthi, Nafi Gurcan Metin

机构信息

Center for Artificial Intelligence Research, Wake Forest University School of Medicine, Winston-Salem, NC 27101, United States.

School of Systems Biology, College of Science, George Mason University, Fairfax, VA 22030, United States.

出版信息

Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf039.

DOI:10.1093/bib/bbaf039
PMID:39879387
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11775473/
Abstract

Pathway analysis plays a critical role in bioinformatics, enabling researchers to identify biological pathways associated with various conditions by analyzing gene expression data. However, the rise of large, multi-center datasets has highlighted limitations in traditional methods like Over-Representation Analysis (ORA) and Functional Class Scoring (FCS), which struggle with low signal-to-noise ratios (SNR) and large sample sizes. To tackle these challenges, we use a deep learning-based classification method, Gene PointNet, and a novel $P$-value computation approach leveraging the confusion matrix to address pathway analysis tasks. We validated our method effectiveness through a comparative study using a simulated dataset and RNA-Seq data from The Cancer Genome Atlas breast cancer dataset. Our method was benchmarked against traditional techniques (ORA, FCS), shallow machine learning models (logistic regression, support vector machine), and deep learning approaches (DeepHisCom, PASNet). The results demonstrate that GPNet outperforms these methods in low-SNR, large-sample datasets, where it remains robust and reliable, significantly reducing both Type I error and improving power. This makes our method well suited for pathway analysis in large, multi-center studies. The code can be found at https://github.com/haolu123/GPNet_pathway">https://github.com/haolu123/GPNet_pathway.

摘要

通路分析在生物信息学中起着关键作用,它使研究人员能够通过分析基因表达数据来识别与各种疾病相关的生物通路。然而,大型多中心数据集的出现凸显了传统方法(如过表达分析(ORA)和功能类评分(FCS))的局限性,这些方法在低信噪比(SNR)和大样本量的情况下存在困难。为了应对这些挑战,我们使用了一种基于深度学习的分类方法——基因点云网络(Gene PointNet),以及一种利用混淆矩阵的新型P值计算方法来处理通路分析任务。我们通过使用模拟数据集和来自癌症基因组图谱乳腺癌数据集的RNA测序数据进行比较研究,验证了我们方法的有效性。我们的方法与传统技术(ORA、FCS)、浅层机器学习模型(逻辑回归、支持向量机)和深度学习方法(DeepHisCom、PASNet)进行了基准测试。结果表明,在低信噪比、大样本数据集中,基因点云网络(GPNet)优于这些方法,它在这些数据集中保持稳健可靠,显著降低了I型错误并提高了检验效能。这使得我们的方法非常适合在大型多中心研究中进行通路分析。代码可在https://github.com/haolu123/GPNet_pathway上找到。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e672/11775473/eab2b29e604c/bbaf039f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e672/11775473/9fe53d4498c3/bbaf039f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e672/11775473/43c669185ff8/bbaf039f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e672/11775473/eab2b29e604c/bbaf039f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e672/11775473/9fe53d4498c3/bbaf039f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e672/11775473/43c669185ff8/bbaf039f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e672/11775473/eab2b29e604c/bbaf039f3.jpg

相似文献

1
Classification-based pathway analysis using GPNet with novel P-value computation.使用GPNet并结合新颖的P值计算方法进行基于分类的通路分析。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbaf039.
2
Interpretable deep neural network for cancer survival analysis by integrating genomic and clinical data.基于基因组和临床数据的可解释深度神经网络在癌症生存分析中的应用。
BMC Med Genomics. 2019 Dec 23;12(Suppl 10):189. doi: 10.1186/s12920-019-0624-2.
3
Multi-level multi-view network based on structural contrastive learning for scRNA-seq data clustering.基于结构对比学习的多层次多视图网络用于 scRNA-seq 数据聚类。
Brief Bioinform. 2024 Sep 23;25(6). doi: 10.1093/bib/bbae562.
4
Enhancing the prediction of IDC breast cancer staging from gene expression profiles using hybrid feature selection methods and deep learning architecture.使用混合特征选择方法和深度学习架构增强从基因表达谱预测浸润性导管癌乳腺癌分期的能力。
Med Biol Eng Comput. 2023 Nov;61(11):2895-2919. doi: 10.1007/s11517-023-02892-1. Epub 2023 Aug 2.
5
Comprehensive bioinformatics and machine learning analyses for breast cancer staging using TCGA dataset.使用TCGA数据集进行乳腺癌分期的综合生物信息学和机器学习分析。
Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae628.
6
EDCLoc: a prediction model for mRNA subcellular localization using improved focal loss to address multi-label class imbalance.EDCLoc:一种用于mRNA亚细胞定位的预测模型,使用改进的焦点损失来解决多标签类不平衡问题。
BMC Genomics. 2024 Dec 27;25(1):1252. doi: 10.1186/s12864-024-11173-6.
7
Network-based drug sensitivity prediction.基于网络的药物敏感性预测。
BMC Med Genomics. 2020 Dec 28;13(Suppl 11):193. doi: 10.1186/s12920-020-00829-3.
8
NNBGWO-BRCA marker: Neural Network and binary grey wolf optimization based Breast cancer biomarker discovery framework using multi-omics dataset.基于神经网络和二进制灰狼优化的乳腺癌生物标志物发现框架,利用多组学数据集。
Comput Methods Programs Biomed. 2024 Sep;254:108291. doi: 10.1016/j.cmpb.2024.108291. Epub 2024 Jun 18.
9
PASNet: pathway-associated sparse deep neural network for prognosis prediction from high-throughput data.PASNet:基于通路关联稀疏深度神经网络的高通量数据预后预测方法。
BMC Bioinformatics. 2018 Dec 17;19(1):510. doi: 10.1186/s12859-018-2500-z.
10
Unsupervised construction of computational graphs for gene expression data with explicit structural inductive biases.无监督构建具有显式结构归纳偏差的基因表达数据的计算图。
Bioinformatics. 2022 Feb 7;38(5):1320-1327. doi: 10.1093/bioinformatics/btab830.

本文引用的文献

1
Deep learning in structural bioinformatics: current applications and future perspectives.结构生物信息学中的深度学习:当前应用与未来展望。
Brief Bioinform. 2024 Mar 27;25(3). doi: 10.1093/bib/bbae042.
2
A review of deep learning applications in human genomics using next-generation sequencing data.深度学习在人类基因组学中应用的研究进展:利用下一代测序数据
Hum Genomics. 2022 Jul 25;16(1):26. doi: 10.1186/s40246-022-00396-x.
3
DeepHisCoM: deep learning pathway analysis using hierarchical structural component models.DeepHisCoM:基于层次结构组件模型的深度学习通路分析。
Brief Bioinform. 2022 Sep 20;23(5). doi: 10.1093/bib/bbac171.
4
Contamination detection in genomic data: more is not enough.基因组数据中的污染检测:更多并不一定更好。
Genome Biol. 2022 Feb 21;23(1):60. doi: 10.1186/s13059-022-02619-9.
5
A comparative study of topology-based pathway enrichment analysis methods.基于拓扑的通路富集分析方法的比较研究。
BMC Bioinformatics. 2019 Nov 4;20(1):546. doi: 10.1186/s12859-019-3146-1.
6
Gene2vec: distributed representation of genes based on co-expression.Gene2vec:基于共表达的基因分布式表示。
BMC Genomics. 2019 Feb 4;20(Suppl 1):82. doi: 10.1186/s12864-018-5370-x.
7
PASNet: pathway-associated sparse deep neural network for prognosis prediction from high-throughput data.PASNet:基于通路关联稀疏深度神经网络的高通量数据预后预测方法。
BMC Bioinformatics. 2018 Dec 17;19(1):510. doi: 10.1186/s12859-018-2500-z.
8
Opportunities and challenges of whole-genome and -exome sequencing.全基因组和外显子组测序的机遇与挑战
BMC Genet. 2017 Feb 14;18(1):14. doi: 10.1186/s12863-017-0479-5.
9
Genetic effects and genotype × environment interactions govern seed oil content in Brassica napus L.遗传效应和基因型×环境互作决定甘蓝型油菜种子的含油量。
BMC Genet. 2017 Jan 5;18(1):1. doi: 10.1186/s12863-016-0468-0.
10
Deep learning in bioinformatics.生物信息学中的深度学习。
Brief Bioinform. 2017 Sep 1;18(5):851-869. doi: 10.1093/bib/bbw068.