Feature selection and classification over the network with missing node observations.

Affiliations

Splunk Inc., San Francisco, California, USA.

Department of Biostatistics, University of Michigan, Ann Arbor, Michigan, USA.

Publication information

Stat Med. 2022 Mar 30;41(7):1242-1262. doi: 10.1002/sim.9267. Epub 2021 Nov 23.

DOI: 10.1002/sim.9267
PMID: 34816464
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC9773124/
Abstract

Jointly analyzing transcriptomic data and existing biological networks can yield more robust and informative feature selection results, as well as a better understanding of the biological mechanisms. Selecting and classifying node features over genome-scale networks has become increasingly important in genomic biology and genomic medicine. Existing methods have two critical drawbacks. First, they do not allow flexible modeling of different subtypes of selected nodes. Second, they ignore nodes with missing values, which is very likely to increase bias in estimation. To address these limitations, we propose a general modeling framework for Bayesian node classification (BNC) with missing values. A new prior model that incorporates the network structure is developed for the class indicators. For posterior computation, we use the Swendsen-Wang algorithm to update the class indicators efficiently. BNC handles missing values naturally within the Bayesian modeling framework, which improves node classification accuracy and reduces bias in estimating gene effects. We demonstrate the advantages of our methods via extensive simulation studies and an analysis of the cutaneous melanoma dataset from The Cancer Genome Atlas.
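
As a rough illustration of the kind of update the abstract mentions, the sketch below runs one generic Swendsen-Wang sweep for class indicators under a Potts-type prior on a network. It is not the BNC model from the paper: the Potts prior, the coupling parameter `beta`, the `log_lik` table, and the function name `swendsen_wang_step` are all assumptions made for illustration. A node with a missing observation is given a row of zeros in `log_lik`, so it contributes nothing to the likelihood and its label is informed only by its neighbors.

```python
import math
import random


def swendsen_wang_step(labels, edges, log_lik, beta, rng):
    """One Swendsen-Wang sweep for a Potts-type prior on a graph (illustrative).

    labels  : current class label per node (ints in 0..K-1)
    edges   : list of (i, j) node-index pairs
    log_lik : log_lik[i][k] = log-likelihood of node i's data under class k;
              a node with a missing observation gets a row of zeros (assumption)
    beta    : Potts coupling strength, >= 0 (hypothetical tuning parameter)
    rng     : a random.Random instance
    """
    n = len(labels)
    n_classes = len(log_lik[0])
    parent = list(range(n))  # union-find forest over nodes

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]  # path halving
            i = parent[i]
        return i

    # 1. Bond each edge that joins same-label nodes with prob 1 - exp(-beta).
    p_bond = 1.0 - math.exp(-beta)
    for i, j in edges:
        if labels[i] == labels[j] and rng.random() < p_bond:
            parent[find(i)] = find(j)

    # 2. Group nodes into clusters connected by bonds.
    clusters = {}
    for i in range(n):
        clusters.setdefault(find(i), []).append(i)

    # 3. Redraw one label per cluster, weighted by the cluster's likelihood.
    new_labels = list(labels)
    for members in clusters.values():
        scores = [sum(log_lik[i][k] for i in members) for k in range(n_classes)]
        top = max(scores)  # subtract the max for numerical stability
        weights = [math.exp(s - top) for s in scores]
        k_new = rng.choices(range(n_classes), weights=weights)[0]
        for i in members:
            new_labels[i] = k_new
    return new_labels


if __name__ == "__main__":
    # Toy path graph 0 - 1 - 2; node 1 has no observation.
    toy_edges = [(0, 1), (1, 2)]
    toy_log_lik = [[-2.0, 0.0], [0.0, 0.0], [-2.0, 0.0]]
    state = [0, 0, 0]
    sampler_rng = random.Random(0)
    for _ in range(5):
        state = swendsen_wang_step(state, toy_edges, toy_log_lik, 1.0, sampler_rng)
    print(state)
```

Because whole clusters are relabeled at once, a sweep like this can move many linked nodes together, which is the usual motivation for preferring Swendsen-Wang over single-node Gibbs updates on strongly coupled networks.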
