• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用贝叶斯基因选择的逻辑回归进行癌症分类和预测。

Cancer classification and prediction using logistic regression with Bayesian gene selection.

作者信息

Zhou Xiaobo, Liu Kuang-Yu, Wong Stephen T C

机构信息

Harvard Center for Neurodegeneration and Repair, Center for Bioinformatics, Harvard Medical School, 220 Longwood Avenue, Boston, MA 02115, USA.

出版信息

J Biomed Inform. 2004 Aug;37(4):249-59. doi: 10.1016/j.jbi.2004.07.009.

DOI:10.1016/j.jbi.2004.07.009
PMID:15465478
Abstract

In microarray-based cancer classification and prediction, gene selection is an important research problem owing to the large number of genes and the small number of experimental conditions. In this paper, we propose a Bayesian approach to gene selection and classification using the logistic regression model. The basic idea of our approach is in conjunction with a logistic regression model to relate the gene expression with the class labels. We use Gibbs sampling and Markov chain Monte Carlo (MCMC) methods to discover important genes. To implement Gibbs Sampler and MCMC search, we derive a posterior distribution of selected genes given the observed data. After the important genes are identified, the same logistic regression model is then used for cancer classification and prediction. Issues for efficient implementation for the proposed method are discussed. The proposed method is evaluated against several large microarray data sets, including hereditary breast cancer, small round blue-cell tumors, and acute leukemia. The results show that the method can effectively identify important genes consistent with the known biological findings while the accuracy of the classification is also high. Finally, the robustness and sensitivity properties of the proposed method are also investigated.

摘要

在基于微阵列的癌症分类和预测中,由于基因数量众多而实验条件较少,基因选择是一个重要的研究问题。在本文中,我们提出了一种使用逻辑回归模型进行基因选择和分类的贝叶斯方法。我们方法的基本思想是结合逻辑回归模型,将基因表达与类别标签联系起来。我们使用吉布斯采样和马尔可夫链蒙特卡罗(MCMC)方法来发现重要基因。为了实现吉布斯采样器和MCMC搜索,我们在给定观测数据的情况下推导所选基因的后验分布。在识别出重要基因后,然后使用相同的逻辑回归模型进行癌症分类和预测。讨论了所提方法有效实现的相关问题。所提方法针对几个大型微阵列数据集进行了评估,包括遗传性乳腺癌、小圆蓝细胞肿瘤和急性白血病。结果表明,该方法能够有效地识别与已知生物学发现一致的重要基因,同时分类准确率也很高。最后,还研究了所提方法的稳健性和敏感性特性。

相似文献

1
Cancer classification and prediction using logistic regression with Bayesian gene selection.使用贝叶斯基因选择的逻辑回归进行癌症分类和预测。
J Biomed Inform. 2004 Aug;37(4):249-59. doi: 10.1016/j.jbi.2004.07.009.
2
Gene selection in cancer classification using sparse logistic regression with Bayesian regularization.使用带贝叶斯正则化的稀疏逻辑回归进行癌症分类中的基因选择。
Bioinformatics. 2006 Oct 1;22(19):2348-55. doi: 10.1093/bioinformatics/btl386. Epub 2006 Jul 14.
3
Independent component analysis-based penalized discriminant method for tumor classification using gene expression data.基于独立成分分析的惩罚判别方法用于利用基因表达数据进行肿瘤分类
Bioinformatics. 2006 Aug 1;22(15):1855-62. doi: 10.1093/bioinformatics/btl190. Epub 2006 May 18.
4
Comprehensive vertical sample-based KNN/LSVM classification for gene expression analysis.用于基因表达分析的基于垂直样本的综合KNN/LSVM分类
J Biomed Inform. 2004 Aug;37(4):240-8. doi: 10.1016/j.jbi.2004.07.003.
5
Tumor classification ranking from microarray data.基于微阵列数据的肿瘤分类排名
BMC Genomics. 2008 Sep 16;9 Suppl 2(Suppl 2):S21. doi: 10.1186/1471-2164-9-S2-S21.
6
Induction of comprehensible models for gene expression datasets by subgroup discovery methodology.通过子群发现方法为基因表达数据集诱导可理解模型。
J Biomed Inform. 2004 Aug;37(4):269-84. doi: 10.1016/j.jbi.2004.07.007.
7
Bayesian variable selection for the analysis of microarray data with censored outcomes.用于分析具有删失结局的微阵列数据的贝叶斯变量选择
Bioinformatics. 2006 Sep 15;22(18):2262-8. doi: 10.1093/bioinformatics/btl362. Epub 2006 Jul 15.
8
Differential gene expression detection and sample classification using penalized linear regression models.使用惩罚线性回归模型进行差异基因表达检测和样本分类。
Bioinformatics. 2006 Feb 15;22(4):472-6. doi: 10.1093/bioinformatics/bti827. Epub 2005 Dec 13.
9
Structured polychotomous machine diagnosis of multiple cancer types using gene expression.使用基因表达对多种癌症类型进行结构化多分类机器诊断。
Bioinformatics. 2006 Apr 15;22(8):950-8. doi: 10.1093/bioinformatics/btl029. Epub 2006 Feb 1.
10
A combination of rough-based feature selection and RBF neural network for classification using gene expression data.一种基于粗糙集的特征选择与径向基函数神经网络相结合的方法,用于利用基因表达数据进行分类。
IEEE Trans Nanobioscience. 2008 Mar;7(1):91-9. doi: 10.1109/TNB.2008.2000142.

引用本文的文献

1
An Ensembled Framework for Human Breast Cancer Survivability Prediction Using Deep Learning.一种使用深度学习的人类乳腺癌生存能力预测集成框架。
Diagnostics (Basel). 2023 May 10;13(10):1688. doi: 10.3390/diagnostics13101688.
2
Mathematical Modelling of Cervical Precancerous Lesion Grade Risk Scores: Linear Regression Analysis of Cellular Protein Biomarkers and Human Papillomavirus / RNA Staining Patterns.子宫颈癌前病变分级风险评分的数学建模:细胞蛋白生物标志物与人类乳头瘤病毒/RNA染色模式的线性回归分析
Diagnostics (Basel). 2023 Mar 13;13(6):1084. doi: 10.3390/diagnostics13061084.
3
A cost-effective machine learning-based method for preeclampsia risk assessment and driver genes discovery.
一种基于机器学习的具有成本效益的子痫前期风险评估和驱动基因发现方法。
Cell Biosci. 2023 Feb 28;13(1):41. doi: 10.1186/s13578-023-00991-y.
4
Toward Predicting 30-Day Readmission Among Oncology Patients: Identifying Timely and Actionable Risk Factors.面向肿瘤患者 30 天再入院预测:识别及时且可操作的风险因素。
JCO Clin Cancer Inform. 2023 Feb;7:e2200097. doi: 10.1200/CCI.22.00097.
5
An AI-driven clinical care pathway to reduce 30-day readmission for chronic obstructive pulmonary disease (COPD) patients.人工智能驱动的临床护理路径,以降低慢性阻塞性肺疾病(COPD)患者 30 天再入院率。
Sci Rep. 2022 Nov 30;12(1):20633. doi: 10.1038/s41598-022-22434-3.
6
Deep Learning and Machine Learning with Grid Search to Predict Later Occurrence of Breast Cancer Metastasis Using Clinical Data.利用临床数据,通过深度学习和带网格搜索的机器学习预测乳腺癌转移的后期发生情况。
J Clin Med. 2022 Sep 29;11(19):5772. doi: 10.3390/jcm11195772.
7
Cluster Analysis of Cell Nuclei in H&E-Stained Histological Sections of Prostate Cancer and Classification Based on Traditional and Modern Artificial Intelligence Techniques.前列腺癌苏木精-伊红染色组织切片中细胞核的聚类分析及基于传统和现代人工智能技术的分类
Diagnostics (Basel). 2021 Dec 22;12(1):15. doi: 10.3390/diagnostics12010015.
8
Directly and Simultaneously Expressing Absolute and Relative Treatment Effects in Medical Data Models and Applications.在医学数据模型与应用中直接且同时表达绝对和相对治疗效果
Entropy (Basel). 2021 Nov 15;23(11):1517. doi: 10.3390/e23111517.
9
Leveraging Genetic Reports and Electronic Health Records for the Prediction of Primary Cancers: Algorithm Development and Validation Study.利用基因报告和电子健康记录预测原发性癌症:算法开发与验证研究。
JMIR Med Inform. 2021 May 25;9(5):e23586. doi: 10.2196/23586.
10
LogSum + L penalized logistic regression model for biomarker selection and cancer classification.LogSum+L 惩罚逻辑回归模型用于生物标志物选择和癌症分类。
Sci Rep. 2020 Dec 17;10(1):22125. doi: 10.1038/s41598-020-79028-0.