• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

肺癌基因表达数据库分析,结合了支持向量机分类方法和先验知识。

Lung cancer gene expression database analysis incorporating prior knowledge with support vector machine-based classification method.

机构信息

Department of Epidemiology, School of Public Health, China Medical University, Shenyang 110001, PR China.

出版信息

J Exp Clin Cancer Res. 2009 Jul 18;28(1):103. doi: 10.1186/1756-9966-28-103.

DOI:10.1186/1756-9966-28-103
PMID:19615083
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2719616/
Abstract

BACKGROUND

A reliable and precise classification is essential for successful diagnosis and treatment of cancer. Gene expression microarrays have provided the high-throughput platform to discover genomic biomarkers for cancer diagnosis and prognosis. Rational use of the available bioinformation can not only effectively remove or suppress noise in gene chips, but also avoid one-sided results of separate experiment. However, only some studies have been aware of the importance of prior information in cancer classification.

METHODS

Together with the application of support vector machine as the discriminant approach, we proposed one modified method that incorporated prior knowledge into cancer classification based on gene expression data to improve accuracy. A public well-known dataset, Malignant pleural mesothelioma and lung adenocarcinoma gene expression database, was used in this study. Prior knowledge is viewed here as a means of directing the classifier using known lung adenocarcinoma related genes. The procedures were performed by software R 2.80.

RESULTS

The modified method performed better after incorporating prior knowledge. Accuracy of the modified method improved from 98.86% to 100% in training set and from 98.51% to 99.06% in test set. The standard deviations of the modified method decreased from 0.26% to 0 in training set and from 3.04% to 2.10% in test set.

CONCLUSION

The method that incorporates prior knowledge into discriminant analysis could effectively improve the capacity and reduce the impact of noise. This idea may have good future not only in practice but also in methodology.

摘要

背景

可靠和精确的分类对于癌症的成功诊断和治疗至关重要。基因表达微阵列为癌症诊断和预后的基因组生物标志物的发现提供了高通量平台。合理利用现有的生物信息不仅可以有效地去除或抑制基因芯片中的噪声,还可以避免单独实验的结果片面。然而,只有一些研究意识到了先验信息在癌症分类中的重要性。

方法

我们提出了一种基于基因表达数据的改进方法,将先验知识纳入癌症分类中,以提高准确性。该方法结合支持向量机作为判别方法,共同使用。我们使用了一个公共的、众所周知的数据集,即恶性胸膜间皮瘤和肺腺癌基因表达数据库,来进行这项研究。在这里,先验知识被视为利用已知的肺腺癌相关基因指导分类器的一种手段。这些程序是通过软件 R 2.80 执行的。

结果

在纳入先验知识后,改进的方法表现更好。改进方法在训练集的准确率从 98.86%提高到 100%,在测试集的准确率从 98.51%提高到 99.06%。改进方法在训练集的标准差从 0.26%降至 0,在测试集的标准差从 3.04%降至 2.10%。

结论

将先验知识纳入判别分析的方法可以有效地提高能力并降低噪声的影响。这个想法不仅在实践中,而且在方法论上都有很好的前景。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89c4/2719616/13c2700f167f/1756-9966-28-103-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89c4/2719616/5070dc6a038b/1756-9966-28-103-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89c4/2719616/cb81485d6fa2/1756-9966-28-103-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89c4/2719616/13c2700f167f/1756-9966-28-103-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89c4/2719616/5070dc6a038b/1756-9966-28-103-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89c4/2719616/cb81485d6fa2/1756-9966-28-103-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/89c4/2719616/13c2700f167f/1756-9966-28-103-3.jpg

相似文献

1
Lung cancer gene expression database analysis incorporating prior knowledge with support vector machine-based classification method.肺癌基因表达数据库分析,结合了支持向量机分类方法和先验知识。
J Exp Clin Cancer Res. 2009 Jul 18;28(1):103. doi: 10.1186/1756-9966-28-103.
2
Cancer classification from gene expression data by NPPC ensemble.基于 NPPC 集成的基因表达数据的癌症分类。
IEEE/ACM Trans Comput Biol Bioinform. 2011 May-Jun;8(3):659-71. doi: 10.1109/TCBB.2010.36.
3
Mixture classification model based on clinical markers for breast cancer prognosis.基于临床标志物的乳腺癌预后混合分类模型。
Artif Intell Med. 2010 Feb-Mar;48(2-3):129-37. doi: 10.1016/j.artmed.2009.07.008. Epub 2009 Dec 14.
4
Evaluation of Machine Learning Algorithm Utilization for Lung Cancer Classification Based on Gene Expression Levels.基于基因表达水平的肺癌分类机器学习算法应用评估
Asian Pac J Cancer Prev. 2016;17(2):835-8. doi: 10.7314/apjcp.2016.17.2.835.
5
Classification and Clustering on Microarray Data for Gene Functional Prediction Using R.使用R语言对微阵列数据进行分类和聚类以进行基因功能预测
Methods Mol Biol. 2016;1375:41-54. doi: 10.1007/7651_2015_240.
6
A combinational feature selection and ensemble neural network method for classification of gene expression data.一种用于基因表达数据分类的组合特征选择与集成神经网络方法。
BMC Bioinformatics. 2004 Sep 27;5:136. doi: 10.1186/1471-2105-5-136.
7
Recursive fuzzy granulation for gene subsets extraction and cancer classification.用于基因子集提取和癌症分类的递归模糊粒度化
IEEE Trans Inf Technol Biomed. 2008 Nov;12(6):723-30. doi: 10.1109/TITB.2008.920787.
8
A multiple kernel support vector machine scheme for feature selection and rule extraction from gene expression data of cancer tissue.一种用于从癌组织基因表达数据中进行特征选择和规则提取的多核支持向量机方案。
Artif Intell Med. 2007 Oct;41(2):161-75. doi: 10.1016/j.artmed.2007.07.008. Epub 2007 Sep 11.
9
The classification of cancer based on DNA microarray data that uses diverse ensemble genetic programming.基于使用多种集成遗传编程的DNA微阵列数据的癌症分类。
Artif Intell Med. 2006 Jan;36(1):43-58. doi: 10.1016/j.artmed.2005.06.002. Epub 2005 Aug 15.
10
Gene expression data classification using locally linear discriminant embedding.基于局部线性判别嵌入的基因表达数据分类。
Comput Biol Med. 2010 Oct;40(10):802-10. doi: 10.1016/j.compbiomed.2010.08.003. Epub 2010 Sep 22.

引用本文的文献

1
Knowledge-Informed Machine Learning for Cancer Diagnosis and Prognosis: A Review.用于癌症诊断和预后的知识驱动型机器学习综述
IEEE Trans Autom Sci Eng. 2025;22:10008-10028. doi: 10.1109/tase.2024.3515839. Epub 2024 Dec 18.
2
Performance enhancement of classifiers through Bio inspired feature selection methods for early detection of lung cancer from microarray genes.通过受生物启发的特征选择方法提高分类器性能,用于从微阵列基因中早期检测肺癌。
Heliyon. 2024 Aug 17;10(16):e36419. doi: 10.1016/j.heliyon.2024.e36419. eCollection 2024 Aug 30.
3
Enhancement of Classifier Performance with Adam and RanAdam Hyper-Parameter Tuning for Lung Cancer Detection from Microarray Data-In Pursuit of Precision.

本文引用的文献

1
Regularized F-measure maximization for feature selection and classification.用于特征选择和分类的正则化F值最大化
J Biomed Biotechnol. 2009;2009:617946. doi: 10.1155/2009/617946. Epub 2009 Apr 27.
2
An introduction to artificial neural networks in bioinformatics--application to complex microarray and mass spectrometry datasets in cancer studies.生物信息学中的人工神经网络介绍——在癌症研究中复杂微阵列和质谱数据集的应用
Brief Bioinform. 2009 May;10(3):315-29. doi: 10.1093/bib/bbp012. Epub 2009 Mar 23.
3
Sample entropy analysis of cervical neoplasia gene-expression signatures.
通过Adam和RanAdam超参数调优提高从微阵列数据检测肺癌的分类器性能——追求精准度
Bioengineering (Basel). 2024 Mar 26;11(4):314. doi: 10.3390/bioengineering11040314.
4
Evaluation and Exploration of Machine Learning and Convolutional Neural Network Classifiers in Detection of Lung Cancer from Microarray Gene-A Paradigm Shift.机器学习和卷积神经网络分类器在微阵列基因检测肺癌中的评估与探索——一种范式转变
Bioengineering (Basel). 2023 Aug 6;10(8):933. doi: 10.3390/bioengineering10080933.
5
Informed Attentive Predictors: A Generalisable Architecture for Prior Knowledge-Based Assisted Diagnosis of Cancers.知情注意预测器:一种基于先验知识辅助癌症诊断的可推广架构。
Sensors (Basel). 2021 Sep 28;21(19):6484. doi: 10.3390/s21196484.
6
A Holistic Performance Comparison for Lung Cancer Classification Using Swarm Intelligence Techniques.基于群集智能技术的肺癌分类的整体性能比较。
J Healthc Eng. 2021 Jul 29;2021:6680424. doi: 10.1155/2021/6680424. eCollection 2021.
7
Feature selection with the Fisher score followed by the Maximal Clique Centrality algorithm can accurately identify the hub genes of hepatocellular carcinoma.基于 Fisher 得分的特征选择,再结合最大团中心度算法,可以准确识别肝细胞癌的枢纽基因。
Sci Rep. 2019 Nov 21;9(1):17283. doi: 10.1038/s41598-019-53471-0.
8
Construction of a 26‑feature gene support vector machine classifier for smoking and non‑smoking lung adenocarcinoma sample classification.构建一个 26 特征基因支持向量机分类器,用于吸烟和非吸烟肺腺癌样本分类。
Mol Med Rep. 2018 Feb;17(2):3005-3013. doi: 10.3892/mmr.2017.8220. Epub 2017 Dec 7.
9
Lung cancer prediction from microarray data by gene expression programming.通过基因表达编程从微阵列数据预测肺癌
IET Syst Biol. 2016 Oct;10(5):168-178. doi: 10.1049/iet-syb.2015.0082.
10
Identifying Significant Features in Cancer Methylation Data Using Gene Pathway Segmentation.利用基因通路分割识别癌症甲基化数据中的显著特征
Cancer Inform. 2016 Sep 20;15:189-98. doi: 10.4137/CIN.S39859. eCollection 2016.
宫颈肿瘤基因表达特征的样本熵分析
BMC Bioinformatics. 2009 Feb 20;10:66. doi: 10.1186/1471-2105-10-66.
4
A new regularized least squares support vector regression for gene selection.一种用于基因选择的新型正则化最小二乘支持向量回归方法。
BMC Bioinformatics. 2009 Feb 3;10:44. doi: 10.1186/1471-2105-10-44.
5
Integrating biological knowledge with gene expression profiles for survival prediction of cancer.整合生物学知识与基因表达谱以预测癌症患者的生存情况。
J Comput Biol. 2009 Feb;16(2):265-78. doi: 10.1089/cmb.2008.12TT.
6
Mortality attributable to smoking in China.中国吸烟所致的死亡率。
N Engl J Med. 2009 Jan 8;360(2):150-9. doi: 10.1056/NEJMsa0802902.
7
Identification of genes down-regulated during lung cancer progression: a cDNA array study.肺癌进展过程中下调基因的鉴定:一项cDNA阵列研究。
J Exp Clin Cancer Res. 2008 Sep 15;27(1):38. doi: 10.1186/1756-9966-27-38.
8
Incremental forward feature selection with application to microarray gene expression data.应用于微阵列基因表达数据的增量前向特征选择
J Biopharm Stat. 2008;18(5):827-40. doi: 10.1080/10543400802277868.
9
A comprehensive comparison of random forests and support vector machines for microarray-based cancer classification.基于微阵列的癌症分类中随机森林与支持向量机的全面比较
BMC Bioinformatics. 2008 Jul 22;9:319. doi: 10.1186/1471-2105-9-319.
10
Microarray-based classification and clinical predictors: on combined classifiers and additional predictive value.基于微阵列的分类及临床预测指标:关于联合分类器及附加预测价值
Bioinformatics. 2008 Aug 1;24(15):1698-706. doi: 10.1093/bioinformatics/btn262. Epub 2008 Jun 9.