• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于从微阵列基因表达数据中发现生物标志物以进行癌症分类的混合方法。

A hybrid approach for biomarker discovery from microarray gene expression data for cancer classification.

作者信息

Peng Yanxiong, Li Wenyuan, Liu Ying

机构信息

Laboratory for Bioinformatics and Medical Informatics, University of Texas at Dallas, Richardson, TX 75083-0688, USA.

出版信息

Cancer Inform. 2007 Feb 22;2:301-11.

PMID:19458773
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2675487/
Abstract

Microarrays allow researchers to monitor the gene expression patterns for tens of thousands of genes across a wide range of cellular responses, phenotype and conditions. Selecting a small subset of discriminate genes from thousands of genes is important for accurate classification of diseases and phenotypes. Many methods have been proposed to find subsets of genes with maximum relevance and minimum redundancy, which can distinguish accurately between samples with different labels. To find the minimum subset of relevant genes is often referred as biomarker discovery. Two main approaches, filter and wrapper techniques, have been applied to biomarker discovery. In this paper, we conducted a comparative study of different biomarker discovery methods, including six filter methods and three wrapper methods. We then proposed a hybrid approach, FR-Wrapper, for biomarker discovery. The aim of this approach is to find an optimum balance between the precision of the biomarker discovery and the computation cost, by taking advantages of both filter method's efficiency and wrapper method's high accuracy. Our hybrid approach applies Fisher's ratio, a simple method easy to understand and implement, to filter out most of the irrelevant genes, then a wrapper method is employed to reduce the redundancy. The performance of FR-Wrapper approach is evaluated over four widely used microarray datasets. Analysis of experimental results reveals that the hybrid approach can achieve the goal of maximum relevance with minimum redundancy.

摘要

微阵列技术使研究人员能够监测数万个基因在广泛的细胞反应、表型和条件下的基因表达模式。从数千个基因中选择一小部分具有区分性的基因对于疾病和表型的准确分类至关重要。已经提出了许多方法来寻找具有最大相关性和最小冗余性的基因子集,这些子集能够准确区分具有不同标签的样本。寻找相关基因的最小子集通常被称为生物标志物发现。两种主要方法,即过滤法和包装法,已被应用于生物标志物发现。在本文中,我们对不同的生物标志物发现方法进行了比较研究,包括六种过滤法和三种包装法。然后,我们提出了一种用于生物标志物发现的混合方法FR-Wrapper。这种方法的目的是通过利用过滤法的效率和包装法的高精度,在生物标志物发现的精度和计算成本之间找到最佳平衡。我们的混合方法应用费舍尔比率(一种易于理解和实现的简单方法)来过滤掉大多数不相关的基因,然后采用包装法来减少冗余。在四个广泛使用的微阵列数据集上评估了FR-Wrapper方法的性能。实验结果分析表明,该混合方法能够实现最大相关性和最小冗余性的目标。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8696/2675487/ef570e7d18f1/CIN-02-301-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8696/2675487/54a31cf4b972/CIN-02-301-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8696/2675487/3bc98deb5248/CIN-02-301-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8696/2675487/699d7eddae8e/CIN-02-301-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8696/2675487/ef570e7d18f1/CIN-02-301-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8696/2675487/54a31cf4b972/CIN-02-301-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8696/2675487/3bc98deb5248/CIN-02-301-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8696/2675487/699d7eddae8e/CIN-02-301-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8696/2675487/ef570e7d18f1/CIN-02-301-g004.jpg

相似文献

1
A hybrid approach for biomarker discovery from microarray gene expression data for cancer classification.一种用于从微阵列基因表达数据中发现生物标志物以进行癌症分类的混合方法。
Cancer Inform. 2007 Feb 22;2:301-11.
2
Filter versus wrapper gene selection approaches in DNA microarray domains.DNA微阵列领域中过滤法与包装法基因选择方法
Artif Intell Med. 2004 Jun;31(2):91-103. doi: 10.1016/j.artmed.2004.01.007.
3
A novel bio-inspired hybrid multi-filter wrapper gene selection method with ensemble classifier for microarray data.一种用于微阵列数据的、基于集成分类器的新型生物启发式混合多滤波器包装基因选择方法。
Neural Comput Appl. 2023;35(16):11531-11561. doi: 10.1007/s00521-021-06459-9. Epub 2021 Sep 12.
4
A multiple-filter-multiple-wrapper approach to gene selection and microarray data classification.一种多重过滤-多重包装方法用于基因选择和微阵列数据分类。
IEEE/ACM Trans Comput Biol Bioinform. 2010 Jan-Mar;7(1):108-17. doi: 10.1109/TCBB.2008.46.
5
Filter and Wrapper Stacking Ensemble (FWSE): a robust approach for reliable biomarker discovery in high-dimensional omics data.过滤器和包装器堆叠集成 (FWSE):一种在高维组学数据中可靠发现生物标志物的稳健方法。
Brief Bioinform. 2023 Sep 22;24(6). doi: 10.1093/bib/bbad382.
6
A new hybrid filter/wrapper algorithm for feature selection in classification.一种用于分类中特征选择的新型混合过滤/包装算法。
Anal Chim Acta. 2019 Nov 8;1080:43-54. doi: 10.1016/j.aca.2019.06.054. Epub 2019 Jun 28.
7
A TRIZ-inspired bat algorithm for gene selection in cancer classification.基于 TRIZ 的蝙蝠算法在癌症分类中的基因选择。
Genomics. 2020 Jan;112(1):114-126. doi: 10.1016/j.ygeno.2019.09.015. Epub 2019 Oct 30.
8
Identification of potential biomarkers on microarray data using distributed gene selection approach.基于分布式基因选择方法的芯片数据中潜在生物标志物的识别。
Math Biosci. 2019 Sep;315:108230. doi: 10.1016/j.mbs.2019.108230. Epub 2019 Jul 18.
9
A novel feature selection approach for biomedical data classification.一种用于生物医学数据分类的新特征选择方法。
J Biomed Inform. 2010 Feb;43(1):15-23. doi: 10.1016/j.jbi.2009.07.008. Epub 2009 Jul 30.
10
Development of a two-stage gene selection method that incorporates a novel hybrid approach using the cuckoo optimization algorithm and harmony search for cancer classification.一种两阶段基因选择方法的开发,该方法结合了一种使用布谷鸟优化算法和和声搜索的新型混合方法用于癌症分类。
J Biomed Inform. 2017 Mar;67:11-20. doi: 10.1016/j.jbi.2017.01.016. Epub 2017 Feb 3.

引用本文的文献

1
Enhancement of Classifier Performance with Adam and RanAdam Hyper-Parameter Tuning for Lung Cancer Detection from Microarray Data-In Pursuit of Precision.通过Adam和RanAdam超参数调优提高从微阵列数据检测肺癌的分类器性能——追求精准度
Bioengineering (Basel). 2024 Mar 26;11(4):314. doi: 10.3390/bioengineering11040314.
2
Refining breast cancer biomarker discovery and drug targeting through an advanced data-driven approach.通过先进的数据驱动方法改进乳腺癌生物标志物的发现和药物靶向。
BMC Bioinformatics. 2024 Jan 22;25(1):33. doi: 10.1186/s12859-024-05657-1.
3
A comprehensive survey on computational learning methods for analysis of gene expression data.

本文引用的文献

1
Applications of machine learning in cancer prediction and prognosis.机器学习在癌症预测和预后中的应用。
Cancer Inform. 2007 Feb 11;2:59-77.
2
Efficient generalized matrix approximations for biomarker discovery and visualization in gene expression data.用于基因表达数据中生物标志物发现与可视化的高效广义矩阵近似法。
Comput Syst Bioinformatics Conf. 2006:133-44.
3
Text mining biomedical literature for discovering gene-to-gene relationships: a comparative study of algorithms.挖掘生物医学文献以发现基因与基因之间的关系:算法的比较研究
关于用于基因表达数据分析的计算学习方法的全面综述。
Front Mol Biosci. 2022 Nov 7;9:907150. doi: 10.3389/fmolb.2022.907150. eCollection 2022.
4
Classification of Microarray Data Using Kernel Fuzzy Inference System.使用核模糊推理系统对微阵列数据进行分类
Int Sch Res Notices. 2014 Aug 21;2014:769159. doi: 10.1155/2014/769159. eCollection 2014.
5
Identification of common tumor signatures based on gene set enrichment analysis.基于基因集富集分析的常见肿瘤特征识别。
In Silico Biol. 2011;11(1-2):1-10. doi: 10.3233/ISB-2012-0440.
6
Data mining approaches for genome-wide association of mood disorders.用于情绪障碍全基因组关联研究的数据挖掘方法。
Psychiatr Genet. 2012 Apr;22(2):55-61. doi: 10.1097/YPG.0b013e32834dc40d.
7
Gene expression profiles identify inflammatory signatures in dendritic cells.基因表达谱可鉴定树突状细胞中的炎症特征。
PLoS One. 2010 Feb 24;5(2):e9404. doi: 10.1371/journal.pone.0009404.
8
Feature selection for predicting tumor metastases in microarray experiments using paired design.使用配对设计在微阵列实验中预测肿瘤转移的特征选择
Cancer Inform. 2007 Mar 20;3:213-8.
9
Fuzzy logic for elimination of redundant information of microarray data.用于消除微阵列数据冗余信息的模糊逻辑。
Genomics Proteomics Bioinformatics. 2008 Jun;6(2):61-73. doi: 10.1016/S1672-0229(08)60021-2.
IEEE/ACM Trans Comput Biol Bioinform. 2005 Jan-Mar;2(1):62-76. doi: 10.1109/TCBB.2005.14.
4
Gene selection and classification of microarray data using random forest.使用随机森林进行微阵列数据的基因选择与分类
BMC Bioinformatics. 2006 Jan 6;7:3. doi: 10.1186/1471-2105-7-3.
5
Biomarker discovery in microarray gene expression data with Gaussian processes.利用高斯过程在微阵列基因表达数据中发现生物标志物。
Bioinformatics. 2005 Aug 15;21(16):3385-93. doi: 10.1093/bioinformatics/bti526. Epub 2005 Jun 2.
6
Minimum redundancy feature selection from microarray gene expression data.从微阵列基因表达数据中进行最小冗余特征选择。
J Bioinform Comput Biol. 2005 Apr;3(2):185-205. doi: 10.1142/s0219720005001004.
7
A comparative study on feature selection methods for drug discovery.药物发现中特征选择方法的比较研究。
J Chem Inf Comput Sci. 2004 Sep-Oct;44(5):1823-8. doi: 10.1021/ci049875d.
8
Filter versus wrapper gene selection approaches in DNA microarray domains.DNA微阵列领域中过滤法与包装法基因选择方法
Artif Intell Med. 2004 Jun;31(2):91-103. doi: 10.1016/j.artmed.2004.01.007.
9
A comparative study of feature selection and multiclass classification methods for tissue classification based on gene expression.基于基因表达的组织分类中特征选择与多类分类方法的比较研究
Bioinformatics. 2004 Oct 12;20(15):2429-37. doi: 10.1093/bioinformatics/bth267. Epub 2004 Apr 15.
10
A comparative study on feature selection and classification methods using gene expression profiles and proteomic patterns.一项关于利用基因表达谱和蛋白质组学模式进行特征选择和分类方法的比较研究。
Genome Inform. 2002;13:51-60.