• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于基因表达数据的用于肿瘤分类的新型稀疏编码算法。

A novel sparse coding algorithm for classification of tumors based on gene expression data.

作者信息

Kolali Khormuji Morteza, Bazrafkan Mehrnoosh

机构信息

Islamic Azad University, Science and Research Branch, Bushehr, Iran.

出版信息

Med Biol Eng Comput. 2016 Jun;54(6):869-76. doi: 10.1007/s11517-015-1382-8. Epub 2015 Sep 4.

DOI:10.1007/s11517-015-1382-8
PMID:26337064
Abstract

High-dimensional genomic and proteomic data play an important role in many applications in medicine such as prognosis of diseases, diagnosis, prevention and molecular biology, to name a few. Classifying such data is a challenging task due to the various issues such as curse of dimensionality, noise and redundancy. Recently, some researchers have used the sparse representation (SR) techniques to analyze high-dimensional biological data in various applications in classification of cancer patients based on gene expression datasets. A common problem with all SR-based biological data classification methods is that they cannot utilize the topological (geometrical) structure of data. More precisely, these methods transfer the data into sparse feature space without preserving the local structure of data points. In this paper, we proposed a novel SR-based cancer classification algorithm based on gene expression data that takes into account the geometrical information of all data. Precisely speaking, we incorporate the local linear embedding algorithm into the sparse coding framework, by which we can preserve the geometrical structure of all data. For performance comparison, we applied our algorithm on six tumor gene expression datasets, by which we demonstrate that the proposed method achieves higher classification accuracy than state-of-the-art SR-based tumor classification algorithms.

摘要

高维基因组和蛋白质组数据在医学的许多应用中发挥着重要作用,如疾病预后、诊断、预防和分子生物学等。由于维度诅咒、噪声和冗余等各种问题,对这类数据进行分类是一项具有挑战性的任务。最近,一些研究人员使用稀疏表示(SR)技术,基于基因表达数据集,在癌症患者分类的各种应用中分析高维生物数据。所有基于SR的生物数据分类方法的一个共同问题是,它们无法利用数据的拓扑(几何)结构。更确切地说,这些方法将数据转换到稀疏特征空间,而不保留数据点的局部结构。在本文中,我们提出了一种基于基因表达数据的新型基于SR的癌症分类算法,该算法考虑了所有数据的几何信息。确切地说,我们将局部线性嵌入算法纳入稀疏编码框架,通过这种方式我们可以保留所有数据的几何结构。为了进行性能比较,我们将我们的算法应用于六个肿瘤基因表达数据集,由此证明所提出的方法比基于SR的现有肿瘤分类算法具有更高的分类准确率。

相似文献

1
A novel sparse coding algorithm for classification of tumors based on gene expression data.一种基于基因表达数据的用于肿瘤分类的新型稀疏编码算法。
Med Biol Eng Comput. 2016 Jun;54(6):869-76. doi: 10.1007/s11517-015-1382-8. Epub 2015 Sep 4.
2
Tumor classification based on orthogonal linear discriminant analysis.基于正交线性判别分析的肿瘤分类
Biomed Mater Eng. 2014;24(1):1399-405. doi: 10.3233/BME-130944.
3
Sparse representation approaches for the classification of high-dimensional biological data.用于高维生物数据分类的稀疏表示方法。
BMC Syst Biol. 2013;7 Suppl 4(Suppl 4):S6. doi: 10.1186/1752-0509-7-S4-S6. Epub 2013 Oct 23.
4
Molecular cancer classification using a meta-sample-based regularized robust coding method.基于元样本的正则化稳健编码方法进行分子癌症分类。
BMC Bioinformatics. 2014;15 Suppl 15(Suppl 15):S2. doi: 10.1186/1471-2105-15-S15-S2. Epub 2014 Dec 3.
5
Sparse representation for tumor classification based on feature extraction using latent low-rank representation.基于潜在低秩表示特征提取的肿瘤分类稀疏表示
Biomed Res Int. 2014;2014:420856. doi: 10.1155/2014/420856. Epub 2014 Feb 11.
6
Metasample-based sparse representation for tumor classification.基于元样本稀疏表示的肿瘤分类。
IEEE/ACM Trans Comput Biol Bioinform. 2011 Sep-Oct;8(5):1273-82. doi: 10.1109/TCBB.2011.20.
7
Maxdenominator Reweighted Sparse Representation for Tumor Classification.最大公因式重加权稀疏表示用于肿瘤分类。
Sci Rep. 2017 Apr 10;7:46030. doi: 10.1038/srep46030.
8
Improving gene expression cancer molecular pattern discovery using nonnegative principal component analysis.使用非负主成分分析改进基因表达癌症分子模式发现
Genome Inform. 2008;21:200-11.
9
Graph regularized sparse coding for image representation.基于图正则化稀疏编码的图像表示方法。
IEEE Trans Image Process. 2011 May;20(5):1327-36. doi: 10.1109/TIP.2010.2090535. Epub 2010 Nov 1.
10
Sparse representation for classification of tumors using gene expression data.利用基因表达数据进行肿瘤分类的稀疏表示
J Biomed Biotechnol. 2009;2009:403689. doi: 10.1155/2009/403689. Epub 2009 Mar 15.

引用本文的文献

1
Sparse representation learning derives biological features with explicit gene weights from the Allen Mouse Brain Atlas.稀疏表示学习从艾伦老鼠大脑图谱中获得具有明确基因权重的生物学特征。
PLoS One. 2023 Mar 6;18(3):e0282171. doi: 10.1371/journal.pone.0282171. eCollection 2023.
2
Developing a Novel Machine Learning-Based Classification Scheme for Predicting SPCs in Breast Cancer Survivors.开发一种基于机器学习的新型分类方案,用于预测乳腺癌幸存者的特定并发症。
Front Genet. 2019 Sep 18;10:848. doi: 10.3389/fgene.2019.00848. eCollection 2019.
3
Gene selection for microarray data classification via subspace learning and manifold regularization.

本文引用的文献

1
Biomedical application of fuzzy association rules for identifying breast cancer biomarkers.模糊关联规则在识别乳腺癌生物标志物中的生物医学应用。
Med Biol Eng Comput. 2012 Sep;50(9):981-90. doi: 10.1007/s11517-012-0914-8. Epub 2012 May 24.
2
Sparse representation via ℓ1-minimization for underdetermined systems in classification of tumors with gene expression data.通过 ℓ1 最小化对具有基因表达数据的肿瘤分类中的欠定系统进行稀疏表示。
Annu Int Conf IEEE Eng Med Biol Soc. 2011;2011:3362-6. doi: 10.1109/IEMBS.2011.6090911.
3
Metasample-based sparse representation for tumor classification.
基于子空间学习和流形正则化的基因选择在微阵列数据分类中的应用。
Med Biol Eng Comput. 2018 Jul;56(7):1271-1284. doi: 10.1007/s11517-017-1751-6. Epub 2017 Dec 19.
4
Lossless medical image compression using geometry-adaptive partitioning and least square-based prediction.基于几何自适应分区和最小二乘预测的无损医学图像压缩。
Med Biol Eng Comput. 2018 Jun;56(6):957-966. doi: 10.1007/s11517-017-1741-8. Epub 2017 Nov 6.
5
Liver segmentation from CT images using a sparse priori statistical shape model (SP-SSM).使用稀疏先验统计形状模型(SP-SSM)从CT图像中进行肝脏分割。
PLoS One. 2017 Oct 5;12(10):e0185249. doi: 10.1371/journal.pone.0185249. eCollection 2017.
6
Highly undersampled MR image reconstruction using an improved dual-dictionary learning method with self-adaptive dictionaries.使用具有自适应字典的改进双字典学习方法进行高度欠采样磁共振图像重建。
Med Biol Eng Comput. 2017 May;55(5):807-822. doi: 10.1007/s11517-016-1556-z. Epub 2016 Aug 18.
基于元样本稀疏表示的肿瘤分类。
IEEE/ACM Trans Comput Biol Bioinform. 2011 Sep-Oct;8(5):1273-82. doi: 10.1109/TCBB.2011.20.
4
Prediction of cancer class with majority voting genetic programming classifier using gene expression data.使用基因表达数据,通过多数投票遗传编程分类器预测癌症类别。
IEEE/ACM Trans Comput Biol Bioinform. 2009 Apr-Jun;6(2):353-67. doi: 10.1109/TCBB.2007.70245.
5
Sparse representation for classification of tumors using gene expression data.利用基因表达数据进行肿瘤分类的稀疏表示
J Biomed Biotechnol. 2009;2009:403689. doi: 10.1155/2009/403689. Epub 2009 Mar 15.
6
Robust face recognition via sparse representation.基于稀疏表示的鲁棒人脸识别。
IEEE Trans Pattern Anal Mach Intell. 2009 Feb;31(2):210-27. doi: 10.1109/TPAMI.2008.79.
7
Development of two-stage SVM-RFE gene selection strategy for microarray expression data analysis.用于微阵列表达数据分析的两阶段支持向量机-递归特征消除基因选择策略的开发。
IEEE/ACM Trans Comput Biol Bioinform. 2007 Jul-Sep;4(3):365-81. doi: 10.1109/TCBB.2007.70224.
8
Classification of breast masses via nonlinear transformation of features based on a kernel matrix.基于核矩阵的特征非线性变换对乳腺肿块进行分类。
Med Biol Eng Comput. 2007 Aug;45(8):769-80. doi: 10.1007/s11517-007-0211-0. Epub 2007 Jul 21.
9
Application of simulated annealing to the biclustering of gene expression data.模拟退火算法在基因表达数据双聚类中的应用。
IEEE Trans Inf Technol Biomed. 2006 Jul;10(3):519-25. doi: 10.1109/titb.2006.872073.
10
Independent component analysis-based penalized discriminant method for tumor classification using gene expression data.基于独立成分分析的惩罚判别方法用于利用基因表达数据进行肿瘤分类
Bioinformatics. 2006 Aug 1;22(15):1855-62. doi: 10.1093/bioinformatics/btl190. Epub 2006 May 18.