• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于邻域粗糙集和熵测度的肿瘤分类基因选择

Gene selection for tumor classification using neighborhood rough sets and entropy measures.

作者信息

Chen Yumin, Zhang Zunjun, Zheng Jianzhong, Ma Ying, Xue Yu

机构信息

College of Computer & Information Engineering, Xiamen University of Technology, Xiamen 361024, China.

Department of Urinary Surgery, The Third Xiamen Hospital of Fujian University of Traditional Chinese Medicine, Xiamen 316000, China.

出版信息

J Biomed Inform. 2017 Mar;67:59-68. doi: 10.1016/j.jbi.2017.02.007. Epub 2017 Feb 13.

DOI:10.1016/j.jbi.2017.02.007
PMID:28215562
Abstract

With the development of bioinformatics, tumor classification from gene expression data becomes an important useful technology for cancer diagnosis. Since a gene expression data often contains thousands of genes and a small number of samples, gene selection from gene expression data becomes a key step for tumor classification. Attribute reduction of rough sets has been successfully applied to gene selection field, as it has the characters of data driving and requiring no additional information. However, traditional rough set method deals with discrete data only. As for the gene expression data containing real-value or noisy data, they are usually employed by a discrete preprocessing, which may result in poor classification accuracy. In this paper, we propose a novel gene selection method based on the neighborhood rough set model, which has the ability of dealing with real-value data whilst maintaining the original gene classification information. Moreover, this paper addresses an entropy measure under the frame of neighborhood rough sets for tackling the uncertainty and noisy of gene expression data. The utilization of this measure can bring about a discovery of compact gene subsets. Finally, a gene selection algorithm is designed based on neighborhood granules and the entropy measure. Some experiments on two gene expression data show that the proposed gene selection is an effective method for improving the accuracy of tumor classification.

摘要

随着生物信息学的发展,基于基因表达数据的肿瘤分类成为癌症诊断中一项重要且有用的技术。由于基因表达数据通常包含数千个基因和少量样本,从基因表达数据中进行基因选择成为肿瘤分类的关键步骤。粗糙集的属性约简已成功应用于基因选择领域,因为它具有数据驱动且无需额外信息的特点。然而,传统粗糙集方法仅处理离散数据。对于包含实值或噪声数据的基因表达数据,通常采用离散预处理,这可能导致分类精度较差。在本文中,我们提出了一种基于邻域粗糙集模型的新型基因选择方法,该方法能够处理实值数据,同时保持原始基因分类信息。此外,本文在邻域粗糙集框架下提出了一种熵度量,用于处理基因表达数据的不确定性和噪声。该度量的使用可以发现紧凑的基因子集。最后,基于邻域粒度和熵度量设计了一种基因选择算法。在两个基因表达数据集上的一些实验表明,所提出的基因选择方法是提高肿瘤分类准确性的有效方法。

相似文献

1
Gene selection for tumor classification using neighborhood rough sets and entropy measures.基于邻域粗糙集和熵测度的肿瘤分类基因选择
J Biomed Inform. 2017 Mar;67:59-68. doi: 10.1016/j.jbi.2017.02.007. Epub 2017 Feb 13.
2
A Neighborhood Rough Sets-Based Attribute Reduction Method Using Lebesgue and Entropy Measures.一种基于邻域粗糙集的使用勒贝格测度和熵测度的属性约简方法。
Entropy (Basel). 2019 Feb 1;21(2):138. doi: 10.3390/e21020138.
3
An Attribute Reduction Method Using Neighborhood Entropy Measures in Neighborhood Rough Sets.一种基于邻域粗糙集邻域熵测度的属性约简方法。
Entropy (Basel). 2019 Feb 7;21(2):155. doi: 10.3390/e21020155.
4
Feature selection using mutual information based uncertainty measures for tumor classification.基于互信息不确定性度量的特征选择用于肿瘤分类
Biomed Mater Eng. 2014;24(1):763-70. doi: 10.3233/BME-130865.
5
Feature Genes Selection Using Fuzzy Rough Uncertainty Metric for Tumor Diagnosis.基于模糊粗糙不确定性度量的肿瘤诊断特征基因选择
Comput Math Methods Med. 2019 Jan 27;2019:6705648. doi: 10.1155/2019/6705648. eCollection 2019.
6
A Gene selection approach based on the fisher linear discriminant and the neighborhood rough set.基于 Fisher 线性判别和邻域粗糙集的基因选择方法。
Bioengineered. 2018 Jan 1;9(1):144-151. doi: 10.1080/21655979.2017.1403678. Epub 2017 Dec 19.
7
Gene Selection Integrated with Biological Knowledge for Plant Stress Response Using Neighborhood System and Rough Set Theory.利用邻域系统和粗糙集理论结合生物学知识进行植物胁迫响应的基因选择
IEEE/ACM Trans Comput Biol Bioinform. 2015 Mar-Apr;12(2):433-44. doi: 10.1109/TCBB.2014.2361329.
8
Feature Selection Combining Information Theory View and Algebraic View in the Neighborhood Decision System.邻域决策系统中结合信息论视角与代数视角的特征选择
Entropy (Basel). 2021 Jun 2;23(6):704. doi: 10.3390/e23060704.
9
A method of tumor classification based on wavelet packet transforms and neighborhood rough set.基于小波包变换和邻域粗糙集的肿瘤分类方法。
Comput Biol Med. 2010 Apr;40(4):430-7. doi: 10.1016/j.compbiomed.2010.02.007. Epub 2010 Mar 12.
10
Locally linear embedding and neighborhood rough set-based gene selection for gene expression data classification.基于局部线性嵌入和邻域粗糙集的基因选择用于基因表达数据分类。
Genet Mol Res. 2016 Aug 30;15(3):gmr8990. doi: 10.4238/gmr.15038990.

引用本文的文献

1
Unsupervised attribute reduction based on variable precision weighted neighborhood dependency.
iScience. 2024 Oct 29;27(12):111270. doi: 10.1016/j.isci.2024.111270. eCollection 2024 Dec 20.
2
A Novel Neighborhood Rough Set-Based Feature Selection Method and Its Application to Biomarker Identification of Schizophrenia.一种基于新型邻域粗糙集的特征选择方法及其在精神分裂症生物标志物识别中的应用。
IEEE J Biomed Health Inform. 2023 Jan;27(1):215-226. doi: 10.1109/JBHI.2022.3212479. Epub 2023 Jan 4.
3
Quantitative Detection of Gastrointestinal Tumor Markers Using a Machine Learning Algorithm and Multicolor Quantum Dot Biosensor.基于机器学习算法和多色量子点生物传感器的胃肠道肿瘤标志物定量检测。
Comput Intell Neurosci. 2022 Sep 1;2022:9022821. doi: 10.1155/2022/9022821. eCollection 2022.
4
Feature Selection Combining Information Theory View and Algebraic View in the Neighborhood Decision System.邻域决策系统中结合信息论视角与代数视角的特征选择
Entropy (Basel). 2021 Jun 2;23(6):704. doi: 10.3390/e23060704.
5
A novel gene expression test method of minimizing breast cancer risk in reduced cost and time by improving SVM-RFE gene selection method combined with LASSO.一种新的基因表达测试方法,通过改进 SVM-RFE 基因选择方法并结合 LASSO,以降低成本和时间的方式最小化乳腺癌风险。
J Integr Bioinform. 2020 Dec 29;18(2):139-153. doi: 10.1515/jib-2019-0110.
6
Feature Selection Using Approximate Conditional Entropy Based on Fuzzy Information Granule for Gene Expression Data Classification.基于模糊信息粒的近似条件熵在基因表达数据分类中的特征选择
Front Genet. 2021 Mar 30;12:631505. doi: 10.3389/fgene.2021.631505. eCollection 2021.
7
R.ROSETTA: an interpretable machine learning framework.R.ROSETTA:一个可解释的机器学习框架。
BMC Bioinformatics. 2021 Mar 6;22(1):110. doi: 10.1186/s12859-021-04049-z.
8
Dimensionality reduction using singular vectors.基于奇异向量的降维。
Sci Rep. 2021 Feb 15;11(1):3832. doi: 10.1038/s41598-021-83150-y.
9
Machine Learning Based Computational Gene Selection Models: A Survey, Performance Evaluation, Open Issues, and Future Research Directions.基于机器学习的计算基因选择模型:综述、性能评估、开放问题及未来研究方向
Front Genet. 2020 Dec 10;11:603808. doi: 10.3389/fgene.2020.603808. eCollection 2020.
10
An Attribute Reduction Method Using Neighborhood Entropy Measures in Neighborhood Rough Sets.一种基于邻域粗糙集邻域熵测度的属性约简方法。
Entropy (Basel). 2019 Feb 7;21(2):155. doi: 10.3390/e21020155.