• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于基因表达数据的渐进式模糊挖掘的基因功能预测。

Incremental fuzzy mining of gene expression data for gene function prediction.

机构信息

Department of Computing, The Hong Kong Polytechnic University, Hong Kong, China.

出版信息

IEEE Trans Biomed Eng. 2011 May;58(5):1246-52. doi: 10.1109/TBME.2010.2047724. Epub 2010 Apr 15.

DOI:10.1109/TBME.2010.2047724
PMID:20403777
Abstract

Due to the complexity of the underlying biological processes, gene expression data obtained from DNA microarray technologies are typically noisy and have very high dimensionality and these make the mining of such data for gene function prediction very difficult. To tackle these difficulties, we propose to use an incremental fuzzy mining technique called incremental fuzzy mining (IFM). By transforming quantitative expression values into linguistic terms, such as highly or lowly expressed, IFM can effectively capture heterogeneity in expression data for pattern discovery. It does so using a fuzzy measure to determine if interesting association patterns exist between the linguistic gene expression levels. Based on these patterns, IFM can make accurate gene function predictions and these predictions can be made in such a way that each gene can be allowed to belong to more than one functional class with different degrees of membership. Gene function prediction problem can be formulated both as classification and clustering problems, and IFM can be used either as a classification technique or together with existing clustering algorithms to improve the cluster groupings discovered for greater prediction accuracies. IFM is characterized also by its being an incremental data mining technique so that the discovered patterns can be continually refined based only on newly collected data without the need for retraining using the whole dataset. For performance evaluation, IFM has been tested with real expression datasets for both classification and clustering tasks. Experimental results show that it can effectively uncover hidden patterns for accurate gene function predictions.

摘要

由于潜在生物过程的复杂性,从 DNA 微阵列技术获得的基因表达数据通常是嘈杂的,并且具有非常高的维度,这使得挖掘这些数据以进行基因功能预测非常困难。为了解决这些困难,我们提出使用一种称为增量模糊挖掘(IFM)的增量模糊挖掘技术。通过将定量表达值转换为语言术语,例如高度或低度表达,IFM 可以有效地捕获表达数据中的异质性以进行模式发现。它通过使用模糊测度来确定语言基因表达水平之间是否存在有趣的关联模式。基于这些模式,IFM 可以进行准确的基因功能预测,并且可以以允许每个基因以不同程度的隶属度属于多个功能类的方式进行预测。基因功能预测问题可以被表述为分类和聚类问题,并且 IFM 可以被用作分类技术或与现有的聚类算法一起使用,以提高发现的聚类分组,从而提高预测精度。IFM 的特点还在于它是一种增量数据挖掘技术,因此可以仅基于新收集的数据不断细化发现的模式,而无需使用整个数据集进行重新训练。为了进行性能评估,IFM 已经在分类和聚类任务中使用真实表达数据集进行了测试。实验结果表明,它可以有效地揭示隐藏模式,以进行准确的基因功能预测。

相似文献

1
Incremental fuzzy mining of gene expression data for gene function prediction.基于基因表达数据的渐进式模糊挖掘的基因功能预测。
IEEE Trans Biomed Eng. 2011 May;58(5):1246-52. doi: 10.1109/TBME.2010.2047724. Epub 2010 Apr 15.
2
An iterative data mining approach for mining overlapping coexpression patterns in noisy gene expression data.一种用于在嘈杂基因表达数据中挖掘重叠共表达模式的迭代数据挖掘方法。
IEEE Trans Nanobioscience. 2009 Sep;8(3):252-8. doi: 10.1109/TNB.2009.2026747. Epub 2009 Jul 14.
3
Data mining of gene expression data by fuzzy and hybrid fuzzy methods.利用模糊和混合模糊方法对基因表达数据进行数据挖掘。
IEEE Trans Inf Technol Biomed. 2010 Jan;14(1):23-9. doi: 10.1109/TITB.2009.2033590. Epub 2009 Oct 20.
4
Efficiently mining time-delayed gene expression patterns.高效挖掘时间延迟基因表达模式。
IEEE Trans Syst Man Cybern B Cybern. 2010 Apr;40(2):400-11. doi: 10.1109/TSMCB.2009.2025564. Epub 2009 Oct 30.
5
A novel approach for discovering overlapping clusters in gene expression data.一种在基因表达数据中发现重叠簇的新方法。
IEEE Trans Biomed Eng. 2009 Jul;56(7):1803-9. doi: 10.1109/TBME.2009.2015055. Epub 2009 Feb 20.
6
Mixture classification model based on clinical markers for breast cancer prognosis.基于临床标志物的乳腺癌预后混合分类模型。
Artif Intell Med. 2010 Feb-Mar;48(2-3):129-37. doi: 10.1016/j.artmed.2009.07.008. Epub 2009 Dec 14.
7
Clustering microarray gene expression data using weighted Chinese restaurant process.使用加权中国餐馆过程对微阵列基因表达数据进行聚类
Bioinformatics. 2006 Aug 15;22(16):1988-97. doi: 10.1093/bioinformatics/btl284. Epub 2006 Jun 9.
8
Detecting clusters of different geometrical shapes in microarray gene expression data.在微阵列基因表达数据中检测不同几何形状的聚类。
Bioinformatics. 2005 May 1;21(9):1927-34. doi: 10.1093/bioinformatics/bti251. Epub 2005 Jan 12.
9
Fuzzy-rough supervised attribute clustering algorithm and classification of microarray data.模糊粗糙监督属性聚类算法与微阵列数据分类
IEEE Trans Syst Man Cybern B Cybern. 2011 Feb;41(1):222-33. doi: 10.1109/TSMCB.2010.2050684. Epub 2010 Jun 10.
10
Discovering interesting molecular substructures for molecular classification.发现分子分类的有趣分子亚结构。
IEEE Trans Nanobioscience. 2010 Jun;9(2):77-89. doi: 10.1109/TNB.2010.2042609.

引用本文的文献

1
Discovery and disentanglement of aligned residue associations from aligned pattern clusters to reveal subgroup characteristics.从比对模式簇中发现并解开比对残基关联以揭示亚组特征。
BMC Med Genomics. 2018 Nov 20;11(Suppl 5):103. doi: 10.1186/s12920-018-0417-z.