• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于非参数相关性的基因表达双聚类新方法

A new measure for gene expression biclustering based on non-parametric correlation.

机构信息

Intelligent Systems Group, Department of Computer Sciences and Artificial Intelligence, University of the Basque Country, P.O. Box 649, 20080 Donostia - San Sebastian, Spain.

出版信息

Comput Methods Programs Biomed. 2013 Dec;112(3):367-97. doi: 10.1016/j.cmpb.2013.07.025. Epub 2013 Aug 19.

DOI:10.1016/j.cmpb.2013.07.025
PMID:24079964
Abstract

BACKGROUND

One of the emerging techniques for performing the analysis of the DNA microarray data known as biclustering is the search of subsets of genes and conditions which are coherently expressed. These subgroups provide clues about the main biological processes. Until now, different approaches to this problem have been proposed. Most of them use the mean squared residue as quality measure but relevant and interesting patterns can not be detected such as shifting, or scaling patterns. Furthermore, recent papers show that there exist new coherence patterns involved in different kinds of cancer and tumors such as inverse relationships between genes which can not be captured.

RESULTS

The proposed measure is called Spearman's biclustering measure (SBM) which performs an estimation of the quality of a bicluster based on the non-linear correlation among genes and conditions simultaneously. The search of biclusters is performed by using a evolutionary technique called estimation of distribution algorithms which uses the SBM measure as fitness function. This approach has been examined from different points of view by using artificial and real microarrays. The assessment process has involved the use of quality indexes, a set of bicluster patterns of reference including new patterns and a set of statistical tests. It has been also examined the performance using real microarrays and comparing to different algorithmic approaches such as Bimax, CC, OPSM, Plaid and xMotifs.

CONCLUSIONS

SBM shows several advantages such as the ability to recognize more complex coherence patterns such as shifting, scaling and inversion and the capability to selectively marginalize genes and conditions depending on the statistical significance.

摘要

背景

DNA 微阵列数据分析的新兴技术之一是对基因和条件进行一致表达的子群搜索。这些子群提供了关于主要生物过程的线索。到目前为止,已经提出了许多针对这个问题的方法。大多数方法都使用均方残差作为质量度量标准,但无法检测到相关且有趣的模式,例如移位或缩放模式。此外,最近的论文表明,在不同类型的癌症和肿瘤中存在新的一致性模式,例如不能捕获的基因之间的反向关系。

结果

所提出的度量标准称为 Spearman 的双聚类度量(SBM),它根据基因和条件之间的非线性相关性同时对双聚类的质量进行估计。通过使用称为分布估计算法的进化技术来搜索双聚类,该算法使用 SBM 度量作为适应度函数。已经从不同的角度使用人工和真实微阵列对该方法进行了检查。评估过程涉及使用质量指标、一组包括新模式的双聚类模式参考集和一组统计检验。还使用真实微阵列检查了性能,并与不同的算法方法(如 Bimax、CC、OPSM、Plaid 和 xMotifs)进行了比较。

结论

SBM 具有多种优势,例如能够识别更复杂的一致性模式(如移位、缩放和反转)的能力,以及根据统计意义选择性地边缘化基因和条件的能力。

相似文献

1
A new measure for gene expression biclustering based on non-parametric correlation.基于非参数相关性的基因表达双聚类新方法
Comput Methods Programs Biomed. 2013 Dec;112(3):367-97. doi: 10.1016/j.cmpb.2013.07.025. Epub 2013 Aug 19.
2
Biclustering of gene expression data by correlation-based scatter search.基于相关性散列搜索的基因表达数据的双聚类。
BioData Min. 2011 Jan 24;4(1):3. doi: 10.1186/1756-0381-4-3.
3
Measuring the quality of linear patterns in biclusters.测量双簇中线性模式的质量。
Methods. 2015 Jul 15;83:18-27. doi: 10.1016/j.ymeth.2015.04.005. Epub 2015 Apr 15.
4
Shifting and scaling patterns from gene expression data.基因表达数据中的转移和缩放模式。
Bioinformatics. 2005 Oct 15;21(20):3840-5. doi: 10.1093/bioinformatics/bti641. Epub 2005 Sep 6.
5
COSCEB: Comprehensive search for column-coherent evolution biclusters and its application to hub gene identification.COSCEB:列一致进化双聚类的全面搜索及其在枢纽基因识别中的应用。
J Biosci. 2019 Jun;44(2).
6
A systematic comparison and evaluation of biclustering methods for gene expression data.基因表达数据双聚类方法的系统比较与评估
Bioinformatics. 2006 May 1;22(9):1122-9. doi: 10.1093/bioinformatics/btl060. Epub 2006 Feb 24.
7
A novel coherence measure for discovering scaling biclusters from gene expression data.一种用于从基因表达数据中发现缩放双聚类的新型相干度量。
J Bioinform Comput Biol. 2009 Oct;7(5):853-68. doi: 10.1142/s0219720009004370.
8
A graph spectrum based geometric biclustering algorithm.基于图谱的几何二分聚类算法。
J Theor Biol. 2013 Jan 21;317:200-11. doi: 10.1016/j.jtbi.2012.10.012. Epub 2012 Oct 16.
9
Shifting-and-Scaling Correlation Based Biclustering Algorithm.基于移位-缩放相关性的双聚类算法
IEEE/ACM Trans Comput Biol Bioinform. 2014 Nov-Dec;11(6):1239-52. doi: 10.1109/TCBB.2014.2323054.
10
Biclustering on expression data: A review.基于表达数据的双聚类分析:综述
J Biomed Inform. 2015 Oct;57:163-80. doi: 10.1016/j.jbi.2015.06.028. Epub 2015 Jul 6.

引用本文的文献

1
Biclustering data analysis: a comprehensive survey.双聚类数据分析:全面综述。
Brief Bioinform. 2024 May 23;25(4). doi: 10.1093/bib/bbae342.
2
ARES: Automated Risk Estimation in Smart Sensor Environments.ARES:智能传感器环境中的自动风险评估。
Sensors (Basel). 2020 Aug 17;20(16):4617. doi: 10.3390/s20164617.
3
Pairwise gene GO-based measures for biclustering of high-dimensional expression data.基于成对基因GO的高维表达数据双聚类方法
BioData Min. 2018 Mar 27;11:4. doi: 10.1186/s13040-018-0165-9. eCollection 2018.
4
MCbiclust: a novel algorithm to discover large-scale functionally related gene sets from massive transcriptomics data collections.MCbiclust:一种从海量转录组学数据集中发现大规模功能相关基因集的新算法。
Nucleic Acids Res. 2017 Sep 6;45(15):8712-8730. doi: 10.1093/nar/gkx590.
5
A New Approach for Mining Order-Preserving Submatrices Based on All Common Subsequences.一种基于所有公共子序列挖掘保序子矩阵的新方法。
Comput Math Methods Med. 2015;2015:680434. doi: 10.1155/2015/680434. Epub 2015 May 28.
6
Quality measures for gene expression biclusters.基因表达双聚类的质量度量
PLoS One. 2015 Mar 12;10(3):e0115497. doi: 10.1371/journal.pone.0115497. eCollection 2015.
7
biDCG: a new method for discovering global features of DNA microarray data via an iterative re-clustering procedure.双向密度聚类图:一种通过迭代重新聚类过程发现DNA微阵列数据全局特征的新方法。
PLoS One. 2014 Jul 21;9(7):e102445. doi: 10.1371/journal.pone.0102445. eCollection 2014.