• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于系统抽样和层次聚类算法的基因芯片表达数据中显著基因的挖掘。

Digging for Significant Genes in Microarray Expression Data Based on Systematic Sampling and Hierarchal Clustering Algorithm.

机构信息

University of Sulaimani, Collage of Science, Computer Department, Sulaymaniyah, Iraq.

出版信息

Adv Exp Med Biol. 2021;1338:1-6. doi: 10.1007/978-3-030-78775-2_1.

DOI:10.1007/978-3-030-78775-2_1
PMID:34973003
Abstract

Obesity is a worldwide health problem. Eating habits have changed during this decade and an increase in high-fat foods as well as sugar intake has been observed, which is associated with obesity and weight gaining. Therefore, in this chapter, we have analysed microarray expression data for obese and lean individuals. The microarray technology simultaneously records the expression levels of thousands of genes across related samples and during biological process. The microarray data sets are enriched with crucial information which have to be examined. In the study discussed in this chapter, the microarray data sets are pre-processed prior to analysis, in which upregulated and downregulated gene groups have been identified. Clustering is one of the learning techniques and it is applied in different fields of study. Clustering with microarray data can be accomplished based on genes or samples and depending on the type of datasets. Hierarchal clustering algorithm was used to detect gene patterns in our candidate datasets, since microarray data are considered big and complex. Systematic sampling technique was used to reduce the complexity of microarray datasets and to enhance the clustering quality. This technique is a simple and conductive sampling technique. The proposed algorithm, that is, Systematic Sampling with Hierarchal Clustering (SSHC), could detect significant gene patterns in the datasets, and the proposed system (SSHC) shows a better performance. The validity index utilized to evaluate the SSHC algorithm is adjusted Rand index (ARI).

摘要

肥胖是一个全球性的健康问题。在这十年中,饮食习惯发生了变化,高脂肪食物以及糖的摄入量增加,这与肥胖和体重增加有关。因此,在本章中,我们分析了肥胖和正常个体的基因表达数据。基因表达谱芯片技术可以同时记录数千个相关样本在生物过程中的表达水平。基因表达谱芯片数据集中包含了大量需要分析的关键信息。在本章中讨论的研究中,在进行分析之前,对基因表达谱芯片数据集进行了预处理,确定了上调和下调的基因群。聚类是一种学习技术,应用于不同的研究领域。基于基因或样本,可以根据数据集的类型对基因表达谱芯片数据进行聚类。由于基因表达谱芯片数据通常较大且复杂,因此使用层次聚类算法来检测我们的候选数据集的基因模式。系统抽样技术用于降低基因表达谱芯片数据集的复杂性并提高聚类质量。该技术是一种简单而有效的抽样技术。提出的算法,即层次聚类的系统抽样(SSHC),可以检测数据集中的显著基因模式,并且所提出的系统(SSHC)显示出更好的性能。用于评估 SSHC 算法的有效性指标是调整 Rand 指数(ARI)。

相似文献

1
Digging for Significant Genes in Microarray Expression Data Based on Systematic Sampling and Hierarchal Clustering Algorithm.基于系统抽样和层次聚类算法的基因芯片表达数据中显著基因的挖掘。
Adv Exp Med Biol. 2021;1338:1-6. doi: 10.1007/978-3-030-78775-2_1.
2
Gene Classification Based on Multi-Class SVMs with Systematic Sampling and Hierarchical Clustering (SSHC) Algorithm.基于系统抽样和层次聚类 (SSHC) 算法的多类 SVM 基因分类。
Adv Exp Med Biol. 2021;1338:231-237. doi: 10.1007/978-3-030-78775-2_28.
3
Combining Pareto-optimal clusters using supervised learning for identifying co-expressed genes.使用监督学习组合帕累托最优聚类以识别共表达基因。
BMC Bioinformatics. 2009 Jan 20;10:27. doi: 10.1186/1471-2105-10-27.
4
Metric for measuring the effectiveness of clustering of DNA microarray expression.用于测量 DNA 微阵列表达聚类有效性的度量。
BMC Bioinformatics. 2006 Sep 6;7 Suppl 2(Suppl 2):S5. doi: 10.1186/1471-2105-7-S2-S5.
5
Graph-based consensus clustering for class discovery from gene expression data.基于图的共识聚类用于从基因表达数据中发现类别
Bioinformatics. 2007 Nov 1;23(21):2888-96. doi: 10.1093/bioinformatics/btm463. Epub 2007 Sep 14.
6
A formal concept analysis approach to consensus clustering of multi-experiment expression data.一种正式的概念分析方法,用于多实验表达数据的共识聚类。
BMC Bioinformatics. 2014 May 19;15:151. doi: 10.1186/1471-2105-15-151.
7
Rank-based clustering analysis for the time-course microarray data.基于秩次的时间序列微阵列数据聚类分析。
J Bioinform Comput Biol. 2009 Feb;7(1):75-91. doi: 10.1142/s0219720009004035.
8
Biclustering of microarray data with MOSPO based on crowding distance.基于拥挤距离使用MOSPO对微阵列数据进行双聚类分析。
BMC Bioinformatics. 2009 Apr 29;10 Suppl 4(Suppl 4):S9. doi: 10.1186/1471-2105-10-S4-S9.
9
Analysis of a Gibbs sampler method for model-based clustering of gene expression data.一种基于模型的基因表达数据聚类的吉布斯采样器方法分析。
Bioinformatics. 2008 Jan 15;24(2):176-83. doi: 10.1093/bioinformatics/btm562. Epub 2007 Nov 22.
10
Comprehensive evaluation of matrix factorization methods for the analysis of DNA microarray gene expression data.基于 DNA 微阵列基因表达数据的矩阵分解方法分析的综合评估。
BMC Bioinformatics. 2011;12 Suppl 13(Suppl 13):S8. doi: 10.1186/1471-2105-12-S13-S8. Epub 2011 Nov 30.