• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基因表达分化的迭代无监督方法。

An Iterative Unsupervised Method for Gene Expression Differentiation.

机构信息

Faculty of Mathematics and Informatics, Sofia University "St. Kliment Ohridski", 125 Tsarigradsko Shosse Blvd., bl. 2, 1113 Sofia, Bulgaria.

出版信息

Genes (Basel). 2023 Feb 4;14(2):412. doi: 10.3390/genes14020412.

DOI:10.3390/genes14020412
PMID:36833339
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9956932/
Abstract

For several decades, intensive research for understanding gene activity and its role in organism's lives is the research focus of scientists in different areas. A part of these investigations is the analysis of gene expression data for selecting differentially expressed genes. Methods that identify the interested genes have been proposed on statistical data analysis. The problem is that there is no good agreement among them, as different results are produced by distinct methods. By taking the advantage of the unsupervised data analysis, an iterative clustering procedure that finds differentially expressed genes shows promising results. In the present paper, a comparative study of the clustering methods applied for gene expression analysis is presented to explicate the choice of the clustering algorithm implemented in the method. An investigation of different distance measures is provided to reveal those that increase the efficiency of the method in finding the real data structure. Further, the method is improved by incorporating an additional aggregation measure based on the standard deviation of the expression levels. Its usage increases the gene distinction as a new amount of differentially expressed genes is found. The method is summarized in a detailed procedure. The significance of the method is proved by an analysis of two mice strain data sets. The differentially expressed genes defined by the proposed method are compared with those selected by the well-known statistical methods applied to the same data set.

摘要

几十年来,深入研究基因活性及其在生物体生命中的作用一直是不同领域科学家的研究重点。这些研究的一部分是分析基因表达数据以选择差异表达基因。基于统计数据分析的识别感兴趣基因的方法已经被提出。问题在于它们之间没有很好的一致性,因为不同的方法会产生不同的结果。通过利用无监督数据分析,一种发现差异表达基因的迭代聚类过程显示出了有希望的结果。在本文中,对应用于基因表达分析的聚类方法进行了比较研究,以阐明所实现的聚类算法的选择。提供了对不同距离度量的研究,以揭示那些能够提高该方法在发现真实数据结构方面的效率的度量。此外,通过基于表达水平的标准差的附加聚合度量来改进该方法。发现了新的差异表达基因,从而增加了基因的区分度。该方法总结在一个详细的过程中。通过对两个老鼠品系数据集的分析,证明了该方法的有效性。所提出的方法定义的差异表达基因与应用于同一数据集的著名统计方法选择的基因进行了比较。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ff1/9956932/5817cc0fbb0d/genes-14-00412-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ff1/9956932/84c6ff559009/genes-14-00412-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ff1/9956932/d262357fc7a4/genes-14-00412-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ff1/9956932/d4541dfb413a/genes-14-00412-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ff1/9956932/dd8a54ea189f/genes-14-00412-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ff1/9956932/d06fa3b7705f/genes-14-00412-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ff1/9956932/32b0cb581315/genes-14-00412-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ff1/9956932/5817cc0fbb0d/genes-14-00412-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ff1/9956932/84c6ff559009/genes-14-00412-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ff1/9956932/d262357fc7a4/genes-14-00412-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ff1/9956932/d4541dfb413a/genes-14-00412-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ff1/9956932/dd8a54ea189f/genes-14-00412-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ff1/9956932/d06fa3b7705f/genes-14-00412-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ff1/9956932/32b0cb581315/genes-14-00412-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0ff1/9956932/5817cc0fbb0d/genes-14-00412-g007.jpg

相似文献

1
An Iterative Unsupervised Method for Gene Expression Differentiation.一种基因表达分化的迭代无监督方法。
Genes (Basel). 2023 Feb 4;14(2):412. doi: 10.3390/genes14020412.
2
Methods for evaluating clustering algorithms for gene expression data using a reference set of functional classes.使用功能类别参考集评估基因表达数据聚类算法的方法。
BMC Bioinformatics. 2006 Aug 31;7:397. doi: 10.1186/1471-2105-7-397.
3
Combining Pareto-optimal clusters using supervised learning for identifying co-expressed genes.使用监督学习组合帕累托最优聚类以识别共表达基因。
BMC Bioinformatics. 2009 Jan 20;10:27. doi: 10.1186/1471-2105-10-27.
4
A unified framework for finding differentially expressed genes from microarray experiments.一种从微阵列实验中寻找差异表达基因的统一框架。
BMC Bioinformatics. 2007 Sep 18;8:347. doi: 10.1186/1471-2105-8-347.
5
Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区,服用抗叶酸抗疟药物的人群中,叶酸补充剂与疟疾易感性和严重程度的关系。
Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.
6
Gene expression analysis in clear cell renal cell carcinoma using gene set enrichment analysis for biostatistical management.基于基因集富集分析的 clear cell 肾细胞癌基因表达分析用于生物统计学管理。
BJU Int. 2011 Jul;108(2 Pt 2):E29-35. doi: 10.1111/j.1464-410X.2010.09794.x. Epub 2011 Mar 16.
7
Robust complementary hierarchical clustering for gene expression data analysis by β-divergence.基于β散度的稳健互补层次聚类在基因表达数据分析中的应用。
J Biosci Bioeng. 2013 Sep;116(3):397-407. doi: 10.1016/j.jbiosc.2013.03.010. Epub 2013 Apr 19.
8
Clustering and re-clustering for pattern discovery in gene expression data.用于基因表达数据中模式发现的聚类和再聚类。
J Bioinform Comput Biol. 2005 Apr;3(2):281-301. doi: 10.1142/s0219720005001053.
9
Clustering gene expression data using a diffraction-inspired framework.基于衍射启发式框架的基因表达数据聚类。
Biomed Eng Online. 2012 Nov 19;11:85. doi: 10.1186/1475-925X-11-85.
10
A new unsupervised gene clustering algorithm based on the integration of biological knowledge into expression data.一种新的基于将生物学知识整合到表达数据中的无监督基因聚类算法。
BMC Bioinformatics. 2013 Feb 7;14:42. doi: 10.1186/1471-2105-14-42.

引用本文的文献

1
Special Issue: New Advances in Bioinformatics and Biomedical Engineering Using Machine Learning Techniques, IWBBIO-2022.特刊:机器学习技术在生物信息学和生物医学工程中的新进展,IWBBIO-2022。
Genes (Basel). 2023 Aug 1;14(8):1574. doi: 10.3390/genes14081574.

本文引用的文献

1
GMMchi: gene expression clustering using Gaussian mixture modeling.GMMchi:基于高斯混合模型的基因表达聚类。
BMC Bioinformatics. 2022 Nov 2;23(1):457. doi: 10.1186/s12859-022-05006-0.
2
A Python Clustering Analysis Protocol of Genes Expression Data Sets.基于基因表达数据集的 Python 聚类分析方案。
Genes (Basel). 2022 Oct 12;13(10):1839. doi: 10.3390/genes13101839.
3
Deep learning suggests that gene expression is encoded in all parts of a co-evolving interacting gene regulatory structure.深度学习表明,基因表达是由共同进化的相互作用基因调控结构的所有部分编码的。
Nat Commun. 2020 Dec 1;11(1):6141. doi: 10.1038/s41467-020-19921-4.
4
Machine learning techniques for sequence-based prediction of viral-host interactions between SARS-CoV-2 and human proteins.基于序列的 SARS-CoV-2 与人类蛋白质之间病毒-宿主相互作用的预测的机器学习技术。
Biomed J. 2020 Oct;43(5):438-450. doi: 10.1016/j.bj.2020.08.003. Epub 2020 Sep 3.
5
Comparative analysis of differential gene expression analysis tools for single-cell RNA sequencing data.单细胞 RNA 测序数据分析工具的差异基因表达分析比较。
BMC Bioinformatics. 2019 Jan 18;20(1):40. doi: 10.1186/s12859-019-2599-6.
6
Comparative analysis of differential gene expression tools for RNA sequencing time course data.RNA 测序时间序列数据差异基因表达工具的比较分析。
Brief Bioinform. 2019 Jan 18;20(1):288-298. doi: 10.1093/bib/bbx115.
7
Statistical methods for identifying differentially expressed genes in RNA-Seq experiments.RNA-Seq 实验中差异表达基因的识别的统计方法。
Cell Biosci. 2012 Jul 31;2(1):26. doi: 10.1186/2045-3701-2-26.
8
Evaluating gene expression in C57BL/6J and DBA/2J mouse striatum using RNA-Seq and microarrays.使用 RNA-Seq 和微阵列评估 C57BL/6J 和 DBA/2J 小鼠纹状体中的基因表达。
PLoS One. 2011 Mar 24;6(3):e17820. doi: 10.1371/journal.pone.0017820.
9
How does gene expression clustering work?基因表达聚类是如何工作的?
Nat Biotechnol. 2005 Dec;23(12):1499-501. doi: 10.1038/nbt1205-1499.
10
Analyzing microarray data using cluster analysis.使用聚类分析来分析微阵列数据。
Pharmacogenomics. 2003 Jan;4(1):41-52. doi: 10.1517/phgs.4.1.41.22581.