• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

TriRNSC:基于受限邻域搜索的基因表达微阵列数据的三重聚类。

TriRNSC: triclustering of gene expression microarray data using restricted neighbourhood search.

机构信息

DST-FIST Bioinformatics Lab, Department of Computer Science and Engineering, International Institute of Information Technology (IIIT), Bhubaneswar, India.

出版信息

IET Syst Biol. 2020 Dec;14(6):323-333. doi: 10.1049/iet-syb.2020.0024.

DOI:10.1049/iet-syb.2020.0024
PMID:33399096
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8687346/
Abstract

Computational analysis of microarray data is crucial for understanding the gene behaviours and deriving meaningful results. Clustering and biclustering of gene expression microarray data in the unsupervised domain are extremely important as their outcomes directly dominate healthcare research in many aspects. However, these approaches fail when the time factor is added as the third dimension to the microarray datasets. This three-dimensional data set can be analysed using triclustering that discovers similar gene sets that pursue identical behaviour under a subset of conditions at a specific time point. A novel triclustering algorithm (TriRNSC) is proposed in this manuscript to discover meaningful triclusters in gene expression profiles. TriRNSC is based on restricted neighbourhood search clustering (RNSC), a popular graph-based clustering approach considering the genes, the experimental conditions and the time points at an instance. The performance of the proposed algorithm is evaluated in terms of volume and some performance measures. Gene Ontology and KEGG pathway analysis are used to validate the TriRNSC results biologically. The efficiency of TriRNSC indicates its capability and reliability and also demonstrates its usability over other state-of-art schemes. The proposed framework initiates the application of the RNSC algorithm in the triclustering of gene expression profiles.

摘要

微阵列数据分析的计算分析对于理解基因行为和得出有意义的结果至关重要。在无监督领域对基因表达微阵列数据进行聚类和双聚类非常重要,因为它们的结果直接主导着许多方面的医疗保健研究。然而,当时间因素作为微阵列数据集的第三个维度添加时,这些方法就会失败。可以使用三聚类来分析这个三维数据集,该方法可以发现相似的基因集,这些基因集在特定时间点的一组条件下表现出相同的行为。本文提出了一种新的三聚类算法(TriRNSC),用于发现基因表达谱中的有意义的三聚类。TriRNSC 基于受限邻域搜索聚类(RNSC),这是一种流行的基于图的聚类方法,考虑了基因、实验条件和时间点。根据体积和一些性能指标来评估所提出算法的性能。使用基因本体论和 KEGG 通路分析对 TriRNSC 结果进行生物学验证。TriRNSC 的效率表明了它的能力和可靠性,也证明了它在其他最先进的方案中的可用性。所提出的框架将 RNSC 算法应用于基因表达谱的三聚类中。

相似文献

1
TriRNSC: triclustering of gene expression microarray data using restricted neighbourhood search.TriRNSC:基于受限邻域搜索的基因表达微阵列数据的三重聚类。
IET Syst Biol. 2020 Dec;14(6):323-333. doi: 10.1049/iet-syb.2020.0024.
2
Discovering biclusters in gene expression data based on high-dimensional linear geometries.基于高维线性几何在基因表达数据中发现双簇。
BMC Bioinformatics. 2008 Apr 23;9:209. doi: 10.1186/1471-2105-9-209.
3
Multiobjective triclustering of time-series transcriptome data reveals key genes of biological processes.时间序列转录组数据的多目标三聚类揭示生物过程的关键基因。
BMC Bioinformatics. 2015 Jun 26;16:200. doi: 10.1186/s12859-015-0635-8.
4
TimesVector: a vectorized clustering approach to the analysis of time series transcriptome data from multiple phenotypes.TimesVector:一种用于分析来自多种表型的时间序列转录组数据的向量化聚类方法。
Bioinformatics. 2017 Dec 1;33(23):3827-3835. doi: 10.1093/bioinformatics/btw780.
5
Methods for evaluating clustering algorithms for gene expression data using a reference set of functional classes.使用功能类别参考集评估基因表达数据聚类算法的方法。
BMC Bioinformatics. 2006 Aug 31;7:397. doi: 10.1186/1471-2105-7-397.
6
Biclustering of microarray data with MOSPO based on crowding distance.基于拥挤距离使用MOSPO对微阵列数据进行双聚类分析。
BMC Bioinformatics. 2009 Apr 29;10 Suppl 4(Suppl 4):S9. doi: 10.1186/1471-2105-10-S4-S9.
7
G-Tric: generating three-way synthetic datasets with triclustering solutions.G-Tric:使用三聚类解决方案生成三路合成数据集。
BMC Bioinformatics. 2021 Jan 7;22(1):16. doi: 10.1186/s12859-020-03925-4.
8
THD-Tricluster: A robust triclustering technique and its application in condition specific change analysis in HIV-1 progression data.THD-Tricluster:一种稳健的三聚类技术及其在 HIV-1 进展数据中条件特异性变化分析中的应用。
Comput Biol Chem. 2018 Aug;75:154-167. doi: 10.1016/j.compbiolchem.2018.05.007. Epub 2018 May 7.
9
Mining 3D patterns from gene expression temporal data: a new tricluster evaluation measure.从基因表达时间数据中挖掘三维模式:一种新的三聚类评估方法。
ScientificWorldJournal. 2014;2014:624371. doi: 10.1155/2014/624371. Epub 2014 Mar 31.
10
Dynamic biclustering of microarray data by multi-objective immune optimization.基于多目标免疫优化算法的基因表达数据动态双聚类分析
BMC Genomics. 2011;12 Suppl 2(Suppl 2):S11. doi: 10.1186/1471-2164-12-S2-S11. Epub 2011 Jul 27.

本文引用的文献

1
Tri-Clustered Tensor Completion for Social-Aware Image Tag Refinement.三聚类张量补全在社交感知图像标签细化中的应用。
IEEE Trans Pattern Anal Mach Intell. 2017 Aug;39(8):1662-1674. doi: 10.1109/TPAMI.2016.2608882. Epub 2016 Sep 13.
2
MSL: A Measure to Evaluate Three-dimensional Patterns in Gene Expression Data.MSL:一种评估基因表达数据中三维模式的方法。
Evol Bioinform Online. 2015 Jun 23;11:121-35. doi: 10.4137/EBO.S25822. eCollection 2015.
3
Multiobjective triclustering of time-series transcriptome data reveals key genes of biological processes.时间序列转录组数据的多目标三聚类揭示生物过程的关键基因。
BMC Bioinformatics. 2015 Jun 26;16:200. doi: 10.1186/s12859-015-0635-8.
4
Mining 3D patterns from gene expression temporal data: a new tricluster evaluation measure.从基因表达时间数据中挖掘三维模式:一种新的三聚类评估方法。
ScientificWorldJournal. 2014;2014:624371. doi: 10.1155/2014/624371. Epub 2014 Mar 31.
5
Coexpression and coregulation analysis of time-series gene expression data in estrogen-induced breast cancer cell.雌激素诱导的乳腺癌细胞中时间序列基因表达数据的共表达和共调控分析
Algorithms Mol Biol. 2013 Mar 23;8(1):9. doi: 10.1186/1748-7188-8-9.
6
Mining biological information from 3D short time-series gene expression data: the OPTricluster algorithm.从 3D 短期基因表达时间序列数据中挖掘生物信息:OPTricluster 算法。
BMC Bioinformatics. 2012 Apr 4;13:54. doi: 10.1186/1471-2105-13-54.
7
A coclustering approach for mining large protein-protein interaction networks.一种用于挖掘大规模蛋白质相互作用网络的 coclustering 方法。
IEEE/ACM Trans Comput Biol Bioinform. 2012 May-Jun;9(3):717-30. doi: 10.1109/TCBB.2011.158.
8
Searching for functional gene modules with interaction component models.使用相互作用成分模型寻找功能基因模块。
BMC Syst Biol. 2010 Jan 25;4:4. doi: 10.1186/1752-0509-4-4.
9
Efficiently mining time-delayed gene expression patterns.高效挖掘时间延迟基因表达模式。
IEEE Trans Syst Man Cybern B Cybern. 2010 Apr;40(2):400-11. doi: 10.1109/TSMCB.2009.2025564. Epub 2009 Oct 30.
10
KEGG for representation and analysis of molecular networks involving diseases and drugs.KEGG 用于表示和分析涉及疾病和药物的分子网络。
Nucleic Acids Res. 2010 Jan;38(Database issue):D355-60. doi: 10.1093/nar/gkp896. Epub 2009 Oct 30.