• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种改进的最大信息系数算法及其应用。

An improved algorithm for the maximal information coefficient and its application.

作者信息

Cao Dan, Chen Yuan, Chen Jin, Zhang Hongyan, Yuan Zheming

机构信息

Hunan Engineering and Technology Research Centre for Agricultural Big Data Analysis and Decision-making, Hunan Agricultural University, Changsha 410000, People's Republic of China.

Orient Science and Technology College of Hunan Agricultural University, Changsha 410000, Hunan, People's Republic of China.

出版信息

R Soc Open Sci. 2021 Feb 10;8(2):201424. doi: 10.1098/rsos.201424.

DOI:10.1098/rsos.201424
PMID:33972855
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8074658/
Abstract

The maximal information coefficient (MIC) captures both linear and nonlinear correlations between variable pairs. In this paper, we proposed the BackMIC algorithm for MIC estimation. The BackMIC algorithm adds a searching back process on the equipartitioned axis to obtain a better grid partition than the original implementation algorithm ApproxMaxMI. And similar to the ChiMIC algorithm, it terminates the grid search process by the -test instead of the maximum number of bins B(, ). Results on simulated data show that the BackMIC algorithm maintains the generality of MIC, and gives more reasonable grid partition and MIC values for independent and dependent variable pairs under comparable running times. Moreover, it is robust under different in B(, ). MIC calculated by the BackMIC algorithm reveals an improvement in statistical power and equitability. We applied (1-MIC) as the distance measurement in the K-means algorithm to perform a clustering of the cancer/normal samples. The results on four cancer datasets demonstrated that the MIC values calculated by the BackMIC algorithm can obtain better clustering results, indicating the correlations between samples measured by the BackMIC algorithm were more credible than those measured by other algorithms.

摘要

最大信息系数(MIC)能够捕捉变量对之间的线性和非线性相关性。在本文中,我们提出了用于MIC估计的BackMIC算法。BackMIC算法在等分区轴上添加了一个回溯搜索过程,以获得比原始实现算法ApproxMaxMI更好的网格划分。并且与ChiMIC算法类似,它通过t检验而非最大箱数B(n, k)来终止网格搜索过程。模拟数据结果表明,BackMIC算法保持了MIC的通用性,并且在可比的运行时间下,为独立和相关变量对给出了更合理的网格划分和MIC值。此外,它在B(n, k)的不同k值下具有鲁棒性。通过BackMIC算法计算得到的MIC在统计功效和公平性方面有所改进。我们将(1 - MIC)用作K均值算法中的距离度量,对癌症/正常样本进行聚类。四个癌症数据集的结果表明,由BackMIC算法计算得到的MIC值能够获得更好的聚类结果,这表明BackMIC算法所测量的样本之间的相关性比其他算法所测量的更可信。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9ad/8074658/e46b1b70ae94/rsos201424f08.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9ad/8074658/5dd37e43b422/rsos201424f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9ad/8074658/1b7b378434fa/rsos201424f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9ad/8074658/29b1df167433/rsos201424f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9ad/8074658/adcf8e912b96/rsos201424f04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9ad/8074658/509a4298fe40/rsos201424f05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9ad/8074658/29260235e8ae/rsos201424f06.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9ad/8074658/7c7dccf4792a/rsos201424f07.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9ad/8074658/e46b1b70ae94/rsos201424f08.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9ad/8074658/5dd37e43b422/rsos201424f01.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9ad/8074658/1b7b378434fa/rsos201424f02.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9ad/8074658/29b1df167433/rsos201424f03.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9ad/8074658/adcf8e912b96/rsos201424f04.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9ad/8074658/509a4298fe40/rsos201424f05.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9ad/8074658/29260235e8ae/rsos201424f06.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9ad/8074658/7c7dccf4792a/rsos201424f07.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b9ad/8074658/e46b1b70ae94/rsos201424f08.jpg

相似文献

1
An improved algorithm for the maximal information coefficient and its application.一种改进的最大信息系数算法及其应用。
R Soc Open Sci. 2021 Feb 10;8(2):201424. doi: 10.1098/rsos.201424.
2
A New Algorithm to Optimize Maximal Information Coefficient.一种优化最大信息系数的新算法。
PLoS One. 2016 Jun 22;11(6):e0157567. doi: 10.1371/journal.pone.0157567. eCollection 2016.
3
Analysing large biological data sets with an improved algorithm for MIC.使用一种改进的最小抑菌浓度(MIC)算法分析大型生物数据集。
Int J Data Min Bioinform. 2015;13(2):158-70. doi: 10.1504/ijdmb.2015.071548.
4
A novel algorithm for the precise calculation of the maximal information coefficient.一种用于精确计算最大信息系数的新算法。
Sci Rep. 2014 Oct 17;4:6662. doi: 10.1038/srep06662.
5
SuperMIC: Analyzing Large Biological Datasets in Bioinformatics with Maximal Information Coefficient.SuperMIC:使用最大信息系数分析生物信息学中的大型生物数据集。
IEEE/ACM Trans Comput Biol Bioinform. 2017 Jul-Aug;14(4):783-795. doi: 10.1109/TCBB.2016.2550430. Epub 2016 Apr 5.
6
MICRAT: a novel algorithm for inferring gene regulatory networks using time series gene expression data.MICRAT:一种使用时间序列基因表达数据推断基因调控网络的新算法。
BMC Syst Biol. 2018 Dec 14;12(Suppl 7):115. doi: 10.1186/s12918-018-0635-1.
7
Equitability, mutual information, and the maximal information coefficient.公平性、互信息和最大信息系数。
Proc Natl Acad Sci U S A. 2014 Mar 4;111(9):3354-9. doi: 10.1073/pnas.1309933111. Epub 2014 Feb 18.
8
Uniform Partitioning of Data Grid for Association Detection.数据网格的一致划分用于关联检测。
IEEE Trans Pattern Anal Mach Intell. 2022 Feb;44(2):1098-1107. doi: 10.1109/TPAMI.2020.3029487. Epub 2022 Jan 7.
9
A novel Chinese herbal medicine clustering algorithm via artificial bee colony optimization.一种基于人工蜂群优化的中草药聚类算法。
Artif Intell Med. 2019 Nov;101:101760. doi: 10.1016/j.artmed.2019.101760. Epub 2019 Nov 10.
10
Robust multi-scale clustering of large DNA microarray datasets with the consensus algorithm.使用一致性算法对大型DNA微阵列数据集进行稳健的多尺度聚类
Bioinformatics. 2006 Jan 1;22(1):58-67. doi: 10.1093/bioinformatics/bti746. Epub 2005 Oct 27.

引用本文的文献

1
Dynamic brain-heart-gut coupling during sleep: a continuous physiological signal analysis.睡眠期间动态脑-心-肠耦合:连续生理信号分析
Front Neurosci. 2025 May 26;19:1594759. doi: 10.3389/fnins.2025.1594759. eCollection 2025.
2
An efficient, not-only-linear correlation coefficient based on clustering.一种基于聚类的高效、非线性相关系数。
Cell Syst. 2024 Sep 18;15(9):854-868.e3. doi: 10.1016/j.cels.2024.08.005. Epub 2024 Sep 6.
3
MICFuzzy: A maximal information content based fuzzy approach for reconstructing genetic networks.

本文引用的文献

1
Structural Entropy: Monitoring Correlation-Based Networks Over Time With Application To Financial Markets.结构熵:通过应用于金融市场的时间相关网络关联监测。
Sci Rep. 2019 Jul 25;9(1):10832. doi: 10.1038/s41598-019-47210-8.
2
MICOP: Maximal information coefficient-based oscillation prediction to detect biological rhythms in proteomics data.MICOP:基于最大信息系数的振荡预测检测蛋白质组学数据中的生物节律。
BMC Bioinformatics. 2018 Jun 28;19(1):249. doi: 10.1186/s12859-018-2257-4.
3
A filter feature selection method based on the Maximal Information Coefficient and Gram-Schmidt Orthogonalization for biomedical data mining.
MICFuzzy:一种基于最大信息含量的模糊方法,用于重建遗传网络。
PLoS One. 2023 Jul 7;18(7):e0288174. doi: 10.1371/journal.pone.0288174. eCollection 2023.
4
A hybrid feature selection algorithm and its application in bioinformatics.一种混合特征选择算法及其在生物信息学中的应用。
PeerJ Comput Sci. 2022 Mar 22;8:e933. doi: 10.7717/peerj-cs.933. eCollection 2022.
基于最大信息系数和 Gram-Schmidt 正交化的生物医学数据挖掘过滤特征选择方法。
Comput Biol Med. 2017 Oct 1;89:264-274. doi: 10.1016/j.compbiomed.2017.08.021. Epub 2017 Aug 24.
4
A New Algorithm to Optimize Maximal Information Coefficient.一种优化最大信息系数的新算法。
PLoS One. 2016 Jun 22;11(6):e0157567. doi: 10.1371/journal.pone.0157567. eCollection 2016.
5
Analysis of k-means clustering approach on the breast cancer Wisconsin dataset.基于威斯康星乳腺癌数据集的k均值聚类方法分析
Int J Comput Assist Radiol Surg. 2016 Nov;11(11):2033-2047. doi: 10.1007/s11548-016-1437-9. Epub 2016 Jun 16.
6
A novel method for identifying SNP disease association based on maximal information coefficient.一种基于最大信息系数识别单核苷酸多态性(SNP)与疾病关联的新方法。
Genet Mol Res. 2014 Dec 19;13(4):10863-77. doi: 10.4238/2014.December.19.7.
7
A novel algorithm for the precise calculation of the maximal information coefficient.一种用于精确计算最大信息系数的新算法。
Sci Rep. 2014 Oct 17;4:6662. doi: 10.1038/srep06662.
8
Cleaning up the record on the maximal information coefficient and equitability.清理关于最大信息系数和公平性的记录。
Proc Natl Acad Sci U S A. 2014 Aug 19;111(33):E3362-3. doi: 10.1073/pnas.1408920111.
9
Equitability, mutual information, and the maximal information coefficient.公平性、互信息和最大信息系数。
Proc Natl Acad Sci U S A. 2014 Mar 4;111(9):3354-9. doi: 10.1073/pnas.1309933111. Epub 2014 Feb 18.
10
Comparison of global gene expression of gastric cardia and noncardia cancers from a high-risk population in china.中国高危人群贲门癌和非贲门癌的全球基因表达比较。
PLoS One. 2013 May 22;8(5):e63826. doi: 10.1371/journal.pone.0063826. Print 2013.