RapidMic: Rapid Computation of the Maximal Information Coefficient.

Affiliations

Institute of Information Research, Southwest Jiaotong University, Chengdu, China.

School of Mathematics, Southwest Jiaotong University, Chengdu, China.

Publication information

Evol Bioinform Online. 2014 Feb 6;10:11-6. doi: 10.4137/EBO.S13121. eCollection 2014.

Abstract

To discover relationships and associations rapidly in large-scale datasets, we propose a cross-platform tool for the rapid computation of the maximal information coefficient based on parallel computing methods. Through parallel processing, the provided tool can effectively analyze large-scale biological datasets with a markedly reduced computing time. The experimental results show that the proposed tool is notably fast, and is able to perform an all-pairs analysis of a large biological dataset using a normal computer. The source code and guidelines can be downloaded from https://github.com/HelloWorldCN/RapidMic.
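
The all-pairs analysis described above can be illustrated with a minimal sketch. This is not RapidMic's implementation: the names `abs_pearson` and `all_pairs_scores` are hypothetical, and absolute Pearson correlation is used only as a stand-in score so that the example stays self-contained. A real analysis would pass a MIC routine as `score`. A thread pool is used here for portability; for a CPU-bound pure-Python score, `concurrent.futures.ProcessPoolExecutor` would likely be needed to obtain an actual speedup.

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import combinations
from math import sqrt


def abs_pearson(x, y):
    """Stand-in association score (NOT MIC): absolute Pearson correlation."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant variable carries no association
    return abs(cov) / sqrt(var_x * var_y)


def all_pairs_scores(variables, score=abs_pearson, max_workers=4):
    """Score every unordered pair of variables, spreading pairs over workers.

    `variables` maps a variable name to a list of measurements. Each pair is
    independent of the others, which is what makes the task easy to parallelize.
    """
    names = sorted(variables)
    pairs = list(combinations(names, 2))  # n * (n - 1) / 2 pairs

    def score_pair(pair):
        a, b = pair
        return score(variables[a], variables[b])

    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        scores = list(pool.map(score_pair, pairs))  # map preserves pair order
    return dict(zip(pairs, scores))


# Toy data (made up for illustration only).
data = {
    "g1": [1.0, 2.0, 3.0, 4.0],
    "g2": [2.0, 4.0, 6.0, 8.0],
    "g3": [1.0, -1.0, 1.0, -1.0],
}
result = all_pairs_scores(data)
```

Because the number of pairs grows quadratically with the number of variables, distributing independent pair computations across workers is the natural unit of parallelism for this kind of analysis.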
