csdR, an R package for differential co-expression analysis.

Affiliations

Department of Biotechnology and Food Science, NTNU - Norwegian University of Science and Technology, Trondheim, Norway.

K.G. Jebsen Center for Genetic Epidemiology, Department of Public Health and General Practice, NTNU - Norwegian University of Science and Technology, Trondheim, Norway.

Publication information

BMC Bioinformatics. 2022 Feb 19;23(1):79. doi: 10.1186/s12859-022-04605-1.

DOI:10.1186/s12859-022-04605-1

PMID:35183100

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC8858518/

Abstract

BACKGROUND

Differential co-expression network analysis has become an important tool for understanding biological phenotypes and diseases. The CSD algorithm generates differential co-expression networks by comparing gene co-expression between two conditions. Each gene pair is assigned conserved (C), specific (S) and differentiated (D) scores based on how its co-expression compares between the two conditions. The result of the procedure is a network where the nodes are genes and the links are the gene pairs with the highest C-, S-, and D-scores. However, the existing CSD implementations suffer from poor computational performance, difficult user procedures and lack of documentation.
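The scoring and link-selection steps described above can be sketched in a few lines of Python. This is an illustrative sketch only, not the csdR API: the abstract does not give the score formulas, so the ones below are an assumption based on the usual formulation in the CSD literature (correlations combined and scaled by their estimated variances). The function names `csd_scores` and `top_links`, the input layout, and the toy numbers are all hypothetical.

```python
import math


def csd_scores(rho1, var1, rho2, var2):
    """Return (C, S, D) for one gene pair.

    rho1, rho2: co-expression (correlation) of the pair in conditions 1 and 2.
    var1, var2: estimated variances of those correlations (must not both be 0).
    The formulas are an assumption taken from the CSD literature, not the abstract.
    """
    scale = math.sqrt(var1 + var2)
    c = abs(rho1 + rho2) / scale                            # same sign, both strong
    s = abs(abs(rho1) - abs(rho2)) / scale                  # strong in one condition only
    d = (abs(rho1) + abs(rho2) - abs(rho1 + rho2)) / scale  # strong, opposite signs
    return c, s, d


def top_links(pair_stats, n_top):
    """Pick the n_top highest-scoring pairs for each of C, S and D.

    pair_stats maps (gene_a, gene_b) -> (rho1, var1, rho2, var2).
    Returns a dict mapping (gene_a, gene_b) -> set of link types, e.g. {"C"}.
    """
    scores = {pair: csd_scores(*stats) for pair, stats in pair_stats.items()}
    links = {}
    for index, label in enumerate("CSD"):
        ranked = sorted(scores, key=lambda pair: scores[pair][index], reverse=True)
        for pair in ranked[:n_top]:
            links.setdefault(pair, set()).add(label)
    return links


# Toy input with made-up numbers, purely for illustration.
toy = {
    ("g1", "g2"): (0.9, 0.01, 0.8, 0.01),    # co-expressed in both conditions
    ("g1", "g3"): (0.9, 0.01, 0.05, 0.01),   # co-expressed in condition 1 only
    ("g2", "g3"): (0.8, 0.01, -0.7, 0.01),   # sign flips between conditions
}
network = top_links(toy, n_top=1)
```

With this toy input, the pair that is co-expressed in both conditions becomes the C link, the pair co-expressed in only one condition becomes the S link, and the pair whose correlation changes sign becomes the D link. A real analysis would need a per-pair variance estimate for each condition, which this sketch takes as given.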

RESULTS

We created the R package csdR, aiming for good performance together with ease of use, sufficient documentation, and the ability to work well with other tools for data analysis. csdR was benchmarked on a realistic dataset with 20,645 genes. After verifying that the chosen number of iterations gave sufficient robustness, we tested the performance against the two existing CSD implementations. csdR was superior in performance to one of the implementations, whereas the other did not run. Our implementation can utilize multiple processing cores. However, we were unable to achieve a parallel speedup of more than about 2.7, with saturation reached at about 10 cores.
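To put the reported speedup in perspective, the sketch below applies Amdahl's law to the two numbers given above (a speedup of about 2.7 at about 10 cores). This is not the authors' analysis: the abstract does not say what limits the scaling, and Amdahl's law is assumed here only as a simple model. The function names are hypothetical.

```python
def amdahl_speedup(parallel_fraction, n_cores):
    """Speedup predicted by Amdahl's law for a given parallel fraction."""
    return 1.0 / ((1.0 - parallel_fraction) + parallel_fraction / n_cores)


def parallel_fraction_from(speedup, n_cores):
    """Invert Amdahl's law: parallel fraction implied by a speedup on n_cores."""
    return (1.0 - 1.0 / speedup) / (1.0 - 1.0 / n_cores)


observed_speedup = 2.7  # figure reported in the abstract
cores = 10              # "about 10 cores" in the abstract

# Parallel fraction that would explain the observation under this model.
p = parallel_fraction_from(observed_speedup, cores)

# Upper bound on speedup with unlimited cores under the same model.
ceiling = 1.0 / (1.0 - p)

print(f"implied parallel fraction: {p:.2f}")
print(f"model ceiling on speedup:  {ceiling:.2f}")
```

Under this assumed model, roughly 70% of the run time would be parallelizable, and adding cores beyond 10 would raise the speedup only slightly. Other limits, such as memory bandwidth, could produce the same observation.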

CONCLUSION

The results suggest that csdR is a useful tool for differential co-expression analysis and is able to generate robust results within a workday on datasets of realistic sizes when run on a workstation or compute server.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/64ab/8858518/2d63261b8531/12859_2022_4605_Fig1_HTML.jpg
